GraphPrior: Mutation-based Test Input Prioritization for Graph Neural Networks

被引：5

作者：

Dang, Xueqi ^{[1
]}

Li, Yinghua ^{[1
]}

Papadakis, Mike ^{[1
]}

Klein, Jacques ^{[1
]}

Bissyande, Tegawende F. ^{[1
]}

Le Traon, Yves ^{[1
]}

机构：

[1] Univ Luxembourg, Luxembourg, Luxembourg

来源：

ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY | 2024年 / 33卷 / 01期

基金：

欧洲研究理事会;

关键词：

Test Input Prioritization; Graph Neural Networks; Mutation; Labelling;

D O I：

10.1145/3607191

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Graph Neural Networks (GNNs) have achieved promising performance in a variety of practical applications. Similar to traditional DNNs, GNNs could exhibit incorrect behavior that may lead to severe consequences, and thus testing is necessary and crucial. However, labeling all the test inputs for GNNs can be costly and time-consuming, especially when dealing with large and complex graphs, which seriously affects the efficiency of GNN testing. Existing studies have focused on test prioritization for DNNs, which aims to identify and prioritize fault-revealing tests (i.e., test inputs that are more likely to be misclassified) to detect system bugs earlier in a limited time. Although some DNN prioritization approaches have been demonstrated effective, there is a significant problem when applying them to GNNs: They do not take into account the connections (edges) between GNN test inputs (nodes), which play a significant role in GNN inference. In general, DNN test inputs are independent of each other, while GNN test inputs are usually represented as a graph with complex relationships between each test. In this article, we propose GraphPrior (GNN-oriented Test Prioritization), a set of approaches to prioritize test inputs specifically for GNNs via mutation analysis. Inspired by mutation testing in traditional software engineering, in which test suites are evaluated based on the mutants they kill, GraphPrior generates mutated models for GNNs and regards test inputs that kill many mutated models as more likely to be misclassified. Then, GraphPrior leverages the mutation results in two ways, killing-based and feature-based methods. When scoring a test input, the killing-based method considers each mutated model equally important, while feature-based methods learn different importance for each mutated model through ranking models. Finally, GraphPrior ranks all the test inputs based on their scores. We conducted an extensive study based on 604 subjects to evaluate GraphPrior on both natural and adversarial test inputs. The results demonstrate that KMGP, the killing-based GraphPrior approach, outperforms the compared approaches in a majority of cases, with an average improvement of 4.76%similar to 49.60% in terms of APFD. Furthermore, the feature-based GraphPrior approach, RFGP, performs the best among all the Graph-Prior approaches. On adversarial test inputs, RFGP outperforms the compared approaches across different adversarial attacks, with the average improvement of 2.95%similar to 46.69%.

引用

页数：40

共 50 条

[1] Test Input Prioritization for Graph Neural Networks
Li, Yinghua
Dang, Xueqi
Pian, Weiguo
Habib, Andrew
Klein, Jacques
Bissyande, Tegawende F.
[J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2024, 50 (06) : 1396 - 1424
[2] Empirical evaluation of mutation-based test case prioritization techniques
Shin, Donghwan
Yoo, Shin
Papadakis, Mike
Bae, Doo-Hwan
[J]. SOFTWARE TESTING VERIFICATION & RELIABILITY, 2019, 29 (1-2):
[3] Mutation-based Test-Case Prioritization in Software Evolution
Lou, Yiling
Hao, Dan
Zhang, Lu
[J]. 2015 IEEE 26TH INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING (ISSRE), 2015, : 46 - 57
[4] TPFL: Test Input Prioritization for Deep Neural Networks Based on Fault Localization
Tao, Yali
Tao, Chuanqi
Guo, Hongjing
Li, Bohan
[J]. ADVANCED DATA MINING AND APPLICATIONS (ADMA 2022), PT I, 2022, 13725 : 368 - 383
[5] Mutation-based Fault Localization of Deep Neural Networks
Ghanbari, Ali
Thomas, Deepak-George
Arshad, Muhammad Arbab
Rajan, Hridesh
[J]. 2023 38TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING, ASE, 2023, : 1301 - 1313
[6] MBEANN: Mutation-based evolving artificial neural networks
Ohkura, Kazuhiro
Yasuda, Toshiyuki
Kawamatsu, Yuichi
Matsumura, Yoshiyuki
Ueda, Kanji
[J]. ADVANCES IN ARTIFICIAL LIFE, PROCEEDINGS, 2007, 4648 : 936 - +
[7] Input Prioritization for Testing Neural Networks
Byun, Taejoon
Sharma, Vaibhav
Vijayakumar, Abhishek
Rayadurgam, Sanjai
Cofer, Darren
[J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE TESTING (AITEST), 2019, : 63 - 70
[8] Mutation-based genetic neural network
Palmes, PP
Hayasaka, T
Usui, S
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2005, 16 (03): : 587 - 600
[9] Predictive Mutation Analysis of Test Case Prioritization for Deep Neural Networks
Wei, Zhengyuan
Wang, Haipeng
Ashraf, Imran
Chan, W. K.
[J]. 2022 IEEE 22ND INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY, QRS, 2022, : 682 - 693
[10] Mutation-Based Graph Inference for Fault Localization
Musco, Vincenzo
Monperrus, Martin
Preux, Philippe
[J]. 2016 IEEE 16TH INTERNATIONAL WORKING CONFERENCE ON SOURCE CODE ANALYSIS AND MANIPULATION (SCAM), 2016, : 97 - 106

← 1 2 3 4 5 →