GraphPrior: Mutation-based Test Input Prioritization for Graph Neural Networks

被引:5
|
作者
Dang, Xueqi [1 ]
Li, Yinghua [1 ]
Papadakis, Mike [1 ]
Klein, Jacques [1 ]
Bissyande, Tegawende F. [1 ]
Le Traon, Yves [1 ]
机构
[1] Univ Luxembourg, Luxembourg, Luxembourg
基金
欧洲研究理事会;
关键词
Test Input Prioritization; Graph Neural Networks; Mutation; Labelling;
D O I
10.1145/3607191
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Graph Neural Networks (GNNs) have achieved promising performance in a variety of practical applications. Similar to traditional DNNs, GNNs could exhibit incorrect behavior that may lead to severe consequences, and thus testing is necessary and crucial. However, labeling all the test inputs for GNNs can be costly and time-consuming, especially when dealing with large and complex graphs, which seriously affects the efficiency of GNN testing. Existing studies have focused on test prioritization for DNNs, which aims to identify and prioritize fault-revealing tests (i.e., test inputs that are more likely to be misclassified) to detect system bugs earlier in a limited time. Although some DNN prioritization approaches have been demonstrated effective, there is a significant problem when applying them to GNNs: They do not take into account the connections (edges) between GNN test inputs (nodes), which play a significant role in GNN inference. In general, DNN test inputs are independent of each other, while GNN test inputs are usually represented as a graph with complex relationships between each test. In this article, we propose GraphPrior (GNN-oriented Test Prioritization), a set of approaches to prioritize test inputs specifically for GNNs via mutation analysis. Inspired by mutation testing in traditional software engineering, in which test suites are evaluated based on the mutants they kill, GraphPrior generates mutated models for GNNs and regards test inputs that kill many mutated models as more likely to be misclassified. Then, GraphPrior leverages the mutation results in two ways, killing-based and feature-based methods. When scoring a test input, the killing-based method considers each mutated model equally important, while feature-based methods learn different importance for each mutated model through ranking models. Finally, GraphPrior ranks all the test inputs based on their scores. We conducted an extensive study based on 604 subjects to evaluate GraphPrior on both natural and adversarial test inputs. The results demonstrate that KMGP, the killing-based GraphPrior approach, outperforms the compared approaches in a majority of cases, with an average improvement of 4.76%similar to 49.60% in terms of APFD. Furthermore, the feature-based GraphPrior approach, RFGP, performs the best among all the Graph-Prior approaches. On adversarial test inputs, RFGP outperforms the compared approaches across different adversarial attacks, with the average improvement of 2.95%similar to 46.69%.
引用
收藏
页数:40
相关论文
共 50 条
  • [1] Test Input Prioritization for Graph Neural Networks
    Li, Yinghua
    Dang, Xueqi
    Pian, Weiguo
    Habib, Andrew
    Klein, Jacques
    Bissyande, Tegawende F.
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2024, 50 (06) : 1396 - 1424
  • [2] Empirical evaluation of mutation-based test case prioritization techniques
    Shin, Donghwan
    Yoo, Shin
    Papadakis, Mike
    Bae, Doo-Hwan
    [J]. SOFTWARE TESTING VERIFICATION & RELIABILITY, 2019, 29 (1-2):
  • [3] Mutation-based Test-Case Prioritization in Software Evolution
    Lou, Yiling
    Hao, Dan
    Zhang, Lu
    [J]. 2015 IEEE 26TH INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING (ISSRE), 2015, : 46 - 57
  • [4] TPFL: Test Input Prioritization for Deep Neural Networks Based on Fault Localization
    Tao, Yali
    Tao, Chuanqi
    Guo, Hongjing
    Li, Bohan
    [J]. ADVANCED DATA MINING AND APPLICATIONS (ADMA 2022), PT I, 2022, 13725 : 368 - 383
  • [5] Mutation-based Fault Localization of Deep Neural Networks
    Ghanbari, Ali
    Thomas, Deepak-George
    Arshad, Muhammad Arbab
    Rajan, Hridesh
    [J]. 2023 38TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING, ASE, 2023, : 1301 - 1313
  • [6] MBEANN: Mutation-based evolving artificial neural networks
    Ohkura, Kazuhiro
    Yasuda, Toshiyuki
    Kawamatsu, Yuichi
    Matsumura, Yoshiyuki
    Ueda, Kanji
    [J]. ADVANCES IN ARTIFICIAL LIFE, PROCEEDINGS, 2007, 4648 : 936 - +
  • [7] Input Prioritization for Testing Neural Networks
    Byun, Taejoon
    Sharma, Vaibhav
    Vijayakumar, Abhishek
    Rayadurgam, Sanjai
    Cofer, Darren
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE TESTING (AITEST), 2019, : 63 - 70
  • [8] Mutation-based genetic neural network
    Palmes, PP
    Hayasaka, T
    Usui, S
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2005, 16 (03): : 587 - 600
  • [9] Predictive Mutation Analysis of Test Case Prioritization for Deep Neural Networks
    Wei, Zhengyuan
    Wang, Haipeng
    Ashraf, Imran
    Chan, W. K.
    [J]. 2022 IEEE 22ND INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY, QRS, 2022, : 682 - 693
  • [10] Mutation-Based Graph Inference for Fault Localization
    Musco, Vincenzo
    Monperrus, Martin
    Preux, Philippe
    [J]. 2016 IEEE 16TH INTERNATIONAL WORKING CONFERENCE ON SOURCE CODE ANALYSIS AND MANIPULATION (SCAM), 2016, : 97 - 106