Prioritizing Test Inputs for Deep Neural Networks via Mutation Analysis

被引:67
|
作者
Wang, Zan [1 ]
You, Hanmo [1 ]
Chen, Junjie [1 ]
Zhang, Yingyi [1 ]
Dong, Xuyuan [2 ]
Zhang, Wenbin [2 ]
机构
[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China
[2] Tianjin Univ, Informat & Network Ctr, Tianjin, Peoples R China
基金
中国国家自然科学基金;
关键词
Test Prioritization; Deep Neural Network; Mutation; Label; Deep Learning Testing; SELECTION;
D O I
10.1109/ICSE43902.2021.00046
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Deep Neural Network (DNN) testing is one of the most widely-used ways to guarantee the quality of DNNs. However, labeling test inputs to check the correctness of DNN prediction is very costly, which could largely affect the efficiency of DNN testing, even the whole process of DNN development. To relieve the labeling-cost problem, we propose a novel test input prioritization approach (called PRIMA) for DNNs via intelligent mutation analysis in order to label more hug-revealing test inputs earlier for a limited time, which facilitates to improve the efficiency of DNN testing. PRIMA is based on the key insight: a test input that is able to kill many mutated models and produce different prediction results with many mutated inputs, is more likely to reseal DNN bugs, and thus it should be prioritized higher. After obtaining a number of mutation results from a series of our designed model and input mutation rules for each test input, PRIMA further incorporates learning-to-rank (a kind of supervised machine learning to solve ranking problems) to intelligently combine these mutation results for effective test input prioritization. We conducted an extensive study based on M popular subjects by carefully considering their diversity from five dimensions (i.e, different domains of test inputs. different DNN tasks, different network structures, different types of test inputs, and different training scenarios). Our experimental results demonstrate the effectiveness of PRIMA, significantly outperforming the state-of-the-art approaches (with the average improvement of 8.50%similar to 131.01% in terms of prioritization effectiveness). In particular, we have applied PRIMA to the practical autonomous-vehicle testing in a large motor company, and the results on 4 real-world scene-recognition models in autonomous vehicles further confirm the practicability of PRIMA.
引用
收藏
页码:397 / 409
页数:13
相关论文
共 50 条
  • [1] Efficient generation of valid test inputs for deep neural networks via gradient search
    Jiang, Zhouxian
    Li, Honghui
    Wang, Rui
    [J]. JOURNAL OF SOFTWARE-EVOLUTION AND PROCESS, 2024, 36 (04)
  • [2] Predictive Mutation Analysis of Test Case Prioritization for Deep Neural Networks
    Wei, Zhengyuan
    Wang, Haipeng
    Ashraf, Imran
    Chan, W. K.
    [J]. 2022 IEEE 22ND INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY, QRS, 2022, : 682 - 693
  • [3] Synthesizing the preferred inputs for neurons in neural networks via deep generator networks
    Anh Nguyen
    Dosovitskiy, Alexey
    Yosinski, Jason
    Brox, Thomas
    Clune, Jeff
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [4] Robustness evaluation for deep neural networks via mutation decision boundaries analysis
    Lin, Renhao
    Zhou, Qinglei
    Wu, Bin
    Nan, Xiaofei
    [J]. INFORMATION SCIENCES, 2022, 601 : 147 - 161
  • [5] DeepAbstraction: 2-Level Prioritization for Unlabeled Test Inputs in Deep Neural Networks
    Al-Qadasi, Hamzah
    Wu, Changshun
    Falcone, Ylie
    Bensalem, Saddek
    [J]. 2022 FOURTH IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE TESTING (AITEST 2022), 2022, : 64 - 71
  • [6] On Detection of Out of Distribution Inputs in Deep Neural Networks
    Jha, Susmit
    Roy, Anirban
    [J]. 2021 IEEE THIRD INTERNATIONAL CONFERENCE ON COGNITIVE MACHINE INTELLIGENCE (COGMI 2021), 2021, : 282 - 288
  • [7] Deep Component Analysis via Alternating Direction Neural Networks
    Murdock, Calvin
    Chang, Ming-Fang
    Lucey, Simon
    [J]. COMPUTER VISION - ECCV 2018, PT 15, 2018, 11219 : 851 - 867
  • [8] Linear MIMO Precoders With Finite Alphabet Inputs via Stochastic Optimization and Deep Neural Networks (DNNs)
    Jing, Shusen
    Xiao, Chengshan
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2021, 69 : 4269 - 4281
  • [9] Analysis of Bank Leverage via Dynamical Systems and Deep Neural Networks
    Lillo, Fabrizio
    Livieri, Giulia
    Marmi, Stefano
    Solomko, Anton
    Vaienti, Sandro
    [J]. SIAM JOURNAL ON FINANCIAL MATHEMATICS, 2023, 14 (02): : 598 - 643
  • [10] Adaptive Test Selection for Deep Neural Networks
    Gao, Xinyu
    Feng, Yang
    Yin, Yining
    Liu, Zixi
    Chen, Zhenyu
    Xu, Baowen
    [J]. 2022 ACM/IEEE 44TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2022), 2022, : 73 - 85