Prioritizing Test Inputs for Deep Neural Networks via Mutation Analysis

Cited by: 67

Authors:

Wang, Zan [1]

You, Hanmo [1]

Chen, Junjie [1]

Zhang, Yingyi [1]

Dong, Xuyuan [2]

Zhang, Wenbin [2]

Affiliations:

[1] Tianjin University, College of Intelligence and Computing, Tianjin, People's Republic of China

[2] Tianjin University, Information and Network Center, Tianjin, People's Republic of China

Source:

2021 IEEE/ACM 43rd International Conference on Software Engineering (ICSE 2021) | 2021

Funding:

National Natural Science Foundation of China

Keywords:

Test Prioritization; Deep Neural Network; Mutation; Label; Deep Learning Testing; Selection

DOI:

10.1109/ICSE43902.2021.00046

Chinese Library Classification number:

TP31 [Computer Software]

Discipline classification codes:

081202; 0835

Abstract:

Deep Neural Network (DNN) testing is one of the most widely used ways to guarantee the quality of DNNs. However, labeling test inputs to check the correctness of DNN predictions is very costly, which can significantly reduce the efficiency of DNN testing and even of the whole DNN development process. To relieve the labeling-cost problem, we propose a novel test input prioritization approach (called PRIMA) for DNNs via intelligent mutation analysis, in order to label more bug-revealing test inputs earlier within a limited time, which helps improve the efficiency of DNN testing. PRIMA is based on a key insight: a test input that is able to kill many mutated models and produce different prediction results with many mutated inputs is more likely to reveal DNN bugs, and thus it should be prioritized higher. After obtaining a number of mutation results from a series of our designed model and input mutation rules for each test input, PRIMA further incorporates learning-to-rank (a kind of supervised machine learning for solving ranking problems) to intelligently combine these mutation results for effective test input prioritization. We conducted an extensive study based on 36 popular subjects by carefully considering their diversity from five dimensions (i.e., different domains of test inputs, different DNN tasks, different network structures, different types of test inputs, and different training scenarios). Our experimental results demonstrate the effectiveness of PRIMA, which significantly outperforms the state-of-the-art approaches (with an average improvement of 8.50%~131.01% in terms of prioritization effectiveness). In particular, we have applied PRIMA to practical autonomous-vehicle testing in a large motor company, and the results on 4 real-world scene-recognition models in autonomous vehicles further confirm the practicability of PRIMA.
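
The key insight in the abstract can be illustrated with a minimal sketch. It is not PRIMA's implementation: the function names, the toy prediction data, and the scoring rule are all assumptions made for illustration. In particular, the score here is a plain sum of two disagreement counts, whereas the abstract states that PRIMA combines its mutation results with a learning-to-rank model, which this sketch does not reproduce.

```python
from typing import List, Sequence


def disagreement_counts(
    orig_pred: Sequence[int],
    mutant_preds: Sequence[Sequence[int]],
) -> List[int]:
    """For each test input, count the mutants whose predicted label
    differs from the original model's label on the original input."""
    return [
        sum(1 for preds in mutant_preds if preds[i] != orig_pred[i])
        for i in range(len(orig_pred))
    ]


def prioritize(
    orig_pred: Sequence[int],
    model_mutant_preds: Sequence[Sequence[int]],
    input_mutant_preds: Sequence[Sequence[int]],
) -> List[int]:
    """Return test-input indices, highest score first.

    Assumption: score = killed model mutants + changed input mutants.
    This stands in for the learned ranking model described in the abstract.
    """
    killed = disagreement_counts(orig_pred, model_mutant_preds)
    changed = disagreement_counts(orig_pred, input_mutant_preds)
    scores = [k + c for k, c in zip(killed, changed)]
    # sorted() is stable, so ties keep their original relative order.
    return sorted(range(len(scores)), key=lambda i: -scores[i])


# Hypothetical toy data: 4 test inputs, 3 mutated models, 2 mutated inputs each.
orig_pred = [0, 1, 2, 1]
model_mutant_preds = [  # row = one mutated model, column = one test input
    [0, 1, 1, 1],
    [0, 2, 1, 1],
    [0, 1, 0, 1],
]
input_mutant_preds = [  # row = one input mutation, column = one test input
    [0, 1, 2, 0],
    [1, 1, 0, 1],
]

order = prioritize(orig_pred, model_mutant_preds, input_mutant_preds)
print(order)
```

In this toy data, the third test input disagrees with the original prediction under every mutated model, so it is ranked first and would be labeled first under a limited labeling budget.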

Pages: 397-409

Page count: 13
