Prioritizing Test Inputs for Deep Neural Networks via Mutation Analysis

被引：67

作者：

Wang, Zan ^{[1
]}

You, Hanmo ^{[1
]}

Chen, Junjie ^{[1
]}

Zhang, Yingyi ^{[1
]}

Dong, Xuyuan ^{[2
]}

Zhang, Wenbin ^{[2
]}

机构：

[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China

[2] Tianjin Univ, Informat & Network Ctr, Tianjin, Peoples R China

来源：

2021 IEEE/ACM 43RD INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2021) | 2021年

基金：

中国国家自然科学基金;

关键词：

Test Prioritization; Deep Neural Network; Mutation; Label; Deep Learning Testing; SELECTION;

D O I：

10.1109/ICSE43902.2021.00046

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Deep Neural Network (DNN) testing is one of the most widely-used ways to guarantee the quality of DNNs. However, labeling test inputs to check the correctness of DNN prediction is very costly, which could largely affect the efficiency of DNN testing, even the whole process of DNN development. To relieve the labeling-cost problem, we propose a novel test input prioritization approach (called PRIMA) for DNNs via intelligent mutation analysis in order to label more hug-revealing test inputs earlier for a limited time, which facilitates to improve the efficiency of DNN testing. PRIMA is based on the key insight: a test input that is able to kill many mutated models and produce different prediction results with many mutated inputs, is more likely to reseal DNN bugs, and thus it should be prioritized higher. After obtaining a number of mutation results from a series of our designed model and input mutation rules for each test input, PRIMA further incorporates learning-to-rank (a kind of supervised machine learning to solve ranking problems) to intelligently combine these mutation results for effective test input prioritization. We conducted an extensive study based on M popular subjects by carefully considering their diversity from five dimensions (i.e, different domains of test inputs. different DNN tasks, different network structures, different types of test inputs, and different training scenarios). Our experimental results demonstrate the effectiveness of PRIMA, significantly outperforming the state-of-the-art approaches (with the average improvement of 8.50%similar to 131.01% in terms of prioritization effectiveness). In particular, we have applied PRIMA to the practical autonomous-vehicle testing in a large motor company, and the results on 4 real-world scene-recognition models in autonomous vehicles further confirm the practicability of PRIMA.

引用

页码：397 / 409

页数：13

共 50 条

[11] Robust Test Selection for Deep Neural Networks
Sun, Weifeng
Yan, Meng
Liu, Zhongxin
Lo, David
[J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2023, 49 (12) : 5250 - 5278
[12] A probabilistic framework for mutation testing in deep neural networks
Tambon, Florian
Khomh, Foutse
Antoniol, Giuliano
[J]. INFORMATION AND SOFTWARE TECHNOLOGY, 2023, 155
[13] MuNN: Mutation Analysis of Neural Networks
Shen, Weijun
Wan, Jun
Chen, Zhenyu
[J]. 2018 IEEE 18TH INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY COMPANION (QRS-C), 2018, : 108 - 115
[14] Load Forecasting via Deep Neural Networks
He, Wan
[J]. 5TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND QUANTITATIVE MANAGEMENT, ITQM 2017, 2017, 122 : 308 - 314
[15] NIM: MODELING AND GENERATION OF SIMULATION INPUTS VIA GENERATIVE NEURAL NETWORKS
Cen, Wang
Herbert, Emily A.
Haas, Peter J.
[J]. 2020 WINTER SIMULATION CONFERENCE (WSC), 2020, : 584 - 595
[16] Neural Networks with Dependent Inputs
Mostafa Boskabadi
Mahdi Doostparast
[J]. Neural Processing Letters, 2023, 55 : 7337 - 7350
[17] Surprise Adequacy-Guided Deep Neural Network Test Inputs Generation
Guo, Hongjing
Tao, Chuanqi
Huang, Zhiqiu
[J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2024, 61 (04): : 1003 - 1017
[18] Neural Networks with Dependent Inputs
Boskabadi, Mostafa
Doostparast, Mahdi
[J]. NEURAL PROCESSING LETTERS, 2023, 55 (06) : 7337 - 7350
[19] Sentiment Analysis via Deep Multichannel Neural Networks With Variational Information Bottleneck
Gu, Tong
Xu, Guoliang
Luo, Jiangtao
[J]. IEEE ACCESS, 2020, 8 : 121014 - 121021
[20] The #DNN-Verification Problem: Counting Unsafe Inputs for Deep Neural Networks
Marzari, Luca
Corsi, Davide
Cicalese, Ferdinando
Farinelli, Alessandro
[J]. PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 217 - 224

← 1 2 3 4 5 →