Prioritizing Test Inputs for Deep Neural Networks via Mutation Analysis

被引:67
|
作者
Wang, Zan [1 ]
You, Hanmo [1 ]
Chen, Junjie [1 ]
Zhang, Yingyi [1 ]
Dong, Xuyuan [2 ]
Zhang, Wenbin [2 ]
机构
[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China
[2] Tianjin Univ, Informat & Network Ctr, Tianjin, Peoples R China
基金
中国国家自然科学基金;
关键词
Test Prioritization; Deep Neural Network; Mutation; Label; Deep Learning Testing; SELECTION;
D O I
10.1109/ICSE43902.2021.00046
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Deep Neural Network (DNN) testing is one of the most widely-used ways to guarantee the quality of DNNs. However, labeling test inputs to check the correctness of DNN prediction is very costly, which could largely affect the efficiency of DNN testing, even the whole process of DNN development. To relieve the labeling-cost problem, we propose a novel test input prioritization approach (called PRIMA) for DNNs via intelligent mutation analysis in order to label more hug-revealing test inputs earlier for a limited time, which facilitates to improve the efficiency of DNN testing. PRIMA is based on the key insight: a test input that is able to kill many mutated models and produce different prediction results with many mutated inputs, is more likely to reseal DNN bugs, and thus it should be prioritized higher. After obtaining a number of mutation results from a series of our designed model and input mutation rules for each test input, PRIMA further incorporates learning-to-rank (a kind of supervised machine learning to solve ranking problems) to intelligently combine these mutation results for effective test input prioritization. We conducted an extensive study based on M popular subjects by carefully considering their diversity from five dimensions (i.e, different domains of test inputs. different DNN tasks, different network structures, different types of test inputs, and different training scenarios). Our experimental results demonstrate the effectiveness of PRIMA, significantly outperforming the state-of-the-art approaches (with the average improvement of 8.50%similar to 131.01% in terms of prioritization effectiveness). In particular, we have applied PRIMA to the practical autonomous-vehicle testing in a large motor company, and the results on 4 real-world scene-recognition models in autonomous vehicles further confirm the practicability of PRIMA.
引用
收藏
页码:397 / 409
页数:13
相关论文
共 50 条
  • [11] Robust Test Selection for Deep Neural Networks
    Sun, Weifeng
    Yan, Meng
    Liu, Zhongxin
    Lo, David
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2023, 49 (12) : 5250 - 5278
  • [12] A probabilistic framework for mutation testing in deep neural networks
    Tambon, Florian
    Khomh, Foutse
    Antoniol, Giuliano
    [J]. INFORMATION AND SOFTWARE TECHNOLOGY, 2023, 155
  • [13] MuNN: Mutation Analysis of Neural Networks
    Shen, Weijun
    Wan, Jun
    Chen, Zhenyu
    [J]. 2018 IEEE 18TH INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY COMPANION (QRS-C), 2018, : 108 - 115
  • [14] Load Forecasting via Deep Neural Networks
    He, Wan
    [J]. 5TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND QUANTITATIVE MANAGEMENT, ITQM 2017, 2017, 122 : 308 - 314
  • [15] NIM: MODELING AND GENERATION OF SIMULATION INPUTS VIA GENERATIVE NEURAL NETWORKS
    Cen, Wang
    Herbert, Emily A.
    Haas, Peter J.
    [J]. 2020 WINTER SIMULATION CONFERENCE (WSC), 2020, : 584 - 595
  • [16] Neural Networks with Dependent Inputs
    Mostafa Boskabadi
    Mahdi Doostparast
    [J]. Neural Processing Letters, 2023, 55 : 7337 - 7350
  • [17] Surprise Adequacy-Guided Deep Neural Network Test Inputs Generation
    Guo, Hongjing
    Tao, Chuanqi
    Huang, Zhiqiu
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2024, 61 (04): : 1003 - 1017
  • [18] Neural Networks with Dependent Inputs
    Boskabadi, Mostafa
    Doostparast, Mahdi
    [J]. NEURAL PROCESSING LETTERS, 2023, 55 (06) : 7337 - 7350
  • [19] Sentiment Analysis via Deep Multichannel Neural Networks With Variational Information Bottleneck
    Gu, Tong
    Xu, Guoliang
    Luo, Jiangtao
    [J]. IEEE ACCESS, 2020, 8 : 121014 - 121021
  • [20] The #DNN-Verification Problem: Counting Unsafe Inputs for Deep Neural Networks
    Marzari, Luca
    Corsi, Davide
    Cicalese, Ferdinando
    Farinelli, Alessandro
    [J]. PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 217 - 224