DeepWeak: Weak Mutation Testing for Deep Learning Systems

被引:0
|
作者
Xue, Yinjie [1 ]
Zhang, Zhiyi [1 ,2 ]
Liu, Chen [3 ]
Chen, Shuxian [1 ]
Huang, Zhiqiu [1 ,4 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing, Peoples R China
[2] Collaborat Innovat Ctr Novel Software Technol & I, Nanjing, Peoples R China
[3] Yangzhou Univ, Sch Marxism, Yangzhou, Jiangsu, Peoples R China
[4] Nanjing Univ Aeronaut & Astronaut, Minist Key Lab Safety Crit Software Dev & Verific, Nanjing, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
software testing; mutation testing; weak mutation; deep learning;
D O I
10.1109/QRS62785.2024.00015
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The widespread application of deep learning (DL) makes it crucial to ensure its reliability. Mutation testing has been employed in DL testing to evaluate the quality of test suite. However, the largest problem of DL mutation testing is the high cost of executing numbers of mutants. Weak mutation technology can alleviate this problem by reducing the execution time of mutants in traditional software testing. However, the compared components in traditional software are too trivial to apply weak mutation to DL models directly for that it is impractical for testers to track and monitor massive parameters during execution process. In this paper, we propose a novel weak mutation framework for mutants generated by source-level mutation operators. DeepWeak treats all layers that make up the DL model directly as a set of components of model to replace trivial parameters. And it pays attention to the last convolutioanl layer for that they not only have impacts on prediction results but also are evident for weak analysis. By quantifying contribution of feature maps to the prediction, weight maps will be obtained on the basis of their weights. Finally, the judgements on whether mutants have been killed will be reached by comparing the maps. To evaluate the applicability and effectiveness of our approach, we conduct experiments on three widely used datasets and four deep learning models using three metrics. Experimental results show that DeepWeak is effective at alleviating costs problem, reducing runtime by 11.21% to 18.21% compared with the DL mutation testing with little accuracy loss.
引用
收藏
页码:49 / 60
页数:12
相关论文
共 50 条
  • [1] DeepMutation: Mutation Testing of Deep Learning Systems
    Ma, Lei
    Zhang, Fuyuan
    Sun, Jiyuan
    Xue, Minhui
    Li, Bo
    Juefei-Xu, Felix
    Xie, Chao
    Li, Li
    Liu, Yang
    Zhao, Jianjun
    Wang, Yadong
    2018 29TH IEEE INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING (ISSRE), 2018, : 100 - 111
  • [2] DeepMutation plus plus : a Mutation Testing Framework for Deep Learning Systems
    Hu, Qiang
    Ma, Lei
    Xie, Xiaofei
    Yu, Bing
    Liu, Yang
    Zhao, Jianjun
    34TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING (ASE 2019), 2019, : 1158 - 1161
  • [3] DeepCrime: Mutation Testing of Deep Learning Systems Based on Real Faults
    Humbatova, Nargiz
    Jahangirova, Gunel
    Tonella, Paolo
    ISSTA '21: PROCEEDINGS OF THE 30TH ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON SOFTWARE TESTING AND ANALYSIS, 2021, : 67 - 78
  • [4] A fine-grained evaluation of mutation operators to boost mutation testing for deep learning systems
    Zhang, Zhiyi
    Wang, Yichun
    Yao, Yongming
    Wang, Ziyuan
    Huang, Zhiqiu
    EMPIRICAL SOFTWARE ENGINEERING, 2025, 30 (03)
  • [5] Mutation testing of unsupervised learning systems
    Lu, Yuteng
    Shao, Kaicheng
    Zhao, Jia
    Sun, Weidi
    Sun, Meng
    JOURNAL OF SYSTEMS ARCHITECTURE, 2024, 146
  • [6] Towards mutation testing of Reinforcement Learning systems
    Lu, Yuteng
    Sun, Weidi
    Sun, Meng
    JOURNAL OF SYSTEMS ARCHITECTURE, 2022, 131
  • [7] Mutation Testing of Deep Reinforcement Learning Based on Real Faults
    Tambon, Florian
    Majdinasab, Vahid
    Nikanjam, Amin
    Khomh, Foutse
    Antoniol, Giuliano
    2023 IEEE CONFERENCE ON SOFTWARE TESTING, VERIFICATION AND VALIDATION, ICST, 2023, : 188 - 198
  • [8] Boundary sampling to boost mutation testing for deep learning models
    Shen, Weijun
    Li, Yanhui
    Han, Yuanlei
    Chen, Lin
    Wu, Di
    Zhou, Yuming
    Xu, Baowen
    INFORMATION AND SOFTWARE TECHNOLOGY, 2021, 130 (130)
  • [9] Improving Testing of Deep-Learning Systems
    Deokuliar, Harsh
    Sangwan, Raghvinder S.
    Badr, Yoaukim
    Srinivasan, Satish M.
    COMMUNICATIONS OF THE ACM, 2024, 67 (03) : 44 - 48
  • [10] Improving Testing of Deep-learning Systems
    Deokuliar H.
    Sangwan R.S.
    Badr Y.
    Srinivasan S.M.
    Queue, 2023, 21 (05):