DeepWeak: Weak Mutation Testing for Deep Learning Systems

被引:0
|
作者
Xue, Yinjie [1 ]
Zhang, Zhiyi [1 ,2 ]
Liu, Chen [3 ]
Chen, Shuxian [1 ]
Huang, Zhiqiu [1 ,4 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing, Peoples R China
[2] Collaborat Innovat Ctr Novel Software Technol & I, Nanjing, Peoples R China
[3] Yangzhou Univ, Sch Marxism, Yangzhou, Jiangsu, Peoples R China
[4] Nanjing Univ Aeronaut & Astronaut, Minist Key Lab Safety Crit Software Dev & Verific, Nanjing, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
software testing; mutation testing; weak mutation; deep learning;
D O I
10.1109/QRS62785.2024.00015
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The widespread application of deep learning (DL) makes it crucial to ensure its reliability. Mutation testing has been employed in DL testing to evaluate the quality of test suite. However, the largest problem of DL mutation testing is the high cost of executing numbers of mutants. Weak mutation technology can alleviate this problem by reducing the execution time of mutants in traditional software testing. However, the compared components in traditional software are too trivial to apply weak mutation to DL models directly for that it is impractical for testers to track and monitor massive parameters during execution process. In this paper, we propose a novel weak mutation framework for mutants generated by source-level mutation operators. DeepWeak treats all layers that make up the DL model directly as a set of components of model to replace trivial parameters. And it pays attention to the last convolutioanl layer for that they not only have impacts on prediction results but also are evident for weak analysis. By quantifying contribution of feature maps to the prediction, weight maps will be obtained on the basis of their weights. Finally, the judgements on whether mutants have been killed will be reached by comparing the maps. To evaluate the applicability and effectiveness of our approach, we conduct experiments on three widely used datasets and four deep learning models using three metrics. Experimental results show that DeepWeak is effective at alleviating costs problem, reducing runtime by 11.21% to 18.21% compared with the DL mutation testing with little accuracy loss.
引用
收藏
页码:49 / 60
页数:12
相关论文
共 50 条
  • [31] DeepGauge: Multi-Granularity Testing Criteria for Deep Learning Systems
    Ma, Lei
    Juefei-Xu, Felix
    Zhang, Fuyuan
    Sun, Jiyuan
    Xue, Minhui
    Li, Bo
    Chen, Chunyang
    Su, Ting
    Li, Li
    Liu, Yang
    Zhao, Jianjun
    Wang, Yadong
    PROCEEDINGS OF THE 2018 33RD IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMTED SOFTWARE ENGINEERING (ASE' 18), 2018, : 120 - 131
  • [32] Markov model based coverage testing of deep learning software systems
    Shi, Ying
    Yin, Beibei
    Shi, Jing-Ao
    INFORMATION AND SOFTWARE TECHNOLOGY, 2025, 179
  • [33] Efficient and Effective Feature Space Exploration for Testing Deep Learning Systems
    Zohdinasab, Tahereh
    Riccio, Vincenzo
    Gambi, Alessio
    Tonella, Paolo
    ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2023, 32 (02)
  • [34] Condition-Guided Adversarial Generative Testing for Deep Learning Systems
    Dai, Qiyin
    Zhang, Pengcheng
    Ji, Shunhui
    2019 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE TESTING (AITEST), 2019, : 71 - 72
  • [35] Uncertainty-Guided Testing and Robustness Enhancement for Deep Learning Systems
    Zhang, Xiyue
    2020 ACM/IEEE 42ND INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING: COMPANION PROCEEDINGS (ICSE-COMPANION 2020), 2020, : 101 - 103
  • [36] FRAFOL: FRAmework FOr Learning mutation testing
    Tavares, Pedro
    Paiva, Ana
    Amalfitano, Domenico
    Just, Rene
    PROCEEDINGS OF THE 33RD ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON SOFTWARE TESTING AND ANALYSIS, ISSTA 2024, 2024, : 1846 - 1850
  • [37] WeMuTe - A Weak Mutation Testing Tool for WS-BPEL
    Boonyakulsrirung, Panya
    Suwannasart, Taratip
    INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, IMECS 2012, VOL I, 2012, : 810 - 815
  • [38] Mutant reduction based on dominance relation for weak mutation testing
    Gong, Dunwei
    Zhang, Gongjie
    Yao, Xiangjuan
    Meng, Fanlin
    INFORMATION AND SOFTWARE TECHNOLOGY, 2017, 81 : 82 - 96
  • [39] CWMT; A Concurrency Based Acceleration Technique for Weak Mutation Testing
    Sun, Chang-Ai
    Zeng, Guo-Feng
    Zhang, Shou-Feng
    Tang, Jin
    Li, Ning
    Zhang, Shi-Yong
    Chen, Yan
    Jisuanji Xuebao/Chinese Journal of Computers, 2023, 46 (07): : 1409 - 1426
  • [40] Interpreting deep learning models for weak lensing
    Matilla, Jose Manuel Zorrilla
    Sharma, Manasi
    Hsu, Daniel
    Haiman, Zoltan
    PHYSICAL REVIEW D, 2020, 102 (12)