DeepWeak: Weak Mutation Testing for Deep Learning Systems

被引:0
|
作者
Xue, Yinjie [1 ]
Zhang, Zhiyi [1 ,2 ]
Liu, Chen [3 ]
Chen, Shuxian [1 ]
Huang, Zhiqiu [1 ,4 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing, Peoples R China
[2] Collaborat Innovat Ctr Novel Software Technol & I, Nanjing, Peoples R China
[3] Yangzhou Univ, Sch Marxism, Yangzhou, Jiangsu, Peoples R China
[4] Nanjing Univ Aeronaut & Astronaut, Minist Key Lab Safety Crit Software Dev & Verific, Nanjing, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
software testing; mutation testing; weak mutation; deep learning;
D O I
10.1109/QRS62785.2024.00015
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The widespread application of deep learning (DL) makes it crucial to ensure its reliability. Mutation testing has been employed in DL testing to evaluate the quality of test suite. However, the largest problem of DL mutation testing is the high cost of executing numbers of mutants. Weak mutation technology can alleviate this problem by reducing the execution time of mutants in traditional software testing. However, the compared components in traditional software are too trivial to apply weak mutation to DL models directly for that it is impractical for testers to track and monitor massive parameters during execution process. In this paper, we propose a novel weak mutation framework for mutants generated by source-level mutation operators. DeepWeak treats all layers that make up the DL model directly as a set of components of model to replace trivial parameters. And it pays attention to the last convolutioanl layer for that they not only have impacts on prediction results but also are evident for weak analysis. By quantifying contribution of feature maps to the prediction, weight maps will be obtained on the basis of their weights. Finally, the judgements on whether mutants have been killed will be reached by comparing the maps. To evaluate the applicability and effectiveness of our approach, we conduct experiments on three widely used datasets and four deep learning models using three metrics. Experimental results show that DeepWeak is effective at alleviating costs problem, reducing runtime by 11.21% to 18.21% compared with the DL mutation testing with little accuracy loss.
引用
收藏
页码:49 / 60
页数:12
相关论文
共 50 条
  • [21] ARTDL: Adaptive Random Testing for Deep Learning Systems
    Yan, Min
    Wang, Li
    Fei, Aiguo
    IEEE ACCESS, 2020, 8 : 3055 - 3064
  • [22] DeepCT: Tomographic Combinatorial Testing for Deep Learning Systems
    Ma, Lei
    Juefei-Xu, Felix
    Xue, Minhui
    Li, Bo
    Li, Li
    Liu, Yang
    Zhao, Jianjun
    2019 IEEE 26TH INTERNATIONAL CONFERENCE ON SOFTWARE ANALYSIS, EVOLUTION AND REENGINEERING (SANER), 2019, : 614 - 618
  • [23] Deep Learning for Hyperspectral Image Classification: A Critical Evaluation via Mutation Testing
    Chen, Zhifei
    Hao, Yang
    Liu, Qichao
    Liu, Yuyong
    Zhu, Mingyang
    Xiao, Liang
    REMOTE SENSING, 2024, 16 (24)
  • [24] Mutation-Based Deep Learning Framework Testing Method in JavaScript Environment
    Zou, Yinglong
    Liu, Jiawei
    Zhai, Juan
    Zheng, Tao
    Fang, Chunrong
    Chen, Zhenyu
    arXiv,
  • [25] Coverage Guided Differential Adversarial Testing of Deep Learning Systems
    Guo, Jianmin
    Zhao, Yue
    Song, Houbing
    Jiang, Yu
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2021, 8 (02): : 933 - 942
  • [26] QuoTe: Quality-oriented Testing for Deep Learning Systems
    Chen, Jialuo
    Wang, Jingyi
    Ma, Xingjun
    Sun, Youcheng
    Sun, Jun
    Zhang, Peixin
    Cheng, Peng
    ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2023, 32 (05)
  • [27] A lightweight adaptive random testing method for deep learning systems
    Mao, Chengying
    Song, Yue
    Chen, Jifu
    SOFTWARE-PRACTICE & EXPERIENCE, 2023, 53 (11): : 2271 - 2295
  • [28] RobOT: Robustness-Oriented Testing for Deep Learning Systems
    Wang, Jingyi
    Chen, Jialuo
    Sun, Youcheng
    Ma, Xingjun
    Wang, Dongxia
    Sun, Jun
    Cheng, Peng
    2021 IEEE/ACM 43RD INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2021), 2021, : 300 - 311
  • [29] Mixed and constrained input mutation for effective fuzzing of deep learning systems
    Park, Leo Hyun
    Kim, Jaeuk
    Park, Jaewoo
    Kwon, Taekyoung
    INFORMATION SCIENCES, 2022, 614 : 497 - 517
  • [30] MMOS: Multi-Staged Mutation Operator Scheduling for Deep Learning Library Testing
    Li, Junqiang
    Li, Senyi
    Wu, Jiawei
    Luo, Long
    Bai, Yang
    Yu, Hongfang
    2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 6103 - 6108