Markov model based coverage testing of deep learning software systems

Authors
Shi, Ying [1]
Yin, Beibei [1]
Shi, Jing-Ao [1]
Affiliations
[1] Beihang Univ BUAA, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China
Keywords
Deep learning software systems; Deep learning testing; Markov chains; Coverage criteria; Information theory; REPRESENTATION; FRAMEWORK;
DOI
10.1016/j.infsof.2024.107628
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Context: Deep Learning (DL) software systems have been widely deployed in safety- and security-critical domains, which calls for systematic testing to guarantee their accuracy and reliability. Objective measurement of test quality is one of the key issues in software testing. Recently, many coverage criteria have been proposed to measure the testing adequacy of Deep Neural Networks (DNNs).
Objective: Recent research demonstrates that existing criteria are limited in interpreting the increasingly diverse behaviors of DNNs and in clarifying the relationship between coverage and the decision logic of DNNs. Moreover, some evaluations dispute the correlation between coverage and defect detection. In this paper, a novel coverage approach is proposed to interpret the internal information of programs.
Methods: The process of coverage testing is formalized and quantified by constructing Markov models based on critical neurons, which are extracted from the structure of DNNs using Layer-wise Relevance Propagation. The difference between the transition matrices of the Markov chains for training and testing data is measured by KL divergence and is developed into a coverage criterion.
Results: The values of the proposed coverage increase as the number of classes increases. The values differ across test suites and become higher as new samples are added. Higher coverage values are observed to correlate with an increased fault detection capability.
Conclusion: The experimental results illustrate that the proposed approach can effectively measure actual diversity and is more adaptable to additional test cases. Furthermore, there is a positive correlation between the proposed coverage and fault detection, which supports coverage-guided test case selection.
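The Methods part of the abstract compares Markov-chain transition matrices for training and testing data using KL divergence. The following is only a minimal sketch of that general idea, not the authors' implementation: it assumes each input has already been reduced to a sequence of integer state IDs (the abstract derives states from critical neurons found by Layer-wise Relevance Propagation, which is not reproduced here). The function names, the additive smoothing parameter `alpha`, the direction of the divergence, and the averaging over rows are all illustrative assumptions.

```python
import numpy as np

def transition_matrix(sequences, n_states, alpha=1.0):
    """Estimate a row-stochastic transition matrix from state sequences.

    alpha is additive smoothing (an assumption of this sketch) so that
    every entry is positive and the KL divergence stays finite.
    """
    counts = np.full((n_states, n_states), alpha, dtype=float)
    for seq in sequences:
        # Count each observed transition src -> dst along the sequence.
        for src, dst in zip(seq[:-1], seq[1:]):
            counts[src, dst] += 1.0
    return counts / counts.sum(axis=1, keepdims=True)

def mean_row_kl(p, q):
    """Average KL(p_i || q_i) over the rows of two transition matrices.

    Averaging the per-row divergences is an illustrative choice; the
    abstract does not state how rows are aggregated.
    """
    return float(np.mean(np.sum(p * np.log(p / q), axis=1)))

# Hypothetical state sequences, one per input sample.
train_paths = [[0, 1, 2], [0, 1, 2], [0, 2, 2]]
test_paths = [[0, 2, 1], [1, 0, 2]]

P_train = transition_matrix(train_paths, n_states=3)
P_test = transition_matrix(test_paths, n_states=3)

# Divergence of the test-data chain from the training-data chain.
divergence = mean_row_kl(P_test, P_train)
print(divergence)
```

In this sketch a test suite whose transitions mirror the training data yields a divergence near zero, while a suite that exercises different transitions yields a larger value. How such a value is turned into the coverage criterion is defined in the paper itself and is not shown here.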
Pages: 13