Cost Measures Matter for Mutation Testing Study Validity

被引:7
|
作者
Guizzo, Giovani [1 ]
Sarro, Federica [1 ]
Harman, Mark [1 ]
机构
[1] UCL, Dept Comp Sci, London, England
基金
欧洲研究理事会;
关键词
Software Testing; Mutation Testing; Mutation Analysis; Cost Reduction; Number of Mutants; Execution Time; Mutant Reduction;
D O I
10.1145/3368089.3409742
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Mutation testing research has often used the number of mutants as a surrogate measure for the true execution cost of generating and executing mutants. This poses a potential threat to the validity of the scientific findings reported in the literature. Out of 75 works surveyed in this paper, we found that 54 (72%) are vulnerable to this threat. To investigate the magnitude of the threat, we conducted an empirical evaluation using 10 real-world programs. The results reveal that: i) percentages of randomly sampled mutants differ from the true execution time, on average, by 44%, varying in difference from 19% to 91%; ii) errors arising from using the surrogate correlate with program size (rho = 0.74) and number of mutants (rho = 0.76), making the problem more pernicious for more realistic programs; iii) scientific findings concerning sampling strategies would have approximately 37% rank disagreement, indicating potentially dramatic impact on experiment validity. To investigate whether this threat matters in practice, we reproduced a seminal study on Selective Mutation (widely relied upon for more than two decades). The impact is stark: an inconclusive scientific finding using the surrogate is transformed to an unequivocal finding when using the true execution cost.
引用
收藏
页码:1127 / 1139
页数:13
相关论文
共 50 条
  • [21] Mutation testing cost reduction by clustering overlapped mutants
    Ma, Yu-Seung
    Kim, Sang-Woon
    JOURNAL OF SYSTEMS AND SOFTWARE, 2016, 115 : 18 - 30
  • [22] Improved Genetic Algorithm to Reduce Mutation Testing Cost
    Bashir, Muhammad Bilal
    Nadeem, Aamer
    IEEE ACCESS, 2017, 5 : 3657 - 3674
  • [23] RSM: REDUCING MUTATION TESTING COST USING RANDOM SELECTIVE MUTATION TECHNIQUE
    Falah, Bouchaib
    Akour, Mohammed
    Bouriat, Salwa
    MALAYSIAN JOURNAL OF COMPUTER SCIENCE, 2015, 28 (04) : 338 - 347
  • [24] The validity of pulp testing A clinical study
    Weisleder, Rebeca
    Yamauchi, Shizuko
    Caplan, Daniel J.
    Trope, Martin
    Teixeira, Fabricio B.
    JOURNAL OF THE AMERICAN DENTAL ASSOCIATION, 2009, 140 (08): : 1013 - 1017
  • [25] A STUDY OF VALIDITY OF OPEN-FIELD MEASURES
    IVINSKIS, A
    AUSTRALIAN JOURNAL OF PSYCHOLOGY, 1970, 22 (02) : 175 - &
  • [26] Identification of propagated defects to reduce software testing cost via mutation testing
    Lee, Dong-Gun
    Seo, Yeong-Seok
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2022, 19 (06) : 6124 - 6140
  • [27] TESTING THE VALIDITY-CONDITIONS OF REPEATED MEASURES F-TESTS
    KESELMAN, HJ
    ROGAN, JC
    MENDOZA, JL
    BREEN, LJ
    PSYCHOLOGICAL BULLETIN, 1980, 87 (03) : 479 - 481
  • [28] Testing the reliability and validity of computer-mediated social support measures among older adults - A pilot study
    Nahm, ES
    Resnick, B
    Gaines, J
    CIN-COMPUTERS INFORMATICS NURSING, 2004, 22 (04) : 211 - 219
  • [29] Decreasing the cost of mutation testing with second-order mutants
    Polo, Macario
    Piattini, Mario
    Garcia-Rodriguez, Ignacio
    SOFTWARE TESTING VERIFICATION & RELIABILITY, 2009, 19 (02): : 111 - 131
  • [30] Reducing the Cost of Mutation Testing with the Use of Primitive Arcs Concept
    Kuroishi, Pedro Henrique
    Delamaro, Marcio Eduardo
    Maldonado, Jose Carlos
    Rizzo Vincenzi, Auri Marcelo
    PROCEEDINGS OF THE 19TH BRAZILIAN SYMPOSIUM ON SOFTWARE QUALITY, SBOS 2020, 2020,