Cost Measures Matter for Mutation Testing Study Validity

被引：7

作者：

Guizzo, Giovani ^{[1
]}

Sarro, Federica ^{[1
]}

Harman, Mark ^{[1
]}

机构：

[1] UCL, Dept Comp Sci, London, England

来源：

PROCEEDINGS OF THE 28TH ACM JOINT MEETING ON EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING (ESEC/FSE '20) | 2020年

基金：

欧洲研究理事会;

关键词：

Software Testing; Mutation Testing; Mutation Analysis; Cost Reduction; Number of Mutants; Execution Time; Mutant Reduction;

D O I：

10.1145/3368089.3409742

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Mutation testing research has often used the number of mutants as a surrogate measure for the true execution cost of generating and executing mutants. This poses a potential threat to the validity of the scientific findings reported in the literature. Out of 75 works surveyed in this paper, we found that 54 (72%) are vulnerable to this threat. To investigate the magnitude of the threat, we conducted an empirical evaluation using 10 real-world programs. The results reveal that: i) percentages of randomly sampled mutants differ from the true execution time, on average, by 44%, varying in difference from 19% to 91%; ii) errors arising from using the surrogate correlate with program size (rho = 0.74) and number of mutants (rho = 0.76), making the problem more pernicious for more realistic programs; iii) scientific findings concerning sampling strategies would have approximately 37% rank disagreement, indicating potentially dramatic impact on experiment validity. To investigate whether this threat matters in practice, we reproduced a seminal study on Selective Mutation (widely relied upon for more than two decades). The impact is stark: an inconclusive scientific finding using the surrogate is transformed to an unequivocal finding when using the true execution cost.

引用

页码：1127 / 1139

页数：13

共 50 条

[21] Mutation testing cost reduction by clustering overlapped mutants
Ma, Yu-Seung
Kim, Sang-Woon
JOURNAL OF SYSTEMS AND SOFTWARE, 2016, 115 : 18 - 30
[22] Improved Genetic Algorithm to Reduce Mutation Testing Cost
Bashir, Muhammad Bilal
Nadeem, Aamer
IEEE ACCESS, 2017, 5 : 3657 - 3674
[23] RSM: REDUCING MUTATION TESTING COST USING RANDOM SELECTIVE MUTATION TECHNIQUE
Falah, Bouchaib
Akour, Mohammed
Bouriat, Salwa
MALAYSIAN JOURNAL OF COMPUTER SCIENCE, 2015, 28 (04) : 338 - 347
[24] The validity of pulp testing A clinical study
Weisleder, Rebeca
Yamauchi, Shizuko
Caplan, Daniel J.
Trope, Martin
Teixeira, Fabricio B.
JOURNAL OF THE AMERICAN DENTAL ASSOCIATION, 2009, 140 (08): : 1013 - 1017
[25] A STUDY OF VALIDITY OF OPEN-FIELD MEASURES
IVINSKIS, A
AUSTRALIAN JOURNAL OF PSYCHOLOGY, 1970, 22 (02) : 175 - &
[26] Identification of propagated defects to reduce software testing cost via mutation testing
Lee, Dong-Gun
Seo, Yeong-Seok
MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2022, 19 (06) : 6124 - 6140
[27] TESTING THE VALIDITY-CONDITIONS OF REPEATED MEASURES F-TESTS
KESELMAN, HJ
ROGAN, JC
MENDOZA, JL
BREEN, LJ
PSYCHOLOGICAL BULLETIN, 1980, 87 (03) : 479 - 481
[28] Testing the reliability and validity of computer-mediated social support measures among older adults - A pilot study
Nahm, ES
Resnick, B
Gaines, J
CIN-COMPUTERS INFORMATICS NURSING, 2004, 22 (04) : 211 - 219
[29] Decreasing the cost of mutation testing with second-order mutants
Polo, Macario
Piattini, Mario
Garcia-Rodriguez, Ignacio
SOFTWARE TESTING VERIFICATION & RELIABILITY, 2009, 19 (02): : 111 - 131
[30] Reducing the Cost of Mutation Testing with the Use of Primitive Arcs Concept
Kuroishi, Pedro Henrique
Delamaro, Marcio Eduardo
Maldonado, Jose Carlos
Rizzo Vincenzi, Auri Marcelo
PROCEEDINGS OF THE 19TH BRAZILIAN SYMPOSIUM ON SOFTWARE QUALITY, SBOS 2020, 2020,

← 1 2 3 4 5 →