Equivalence hypothesis testing in experimental software engineering

被引：5

作者：

Javier Dolado, Jose ^{[1
]}

Carmen Otero, Mari ^{[2
]}

Harman, Mark ^{[3
]}

机构：

[1] UPV EHU Univ Basque Country, Fac Informat, San Sebastian, Spain

[2] UPV EHU Univ Basque Country, Escuela Univ Ingn Vitoria Gasteiz, Vitoria, Spain

[3] UCL, CREST, London WC1E 6BT, England

来源：

SOFTWARE QUALITY JOURNAL | 2014年 / 22卷 / 02期

关键词：

Equivalence hypothesis testing; Bioequivalence analysis; Program comprehension; Side-effect free programs; Crossover design; Experimental software engineering; CONFIDENCE-INTERVALS; MODEL VALIDATION; POWER; DIFFERENCE;

D O I：

10.1007/s11219-013-9196-0

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

This article introduces the application of equivalence hypothesis testing (EHT) into the Empirical Software Engineering field. Equivalence (also known as bioequivalence in pharmacological studies) is a statistical approach that answers the question "is product T equivalent to some other reference product R within some range ?." The approach of "null hypothesis significance test" used traditionally in Empirical Software Engineering seeks to assess evidence for differences between T and R, not equivalence. In this paper, we explain how EHT can be applied in Software Engineering, thereby extending it from its current application within pharmacological studies, to Empirical Software Engineering. We illustrate the application of EHT to Empirical Software Engineering, by re-examining the behavior of experts and novices when handling code with side effects compared to side-effect free code; a study previously investigated using traditional statistical testing. We also review two other previous published data of software engineering experiments: a dataset compared the comprehension of UML and OML specifications, and the last dataset studied the differences between the specification methods UML-B and B. The application of EHT allows us to extract additional conclusions to the previous results. EHT has an important application in Empirical Software Engineering, which motivate its wider adoption and use: EHT can be used to assess the statistical confidence with which we can claim that two software engineering methods, algorithms of techniques, are equivalent.

引用

下载

页码：215 / 238

页数：24

共 50 条

[1] Equivalence hypothesis testing in experimental software engineering
José Javier Dolado
Mari Carmen Otero
Mark Harman
Software Quality Journal, 2014, 22 : 215 - 238
[2] Bayesian Hypothesis Testing Illustrated: An Introduction for Software Engineering Researchers
Erdogmus, Hakan
ACM COMPUTING SURVEYS, 2023, 55 (06)
[3] Equivalence hypothesis testing: Reply to Bi
Ennis, Daniel M.
Ennis, John M.
FOOD QUALITY AND PREFERENCE, 2010, 21 (03) : 261 - 261
[4] Design level hypothesis testing through reverse engineering of object-oriented software
Counsell, S
Newson, P
Mendes, E
INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2004, 14 (02) : 207 - 220
[5] Architectural level hypothesis testing through reverse engineering of object-oriented software
Counsell, S
Newson, P
Mendes, E
8TH INTERNATIONAL WORKSHOP ON PROGRAM COMPREHENSION (IWPC 2000), PROCEEDINGS, 2000, : 60 - 66
[6] Credibility, hypothesis testing and regression software
Taylor, Greg
ASTIN BULLETIN, 2007, 37 (02): : 517 - 535
[7] The exact equivalence of distance and kernel methods in hypothesis testing
Cencheng Shen
Joshua T. Vogelstein
AStA Advances in Statistical Analysis, 2021, 105 : 385 - 403
[8] Null Hypothesis Significance Testing Does Not Show Equivalence
Barchard, Kimberly A.
ANALYSES OF SOCIAL ISSUES AND PUBLIC POLICY, 2015, 15 (01) : 418 - 421
[9] The exact equivalence of distance and kernel methods in hypothesis testing
Shen, Cencheng
Vogelstein, Joshua T.
ASTA-ADVANCES IN STATISTICAL ANALYSIS, 2021, 105 (03) : 385 - 403
[10] Hypothesis Testing in Noninferiority and Equivalence MRMC ROC Studies
Chen, Weijie
Petrick, Nicholas A.
Sahiner, Berkman
ACADEMIC RADIOLOGY, 2012, 19 (09) : 1158 - 1165

← 1 2 3 4 5 →