Equivalence hypothesis testing in experimental software engineering

Cited by: 5
Authors
Dolado, José Javier [1]
Otero, Mari Carmen [2]
Harman, Mark [3]
Affiliations
[1] UPV EHU Univ Basque Country, Fac Informat, San Sebastian, Spain
[2] UPV EHU Univ Basque Country, Escuela Univ Ingn Vitoria Gasteiz, Vitoria, Spain
[3] UCL, CREST, London WC1E 6BT, England
Keywords
Equivalence hypothesis testing; Bioequivalence analysis; Program comprehension; Side-effect free programs; Crossover design; Experimental software engineering; Confidence intervals; Model validation; Power; Difference
DOI
10.1007/s11219-013-9196-0
Chinese Library Classification (CLC) number
TP31 [Computer software]
Discipline classification code
081202; 0835
Abstract
This article introduces the application of equivalence hypothesis testing (EHT) to the field of Empirical Software Engineering. Equivalence (also known as bioequivalence in pharmacological studies) is a statistical approach that answers the question "is product T equivalent to some other reference product R within some range?" The "null hypothesis significance testing" approach traditionally used in Empirical Software Engineering seeks to assess evidence for differences between T and R, not equivalence. In this paper, we explain how EHT can be applied in Software Engineering, thereby extending it from its current application within pharmacological studies to Empirical Software Engineering. We illustrate the application of EHT to Empirical Software Engineering by re-examining the behavior of experts and novices when handling code with side effects compared to side-effect free code, a study previously investigated using traditional statistical testing. We also review two other previously published datasets from software engineering experiments: one compared the comprehension of UML and OML specifications, and the other studied the differences between the specification methods UML-B and B. The application of EHT allows us to draw additional conclusions beyond the previous results. EHT has an important application in Empirical Software Engineering, which motivates its wider adoption and use: EHT can be used to assess the statistical confidence with which we can claim that two software engineering methods, algorithms, or techniques are equivalent.
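To make the question "is T equivalent to R within some range?" concrete, the sketch below shows one common way an equivalence test can be set up: two one-sided tests (TOST) expressed through a confidence interval. It is an illustration only, not the procedure used in the paper. It assumes two independent samples and a large-sample normal approximation, whereas the paper's studies involve crossover designs. The function name `tost_equivalent`, the parameter `margin`, and the sample data are all hypothetical.

```python
from math import sqrt
from statistics import NormalDist, mean, variance


def tost_equivalent(t_scores, r_scores, margin, alpha=0.05):
    """Two one-sided tests (TOST), large-sample normal approximation.

    Assumption: independent samples, large enough for a z-based interval.
    Returns (equivalent, (ci_low, ci_high)), where the interval is the
    (1 - 2*alpha) confidence interval for mean(T) - mean(R).
    """
    diff = mean(t_scores) - mean(r_scores)
    # Standard error of the difference between two independent sample means.
    se = sqrt(variance(t_scores) / len(t_scores)
              + variance(r_scores) / len(r_scores))
    z = NormalDist().inv_cdf(1 - alpha)
    ci_low, ci_high = diff - z * se, diff + z * se
    # Both one-sided null hypotheses (diff <= -margin and diff >= +margin)
    # are rejected exactly when the interval lies inside (-margin, +margin).
    return (-margin < ci_low and ci_high < margin), (ci_low, ci_high)


# Hypothetical scores for a treatment T and a reference R.
t_scores = [10.0, 11.0, 9.0, 10.0, 10.5, 9.5] * 5
r_scores = [10.1, 10.9, 9.1, 10.0, 10.4, 9.6] * 5
equivalent, interval = tost_equivalent(t_scores, r_scores, margin=1.0)
print(equivalent, interval)
```

Note the reversal relative to a traditional significance test: here the null hypothesis is that T and R differ by at least `margin`, so rejecting it supports equivalence. The choice of `margin` is a domain decision that has to be justified before looking at the data.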
Pages: 215-238
Page count: 24
Source: Software Quality Journal, 2014, 22: 215-238