Equivalence hypothesis testing in experimental software engineering

Cited by: 5
Authors
Javier Dolado, Jose [1]
Carmen Otero, Mari [2]
Harman, Mark [3]
Affiliations
[1] UPV EHU Univ Basque Country, Fac Informat, San Sebastian, Spain
[2] UPV EHU Univ Basque Country, Escuela Univ Ingn Vitoria Gasteiz, Vitoria, Spain
[3] UCL, CREST, London WC1E 6BT, England
Keywords
Equivalence hypothesis testing; Bioequivalence analysis; Program comprehension; Side-effect free programs; Crossover design; Experimental software engineering; CONFIDENCE-INTERVALS; MODEL VALIDATION; POWER; DIFFERENCE;
DOI
10.1007/s11219-013-9196-0
Chinese Library Classification number
TP31 [Computer software];
Discipline classification code
081202 ; 0835 ;
Abstract
This article introduces the application of equivalence hypothesis testing (EHT) to the field of Empirical Software Engineering. Equivalence testing (known as bioequivalence in pharmacological studies) is a statistical approach that answers the question "is product T equivalent to some other reference product R within some range?" The "null hypothesis significance test" approach traditionally used in Empirical Software Engineering seeks to assess evidence for differences between T and R, not equivalence. In this paper, we explain how EHT can be applied in Software Engineering, thereby extending it from its current application in pharmacological studies to Empirical Software Engineering. We illustrate the application of EHT by re-examining the behavior of experts and novices when handling code with side effects compared to side-effect free code, a study previously analyzed using traditional statistical testing. We also review two other previously published datasets from software engineering experiments: one compared the comprehension of UML and OML specifications, and the other studied the differences between the specification methods UML-B and B. Applying EHT allows us to draw additional conclusions beyond the previous results. EHT has an important application in Empirical Software Engineering, which motivates its wider adoption and use: it can be used to assess the statistical confidence with which we can claim that two software engineering methods, algorithms, or techniques are equivalent.
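The question "is T equivalent to R within some range?" is commonly operationalized as two one-sided tests (TOST). The following is a minimal sketch of that idea, not the procedure used in the article: it assumes paired per-subject differences (T minus R), uses a large-sample normal approximation instead of a t distribution or a full crossover model, and the names `tost_paired`, `diffs`, and `delta` are hypothetical.

```python
from math import sqrt
from statistics import NormalDist, mean, stdev


def tost_paired(diffs, delta, alpha=0.05):
    """Two one-sided tests on paired differences (normal approximation).

    H0: |mean difference| >= delta   (T and R are not equivalent)
    H1: |mean difference| <  delta   (T and R are equivalent within delta)
    Returns (equivalent, p_value, confidence_interval).
    """
    n = len(diffs)
    d_bar = mean(diffs)
    se = stdev(diffs) / sqrt(n)  # standard error of the mean difference
    z = NormalDist()

    # Test 1: H0 is mean <= -delta; a large statistic is evidence against it.
    p_lower = 1 - z.cdf((d_bar + delta) / se)
    # Test 2: H0 is mean >= +delta; a small statistic is evidence against it.
    p_upper = z.cdf((d_bar - delta) / se)
    # Equivalence is claimed only if both one-sided tests reject.
    p_value = max(p_lower, p_upper)

    # Same decision, seen as an interval: the (1 - 2*alpha) confidence
    # interval must lie entirely inside (-delta, +delta).
    half_width = z.inv_cdf(1 - alpha) * se
    ci = (d_bar - half_width, d_bar + half_width)
    return p_value < alpha, p_value, ci


if __name__ == "__main__":
    # Hypothetical differences in comprehension scores, for illustration only.
    sample = [0.1, -0.2, 0.05, 0.0, -0.1, 0.15, -0.05, 0.1, -0.15, 0.05] * 3
    print(tost_paired(sample, delta=0.5))
```

The equivalence margin `delta` has to be fixed before looking at the data. A non-significant difference test does not by itself support equivalence, which is why the null hypothesis here is non-equivalence.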
Pages: 215-238
Page count: 24