Statistical significance testing - a panacea for software technology experiments?

被引:24
|
作者
Miller, J [1 ]
机构
[1] Univ Alberta, Dept Elect & Comp Engn, STEAM Res Ctr, Edmonton, AB T6H 5M3, Canada
关键词
empirical; hypothesis; replication;
D O I
10.1016/j.jss.2003.12.019
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Empirical software engineering has a long history of utilizing statistical significance testing, and in many ways, it has become the backbone of the topic. What is less obvious is how much consideration has been given to its adoption. Statistical significance testing was initially designed for testing hypotheses in a very different area, and hence the question must be asked: does it transfer into empirical software engineering research? This paper attempts to address this question. The paper finds that this transference is far from straightforward, resulting in several problems in its deployment within the area. Principally problems exist in: formulating hypotheses, the calculation of the probability values and its associated cut-off value, and the construction of the sample and its distribution. Hence, the paper concludes that the topic should explore other avenues of analysis, in an attempt to establish which analysis approaches are preferable under which conditions, when conducting empirical software engineering studies. (C) 2003 Elsevier Inc. All rights reserved.
引用
收藏
页码:183 / 192
页数:10
相关论文
共 50 条
  • [1] STATISTICAL SIGNIFICANCE - IS IT A PANACEA OR A PITFALL?
    Lambova, Margarita
    [J]. MATHEMATICS AND INFORMATICS, 2021, 64 (02): : 153 - 172
  • [2] Applying Design of Experiments in Testing and Validation of Statistical Software
    King, Caleb B.
    Lekivetz, Ryan A.
    Morgan, Joseph A.
    [J]. 2024 ANNUAL RELIABILITY AND MAINTAINABILITY SYMPOSIUM, RAMS, 2024,
  • [3] On the Testing of Statistical Software
    Ryan Lekivetz
    Joseph Morgan
    [J]. Journal of Statistical Theory and Practice, 2021, 15
  • [4] On the Testing of Statistical Software
    Lekivetz, Ryan
    Morgan, Joseph
    [J]. JOURNAL OF STATISTICAL THEORY AND PRACTICE, 2021, 15 (04)
  • [5] TESTING AND EVALUATION OF STATISTICAL SOFTWARE
    GENTLE, JE
    [J]. LECTURE NOTES IN ECONOMICS AND MATHEMATICAL SYSTEMS, 1982, 199 : 248 - 257
  • [6] PERSPECTIVES ON STATISTICAL SIGNIFICANCE TESTING
    WOOLSON, RF
    KLEINMAN, JC
    [J]. ANNUAL REVIEW OF PUBLIC HEALTH, 1989, 10 : 423 - 440
  • [7] The insignificance of statistical significance testing
    Johnson, DH
    [J]. JOURNAL OF WILDLIFE MANAGEMENT, 1999, 63 (03): : 763 - 772
  • [8] Retirement of statistical significance testing
    Klungel, Olaf
    Rothman, Kenneth J.
    Hillege, Hans
    Fletcher, John
    [J]. PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2020, 29 : 10 - 10
  • [9] STATISTICAL SIGNIFICANCE IN COMPARATIVE ETHOLOGICAL EXPERIMENTS
    HOEKSTRA, JA
    JANSEN, J
    [J]. APPLIED ANIMAL BEHAVIOUR SCIENCE, 1986, 16 (04) : 303 - 308
  • [10] FORECASTING SOFTWARE - A PANACEA
    BEAUMONT, C
    [J]. JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 1985, 36 (12) : 1154 - 1154