The Prevalence of Errors in Machine Learning Experiments

Cited by: 6
Authors
Shepperd, Martin [1]
Guo, Yuchen [2]
Li, Ning [3]
Arzoky, Mahir [1]
Capiluppi, Andrea [1]
Counsell, Steve [1]
Destefanis, Giuseppe [1]
Swift, Stephen [1]
Tucker, Allan [1]
Yousefi, Leila [1]
Affiliations
[1] Brunel University London, London, England
[2] Xi'an Jiaotong University, Xi'an, People's Republic of China
[3] Northwestern Polytechnical University, Xi'an, People's Republic of China
Keywords
Classifier; Computational experiment; Reliability; Error
DOI
10.1007/978-3-030-33607-3_12
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Context: Conducting experiments is central to machine learning research in order to benchmark, evaluate and compare learning algorithms. Consequently, it is important that we conduct reliable, trustworthy experiments. Objective: We investigate the incidence of errors in a sample of machine learning experiments in the domain of software defect prediction. Our focus is on simple arithmetical and statistical errors. Method: We analyse 49 papers describing 2456 individual experimental results from a previously undertaken systematic review comparing supervised and unsupervised defect prediction classifiers. We extract the confusion matrices and test for relevant constraints, e.g., the marginal probabilities must sum to one. We also check for multiple statistical significance testing errors. Results: We find that a total of 22 out of 49 papers contain demonstrable errors. Of these, 7 were statistical and 16 related to confusion matrix inconsistency (one paper contained both classes of error). Conclusions: Whilst some errors may be of a relatively trivial nature, e.g., transcription errors, their presence does not engender confidence. We strongly urge researchers to follow open science principles so that errors can be more easily detected and corrected, and so that, as a community, we reduce this worryingly high error rate in our computational experiments.
Pages: 102-109
Page count: 8
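
The kind of confusion matrix consistency check mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' actual procedure: the class name ConfusionMatrix, the helper functions, and the rounding tolerances are all hypothetical and chosen only for illustration. The sketch assumes a binary classifier, tests that a matrix expressed as proportions sums to one, and flags reported metrics that cannot be reproduced from the published cell counts.

```python
import math
from dataclasses import dataclass


@dataclass
class ConfusionMatrix:
    """Binary confusion matrix cells (hypothetical helper, counts or proportions)."""
    tp: float
    fp: float
    fn: float
    tn: float


def proportions_sum_to_one(cm, tol=1e-6):
    """For a matrix given as proportions, the four cells must sum to one."""
    return math.isclose(cm.tp + cm.fp + cm.fn + cm.tn, 1.0, abs_tol=tol)


def inconsistent_metrics(cm, reported, tol=0.005):
    """Return the names of reported metrics that disagree with the matrix.

    `tol` is an assumed allowance for rounding to two decimal places.
    """
    derived = {}
    if cm.tp + cm.fp > 0:
        derived["precision"] = cm.tp / (cm.tp + cm.fp)
    if cm.tp + cm.fn > 0:
        derived["recall"] = cm.tp / (cm.tp + cm.fn)
    total = cm.tp + cm.fp + cm.fn + cm.tn
    if total > 0:
        derived["accuracy"] = (cm.tp + cm.tn) / total
    # Only metrics that can be derived from the matrix are compared.
    return sorted(
        name
        for name, value in reported.items()
        if name in derived and abs(derived[name] - value) > tol
    )


# Illustrative values only; they are not taken from any surveyed paper.
counts = ConfusionMatrix(tp=40, fp=10, fn=20, tn=30)
reported = {"precision": 0.80, "recall": 0.67, "accuracy": 0.75}
flagged = inconsistent_metrics(counts, reported)
print(flagged)
```

With these made-up numbers, precision and recall agree with the matrix to within rounding, while the reported accuracy of 0.75 does not match the 0.70 implied by the counts, so only accuracy is flagged.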