Using machine learning to support debugging with Tarantula

Cited by: 59
Authors
Briand, Lionel C. [1 ]
Labiche, Yvan [2 ]
Liu, Xuetao [2 ]
Affiliations
[1] Univ Oslo, Simula Res Lab, POB 134, Lysaker, Norway
[2] Carleton Univ, Software Qual Engn Lab, Ottawa, ON K1S 5B6, Canada
Funding
Natural Sciences and Engineering Research Council of Canada
DOI
10.1109/ISSRE.2007.31
Chinese Library Classification number
TP31 [Computer Software]
Subject classification codes
081202; 0835
Abstract
Using a specific machine learning technique, this paper proposes a way to identify suspicious statements during debugging. The technique is based on principles similar to Tarantula but addresses its main flaw: its difficulty in dealing with multiple faults, since it assumes that failing test cases execute the same fault(s). The improvement we present in this paper results from the use of C4.5 decision trees to identify various failure conditions based on information regarding the test cases' inputs and outputs. Failing test cases executing under similar conditions are then assumed to fail due to the same fault(s). Statements are then considered suspicious if they are covered by a large proportion of failing test cases that execute under similar conditions. We report on a case study that demonstrates improvement over the original Tarantula technique in terms of statement ranking. Another contribution of this paper is to show that failure conditions as modeled by a C4.5 decision tree accurately predict failures and can therefore also be used to help debugging.
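The ranking idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it uses the commonly cited Tarantula suspiciousness ratio, takes the failure-condition groups as given (in the paper they would come from a C4.5 decision tree, which is not reproduced here), and combines per-group scores by taking the maximum, which is an assumption made only for illustration. The function names, test ids, and statement ids are all hypothetical.

```python
from collections import defaultdict


def tarantula(coverage, passed_tests, failed_tests):
    """Return {statement: suspiciousness} using the standard Tarantula ratio:
    (failed(s)/total_failed) / (passed(s)/total_passed + failed(s)/total_failed).
    """
    statements = set()
    for t in list(passed_tests) + list(failed_tests):
        statements |= coverage[t]

    scores = {}
    for s in statements:
        p = sum(1 for t in passed_tests if s in coverage[t])
        f = sum(1 for t in failed_tests if s in coverage[t])
        p_ratio = p / len(passed_tests) if passed_tests else 0.0
        f_ratio = f / len(failed_tests) if failed_tests else 0.0
        denom = p_ratio + f_ratio
        scores[s] = f_ratio / denom if denom > 0 else 0.0
    return scores


def grouped_tarantula(coverage, passed_tests, failed_groups):
    """Score each group of failing tests separately against all passing tests.

    failed_groups maps a failure-condition label to its failing test ids.
    Combining groups with max() is an assumption of this sketch, not
    something stated in the abstract.
    """
    combined = defaultdict(float)
    for failing in failed_groups.values():
        for s, score in tarantula(coverage, passed_tests, failing).items():
            combined[s] = max(combined[s], score)
    return dict(combined)


# Hypothetical toy data: two faults, one in s2 and one in s3.
coverage = {
    "p1": {"s1"},
    "p2": {"s1", "s3", "s4"},
    "f1": {"s1", "s2"},
    "f2": {"s1", "s2"},
    "f3": {"s1", "s3"},
}
passed = ["p1", "p2"]

# Plain Tarantula: all failing tests pooled together.
plain = tarantula(coverage, passed, ["f1", "f2", "f3"])

# Grouped variant: failing tests split by (assumed) failure condition.
grouped = grouped_tarantula(coverage, passed, {"A": ["f1", "f2"], "B": ["f3"]})

for name, scores in (("plain", plain), ("grouped", grouped)):
    ranking = sorted(scores, key=lambda s: (-scores[s], s))
    print(name, ranking)
```

In this toy data, pooling all failing tests ranks the faulty statement s3 below the always-executed statement s1, because only one of the three failing tests covers s3. Scoring the failing tests of condition B on their own moves s3 above s1, which is the multiple-fault effect the abstract describes.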
Pages: 137 / +
Page count: 2