Using machine learning to support debugging with Tarantula

Cited by: 59

Authors:

Briand, Lionel C. [1]

Labiche, Yvan [2]

Liu, Xuetao [2]

Affiliations:

[1] Univ Oslo, Simula Res Lab, POB 134, Lysaker, Norway

[2] Carleton Univ, Software Qual Engn Lab, Ottawa, ON K1S 5B6, Canada

Source:

ISSRE 2007: 18th IEEE International Symposium on Software Reliability Engineering, Proceedings | 2007

Funding:

Natural Sciences and Engineering Research Council of Canada (NSERC)

DOI:

10.1109/ISSRE.2007.31

Chinese Library Classification (CLC) number:

TP31 [Computer software]

Discipline classification codes:

081202; 0835

Abstract:

Using a specific machine learning technique, this paper proposes a way to identify suspicious statements during debugging. The technique is based on principles similar to Tarantula but addresses its main flaw: its difficulty in dealing with the presence of multiple faults, as it assumes that failing test cases execute the same fault(s). The improvement we present in this paper results from the use of C4.5 decision trees to identify various failure conditions based on information regarding the test cases' inputs and outputs. Failing test cases executing under similar conditions are then assumed to fail due to the same fault(s). Statements are then considered suspicious if they are covered by a large proportion of failing test cases that execute under similar conditions. We report on a case study that demonstrates improvement over the original Tarantula technique in terms of statement ranking. Another contribution of this paper is to show that failure conditions as modeled by a C4.5 decision tree accurately predict failures and can therefore also be used to help debugging.
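The abstract does not give formulas, so the following is only a minimal sketch of the idea it describes. The `tarantula` function uses the commonly cited Tarantula suspiciousness score (the share of failing tests covering a statement, divided by the sum of the shares of failing and passing tests covering it). The `grouped_tarantula` function is an assumption made for illustration: it takes hand-assigned failure-condition labels in place of the leaves of a trained C4.5 tree, scores each group of failing tests separately against all passing tests, and keeps the maximum score per statement. The function names, the `condition_of` mapping, the toy coverage data, and the max-combination step are all hypothetical and are not taken from the paper.

```python
from collections import defaultdict


def tarantula(coverage, failed):
    """Score statements with the commonly cited Tarantula formula.

    coverage: dict mapping test id -> set of covered statement ids.
    failed:   set of failing test ids (must be keys of coverage).
    """
    total_failed = len(failed)
    total_passed = len(coverage) - total_failed
    statements = set().union(*coverage.values())
    scores = {}
    for s in statements:
        f = sum(1 for t, cov in coverage.items() if s in cov and t in failed)
        p = sum(1 for t, cov in coverage.items() if s in cov and t not in failed)
        pct_f = f / total_failed if total_failed else 0.0
        pct_p = p / total_passed if total_passed else 0.0
        denom = pct_f + pct_p
        scores[s] = pct_f / denom if denom else 0.0
    return scores


def grouped_tarantula(coverage, failed, condition_of):
    """Illustrative variant (an assumption, not the paper's algorithm).

    condition_of: dict mapping each failing test id -> failure-condition
    label. Here the labels are assigned by hand; in the paper they would
    come from a C4.5 decision tree over test inputs and outputs.
    """
    groups = defaultdict(set)
    for t in failed:
        groups[condition_of[t]].add(t)
    passing = {t: cov for t, cov in coverage.items() if t not in failed}
    best = defaultdict(float)
    for tests in groups.values():
        # Score this group's failing tests against all passing tests.
        subset = dict(passing)
        subset.update({t: coverage[t] for t in tests})
        for s, score in tarantula(subset, tests).items():
            best[s] = max(best[s], score)  # assumed combination rule
    return dict(best)


# Toy data: statement 3 and statement 4 each hold a different fault.
coverage = {
    "t1": {1, 2, 3},  # passes
    "t2": {1, 3},     # fails (fault in statement 3)
    "t3": {1, 4},     # fails (fault in statement 4)
    "t4": {1, 2, 4},  # passes
}
failed = {"t2", "t3"}
condition_of = {"t2": "A", "t3": "B"}  # hypothetical condition labels

plain = tarantula(coverage, failed)
grouped = grouped_tarantula(coverage, failed, condition_of)
```

On this toy data the plain score ties the two faulty statements with statement 1, which every test executes, while the grouped score ranks both faulty statements above it. That only illustrates the multiple-fault problem named in the abstract; it says nothing about the size of the improvement reported in the case study.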


Pages: 137 / +

Page count: 2
