An Empirical Study on Just-in-time Conformal Defect Prediction

被引：0

作者：

Shahini, Xhulja ^{[1
]}

Metzger, Andreas ^{[1
]}

Pohl, Klaus ^{[1
]}

机构：

[1] Paluno Univ Duisburg Essen, Essen, Germany

来源：

2024 IEEE/ACM 21ST INTERNATIONAL CONFERENCE ON MINING SOFTWARE REPOSITORIES, MSR | 2024年

关键词：

Defect prediction; quality assurance; conformal prediction; machine learning; deep learning; correctness guarantees; uncertainty;

D O I：

10.1145/3643991.3644928

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Code changes can introduce defects that affect software quality and reliability. Just-in-time ( JIT) defect prediction techniques provide feedback at check-in time on whether a code change is likely to contain defects. This immediate feedback allows practitioners to make timely decisions regarding potential defects. However, a prediction model may deliver false predictions, that may negatively affect practitioners' decisions. False positive predictions lead to unnecessarily spending resources on investigating clean code changes, while false negative predictions may result in overlooking defective changes. Knowing how uncertain a defect prediction is, would help practitioners to avoid wrong decisions. Previous research in defect prediction explored different approaches to quantify prediction uncertainty for supporting decision-making activities. However, these approaches only offer a heuristic quantification of uncertainty and do not provide guarantees. In this study, we use conformal prediction (CP) as a rigorous uncertainty quantification approach on top of JIT defect predictors. We assess how often CP can provide guarantees for JIT defect predictions. We also assess how many false JIT defect predictions CP can filter out. We experiment with two state-of-the-art JIT defect prediction techniques (DeepJIT and CC2Vec) and two widely used datasets (Qt and OpenStack). Our experiments show that CP can ensure correctness with a 95% probability, for only 27% (for DeepJIT) and 9% (for CC2Vec) of the JIT defect predictions. Additionally, our experiments indicate that CP might be a valuable technique for filtering out the false predictions of JIT defect predictors. CP can filter out up to 100% of false negative predictions and 90% of false positives generated by CC2Vec, and up to 86% of false negative predictions and 83% of false positives generated by DeepJIT.

引用

页码：88 / 99

页数：12

共 50 条

[31] An empirical study of the impact of just-in-time task scope versus just-in-time workflow integration on organizational design
Germain, R
Droge, C
DECISION SCIENCES, 1997, 28 (03) : 615 - 635
[32] Poster: Bridging Effort-Aware Prediction and Strong Classification - a Just-in-Time Software Defect Prediction Study
Guo, Yuchen
Shepperd, Martin
Li, Ning
PROCEEDINGS 2018 IEEE/ACM 40TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING - COMPANION (ICSE-COMPANION, 2018, : 325 - 326
[33] A Study on Just-in-Time Implementation in Portugal: Some Empirical Evidence
Oliveira Alves Moreira, Maria Rosario Mota
Ferreira dos Santos Alves, Rui Alberto
BRAZILIAN JOURNAL OF OPERATIONS & PRODUCTION MANAGEMENT, 2008, 5 (01): : 5 - 21
[34] A Practical Human Labeling Method for Online Just-in-Time Software Defect Prediction
Song, Liyan
Minku, Leandro Lei
Teng, Cong
Yao, Xin
PROCEEDINGS OF THE 31ST ACM JOINT MEETING EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING, ESEC/FSE 2023, 2023, : 605 - 617
[35] Just-in-time defect prediction for mobile applications: using shallow or deep learning?
Raymon van Dinter
Cagatay Catal
Görkem Giray
Bedir Tekinerdogan
Software Quality Journal, 2023, 31 : 1281 - 1302
[36] Studying just-in-time defect prediction using cross-project models
Yasutaka Kamei
Takafumi Fukushima
Shane McIntosh
Kazuhiro Yamashita
Naoyasu Ubayashi
Ahmed E. Hassan
Empirical Software Engineering, 2016, 21 : 2072 - 2106
[37] Just-in-time defect prediction for mobile applications: using shallow or deep learning?
van Dinter, Raymon
Catal, Cagatay
Giray, Goerkem
Tekinerdogan, Bedir
SOFTWARE QUALITY JOURNAL, 2023, 31 (04) : 1281 - 1302
[38] Class Imbalance Evolution and Verification Latency in Just-in-Time Software Defect Prediction
Cabral, George G.
Minku, Leandro L.
Shihab, Emad
Mujahid, Suhaib
2019 IEEE/ACM 41ST INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2019), 2019, : 666 - 676
[39] Just-in-time software defect prediction using deep temporal convolutional networks
Ardimento, Pasquale
Aversano, Lerina
Bernardi, Mario Luca
Cimitile, Marta
Iammarino, Martina
NEURAL COMPUTING & APPLICATIONS, 2022, 34 (05): : 3981 - 4001
[40] Effort-Aware semi-Supervised just-in-Time defect prediction
Li, Weiwei
Zhang, Wenzhou
Jia, Xiuyi
Huang, Zhiqiu
INFORMATION AND SOFTWARE TECHNOLOGY, 2020, 126

← 1 2 3 4 5 →