An Empirical Study on Just-in-time Conformal Defect Prediction

被引:0
|
作者
Shahini, Xhulja [1 ]
Metzger, Andreas [1 ]
Pohl, Klaus [1 ]
机构
[1] Paluno Univ Duisburg Essen, Essen, Germany
关键词
Defect prediction; quality assurance; conformal prediction; machine learning; deep learning; correctness guarantees; uncertainty;
D O I
10.1145/3643991.3644928
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Code changes can introduce defects that affect software quality and reliability. Just-in-time ( JIT) defect prediction techniques provide feedback at check-in time on whether a code change is likely to contain defects. This immediate feedback allows practitioners to make timely decisions regarding potential defects. However, a prediction model may deliver false predictions, that may negatively affect practitioners' decisions. False positive predictions lead to unnecessarily spending resources on investigating clean code changes, while false negative predictions may result in overlooking defective changes. Knowing how uncertain a defect prediction is, would help practitioners to avoid wrong decisions. Previous research in defect prediction explored different approaches to quantify prediction uncertainty for supporting decision-making activities. However, these approaches only offer a heuristic quantification of uncertainty and do not provide guarantees. In this study, we use conformal prediction (CP) as a rigorous uncertainty quantification approach on top of JIT defect predictors. We assess how often CP can provide guarantees for JIT defect predictions. We also assess how many false JIT defect predictions CP can filter out. We experiment with two state-of-the-art JIT defect prediction techniques (DeepJIT and CC2Vec) and two widely used datasets (Qt and OpenStack). Our experiments show that CP can ensure correctness with a 95% probability, for only 27% (for DeepJIT) and 9% (for CC2Vec) of the JIT defect predictions. Additionally, our experiments indicate that CP might be a valuable technique for filtering out the false predictions of JIT defect predictors. CP can filter out up to 100% of false negative predictions and 90% of false positives generated by CC2Vec, and up to 86% of false negative predictions and 83% of false positives generated by DeepJIT.
引用
收藏
页码:88 / 99
页数:12
相关论文
共 50 条
  • [31] An empirical study of the impact of just-in-time task scope versus just-in-time workflow integration on organizational design
    Germain, R
    Droge, C
    DECISION SCIENCES, 1997, 28 (03) : 615 - 635
  • [32] Poster: Bridging Effort-Aware Prediction and Strong Classification - a Just-in-Time Software Defect Prediction Study
    Guo, Yuchen
    Shepperd, Martin
    Li, Ning
    PROCEEDINGS 2018 IEEE/ACM 40TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING - COMPANION (ICSE-COMPANION, 2018, : 325 - 326
  • [33] A Study on Just-in-Time Implementation in Portugal: Some Empirical Evidence
    Oliveira Alves Moreira, Maria Rosario Mota
    Ferreira dos Santos Alves, Rui Alberto
    BRAZILIAN JOURNAL OF OPERATIONS & PRODUCTION MANAGEMENT, 2008, 5 (01): : 5 - 21
  • [34] A Practical Human Labeling Method for Online Just-in-Time Software Defect Prediction
    Song, Liyan
    Minku, Leandro Lei
    Teng, Cong
    Yao, Xin
    PROCEEDINGS OF THE 31ST ACM JOINT MEETING EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING, ESEC/FSE 2023, 2023, : 605 - 617
  • [35] Just-in-time defect prediction for mobile applications: using shallow or deep learning?
    Raymon van Dinter
    Cagatay Catal
    Görkem Giray
    Bedir Tekinerdogan
    Software Quality Journal, 2023, 31 : 1281 - 1302
  • [36] Studying just-in-time defect prediction using cross-project models
    Yasutaka Kamei
    Takafumi Fukushima
    Shane McIntosh
    Kazuhiro Yamashita
    Naoyasu Ubayashi
    Ahmed E. Hassan
    Empirical Software Engineering, 2016, 21 : 2072 - 2106
  • [37] Just-in-time defect prediction for mobile applications: using shallow or deep learning?
    van Dinter, Raymon
    Catal, Cagatay
    Giray, Goerkem
    Tekinerdogan, Bedir
    SOFTWARE QUALITY JOURNAL, 2023, 31 (04) : 1281 - 1302
  • [38] Class Imbalance Evolution and Verification Latency in Just-in-Time Software Defect Prediction
    Cabral, George G.
    Minku, Leandro L.
    Shihab, Emad
    Mujahid, Suhaib
    2019 IEEE/ACM 41ST INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2019), 2019, : 666 - 676
  • [39] Just-in-time software defect prediction using deep temporal convolutional networks
    Ardimento, Pasquale
    Aversano, Lerina
    Bernardi, Mario Luca
    Cimitile, Marta
    Iammarino, Martina
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (05): : 3981 - 4001
  • [40] Effort-Aware semi-Supervised just-in-Time defect prediction
    Li, Weiwei
    Zhang, Wenzhou
    Jia, Xiuyi
    Huang, Zhiqiu
    INFORMATION AND SOFTWARE TECHNOLOGY, 2020, 126