Software Defect Prediction and Localization with Attention-Based Models and Ensemble Learning

Cited by: 5
Authors
Zhang, Tianhang [1]
Du, Qingfeng [1]
Xu, Jincheng [1]
Li, Jiechu [1]
Li, Xiaojun [2]
Affiliations
[1] Tongji Univ, Sch Software Engn, Shanghai, Peoples R China
[2] Tongji Univ, Coll Civil Engn, Dept Geotech Engn, Shanghai, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Software defect prediction; Ensemble learning; Attention model; Deep learning;
DOI
10.1109/APSEC51365.2020.00016
Chinese Library Classification
TP31 [Computer Software]
Discipline classification codes
081202; 0835
Abstract
Software defect prediction (SDP) uses a model trained on historical defect data to predict the defect proneness of code modules in a software system. An effective model can optimize the allocation of testing resources and thereby improve the quality of software products. Most previous studies represent code snippets with handcrafted features, which struggle to capture the semantic and structural information of the code context that is often crucial for software defect prediction. Moreover, most existing software defect prediction models cannot make predictions at the code line level, so they offer developers little detailed reference information. To address these issues, we propose a model based on ensemble learning techniques and attention mechanisms that gives developers more comprehensive prediction information by locating suspect lines of code when making method-level defect predictions. The model uses abstract syntax trees (ASTs) as the intermediate representation of code snippets. Since historical defect data is strikingly class-imbalanced, an approach based on Self-organizing Map (SOM) clustering is employed to handle noisy data. Experimental results show that, on average, the proposed model improves the F-measure by 17.7% and AUC by 37.8% compared with four other machine learning algorithms.
Pages: 81-90
Number of pages: 10