Software Defect Prediction and Localization with Attention-Based Models and Ensemble Learning

被引:5
|
作者
Zhang, Tianhang [1 ]
Du, Qingfeng [1 ]
Xu, Jincheng [1 ]
Li, Jiechu [1 ]
Li, Xiaojun [2 ]
机构
[1] Tongji Univ, Sch Software Engn, Shanghai, Peoples R China
[2] Tongji Univ, Coll Civil Engn, Dept Geotech Engn, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
Software defect prediction; Ensemble learning; Attention model; Deep learning;
D O I
10.1109/APSEC51365.2020.00016
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Software defect prediction (SDP) utilizes a trained prediction model to predict the defect proneness of code modules in a software system by mining the inherent characteristics of historical defect data. An effective model can optimize the allocation of testing resources, thus improving the quality of software products. Most previous studies use handcrafted features to represent code snippets, but the main problem is that it is difficult to capture the semantic and structural information of the code context, which is often crucial for software defect prediction. Meanwhile, most of the existing software defect prediction models cannot make predictions at the code line level, which makes it extremely arduous to provide developers with more detailed reference information. To address these issues, in this paper, we propose a model based on ensemble learning techniques and attention mechanisms to offer more comprehensive prediction information to developers by locating suspect lines of code when making method-level defect predictions. This model leverages abstract syntax trees (ASTs) as the intermediate representation of code snippets. Since the historical defect data has a striking characteristic of classimbalance, an approach based on Self-organizing Map (SOM) clustering is employed to handle noisy data. Experimental results show that, on average, the proposed model improves the F-measure by 17.7% and AUC by 37.8%, compared with the other four machine learning algorithms.
引用
收藏
页码:81 / 90
页数:10
相关论文
共 50 条
  • [31] Dictionary Learning Based Software Defect Prediction
    Jing, Xiao-Yuan
    Ying, Shi
    Zhang, Zhi-Wu
    Wu, Shan-Shan
    Liu, Jin
    [J]. 36TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2014), 2014, : 414 - 423
  • [32] Attention-Based Models for Snow-Water Equivalent Prediction
    Thapa, Krishu K.
    Singh, Bhupinderjeet
    Savalkar, Supriya
    Fern, Alan
    Rajagopalan, Kirti
    Kalyanaraman, Ananth
    [J]. THIRTY-EIGTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 21, 2024, : 22969 - 22975
  • [33] Attention-based deep learning for chip-surface-defect detection
    Wang, Shuo
    Wang, Hongyu
    Yang, Fan
    Liu, Fei
    Zeng, Long
    [J]. INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2022, 121 (3-4): : 1957 - 1971
  • [34] Dual Attention-Based Federated Learning for Wireless Traffic Prediction
    Zhang, Chuanting
    Dang, Shuping
    Shihada, Basem
    Alouini, Mohamed-Slim
    [J]. IEEE CONFERENCE ON COMPUTER COMMUNICATIONS (IEEE INFOCOM 2021), 2021,
  • [35] Learning Attention-based Embeddings for Relation Prediction in Knowledge Graphs
    Nathani, Deepak
    Chauhan, Jatin
    Sharma, Charu
    Kaul, Manohar
    [J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 4710 - 4723
  • [36] Attention-based deep learning for chip-surface-defect detection
    Shuo Wang
    Hongyu Wang
    Fan Yang
    Fei Liu
    Long Zeng
    [J]. The International Journal of Advanced Manufacturing Technology, 2022, 121 : 1957 - 1971
  • [37] Mobile traffic prediction with attention-based hybrid deep learning
    Wang, Li
    Che, Linxiao
    Lam, Kwok-Yan
    Liu, Wenqiang
    Li, Feng
    [J]. PHYSICAL COMMUNICATION, 2024, 66
  • [38] Attention based GRU-LSTM for software defect prediction
    Munir, Hafiz Shahbaz
    Ren, Shengbing
    Mustafa, Mubashar
    Siddique, Chaudry Naeem
    Qayyum, Shazib
    [J]. PLOS ONE, 2021, 16 (03):
  • [39] Air pollution forecasting based on attention-based LSTM neural network and ensemble learning
    Liu, Duen-Ren
    Lee, Shin-Jye
    Huang, Yang
    Chiu, Chien-Ju
    [J]. EXPERT SYSTEMS, 2020, 37 (03)
  • [40] Company Industry Classification with Neural and Attention-Based Learning Models
    Slavov, Stanislav
    Tagarev, Andrey
    Tulechki, Nikola
    Boytcheva, Svetla
    [J]. 2019 BIG DATA, KNOWLEDGE AND CONTROL SYSTEMS ENGINEERING (BDKCSE), 2019,