A novel software defect prediction based on atomic class-association rule mining

被引:36
|
作者
Shao, Yuanxun [1 ,2 ]
Liu, Bin [1 ,2 ]
Wang, Shihai [1 ,2 ]
Li, Guoqi [1 ,2 ]
机构
[1] Beihang Univ, Sch Reliabil & Syst Engn, Beijing 100191, Peoples R China
[2] Sci & Technol Reliabil & Environm Engn Lab, Beijing, Peoples R China
关键词
Software defect prediction; Data mining; Association rules; Apriori; Machine learning; ATTRIBUTE SELECTION; FAULT PREDICTION; CLASSIFICATION; MODELS; SYSTEM;
D O I
10.1016/j.eswa.2018.07.042
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To ensure the rational allocation of software testing resources and reduce costs, software defect prediction has drawn notable attention to many "white-box" and "black-box" classification algorithms. Although there have been lots of studies on using software product metrics to identify defect-prone modules, defect prediction algorithms are still worth exploring. For instance, it is not easy to directly implement the Apriori algorithm to classify defect-prone modules across a skewed dataset. Therefore, we propose a novel supervised approach for software defect prediction based on atomic class-association rule mining (ACAR). It holds the characteristics of only one feature of the antecedent and a unique class label of the consequent, which is a specific kind of association rules that explores the relationship between attributes and categories. It holds the characteristics of only one feature of the antecedent and a unique class label of the consequent, which is a specific kind of association rules that explores the relationship between attributes and categories. Such association patterns can provide meaningful knowledge that can be easily understood by software engineers. A new software defect prediction model infrastructure based on association rules is employed to improve the prediction of defect-prone modules, which is divided into data preprocessing, rule model building and performance evaluation. Moreover, ACAR can achieve a satisfactory classification performance compared with other seven benchmark learners (the extension of classification based on associations (CBA2), Support Vector Machine, Naive Bayesian, Decision Tree, OneR, K-nearest Neighbors and RIPPER) on NASA MDP and PROMISE datasets. In light of software defect associative prediction, a comparative experiment between ACAR and CBA2 is discussed in details. It is demonstrated that ACAR is better than CBA2 in terms of AUC, G-mean, Balance, and understandability. In addition, the average AUC of ACAR is increased by 2.9% compared with CBA2, which can reach 81.1%. (C) 2018 Elsevier Ltd. All rights reserved.
引用
收藏
页码:237 / 254
页数:18
相关论文
共 50 条
  • [1] Software Defect Prediction Based on Class-Association Rules
    Shao, Yuanxun
    Liu, Bin
    Li, Guoqi
    Wang, Shihai
    [J]. PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON RELIABILITY SYSTEMS ENGINEERING (ICRSE 2017), 2017,
  • [2] Software defect prediction based on correlation weighted class association rule mining
    Shao, Yuanxun
    Liu, Bin
    Wang, Shihai
    Li, Guoqi
    [J]. KNOWLEDGE-BASED SYSTEMS, 2020, 196
  • [3] A novel software defect prediction approach via weighted classification based on association rule mining
    Wu, Wentao
    Wang, Shihai
    Liu, Bin
    Shao, Yuanxun
    Xie, Wandong
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 129
  • [4] Software defect prediction using relational association rule mining
    Czibula, Gabriela
    Marian, Zsuzsanna
    Czibula, Istvan Gergely
    [J]. INFORMATION SCIENCES, 2014, 264 : 260 - 278
  • [5] SOFTWARE DEFECT PREDICTION USING INTERTRANSACTION ASSOCIATION RULE MINING
    Chang, Ching-Pao
    Chu, Chih-Ping
    [J]. INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2009, 19 (06) : 747 - 764
  • [6] Mining Quantitative Class-Association Rules for Software Size Estimation
    Moreno, Maria N.
    Lucas, Joel P.
    Segrera, Saddys
    Lopez, Vivian F.
    [J]. 2009 24TH INTERNATIONAL SYMPOSIUM ON COMPUTER AND INFORMATION SCIENCES, 2009, : 199 - 204
  • [7] Software Defect Prediction Based on Association Rule Classification
    Ma, Baojun
    Dejaeger, Karel
    Vanthienen, Jan
    Baesens, Bart
    [J]. ELECTRONIC-BUSINESS INTELLIGENCE: FOR CORPORATE COMPETITIVE ADVANTAGES IN THE AGE OF EMERGING TECHNOLOGIES & GLOBALIZATION, 2010, 14 : 396 - +
  • [8] Mining Class-Association Rules with Constraints
    Dang Nguyen
    Bay Vo
    [J]. KNOWLEDGE AND SYSTEMS ENGINEERING (KSE 2013), VOL 2, 2014, 245 : 307 - 318
  • [9] A Software Defect Prediction Classifier based on Three Minimum Support Threshold Association Rule Mining
    Wu, Wentao
    Wang, Shihai
    Shao, Yuanxun
    Zhang, Mingxing
    Xie, Wandong
    [J]. 2022 IEEE 22ND INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY, AND SECURITY COMPANION, QRS-C, 2022, : 278 - 282
  • [10] Mining Normal and Abnormal Class-Association Rules
    Viet Phan-Luong
    [J]. 2013 IEEE 27TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS (AINA), 2013, : 968 - 975