A neighborhood rough sets-based ensemble method, with application to software fault prediction

被引:0
|
作者
Jiang, Feng [1 ]
Hu, Qiang [1 ]
Yang, Zhiyong [1 ]
Liu, Jinhuan [2 ]
Du, Junwei [2 ]
机构
[1] Qingdao Univ Sci & Technol, Coll Informat Sci & Technol, Qingdao 266061, Peoples R China
[2] Qingdao Univ Sci & Technol, Sch Data Sci, Qingdao 266061, Peoples R China
关键词
Ensemble learning; Software fault prediction; Neighborhood rough sets; Reduct; Neighborhood approximate reduct; Imbalanced data; SYSTEM;
D O I
10.1016/j.eswa.2024.125919
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Software fault prediction (SFP) aims to detect fault-prone software modules, which is beneficial for allocating software testing resources and improving software quality. Recently, ensemble learning(EL)-based SFP methods have attracted much attention. Although many EL algorithms have been applied to SFP, they are still insufficient to generate multiple accurate and diverse base learners. Therefore, this paper presents a multi-modal EL algorithm (called NRSEL) based on neighborhood rough sets. In NRSEL, the technique of neighborhood approximate reduct (NAR) is used to implement the perturbation of attribute space and the bootstrap sampling technique is used to implement the perturbation of sample space. Asa novel technique for the perturbation of attribute space, NAR stems from the concept of approximate reduct in rough sets. We also consider the application of NRSEL to SFP, and employ a hybrid scheme (called SMOTE-NRSEL) to handle the problem of imbalanced data in SFP. We compare SMOTE-NRSEL with existing EL algorithms using 20 public datasets. Experimental results indicate that SMOTE-NRSEL is effective for SFP. Compared with the baseline algorithms, on average, SMOTE-NRSEL improves the AUC, F1-score, and MCC by 3.09%, 3.18%, and 7.5%, respectively. Moreover, the results of three statistical tests (including the paired t-test, Friedman test, and Nemenyi test) indicate that SMOTE-NRSEL is significantly better than the baseline algorithms inmost cases. This paper shows that NAR is a good choice for the perturbation of attribute space. With the help of NAR and the multi-modal perturbation strategy based on it, SMOTE-NRSEL can generate accurate and diverse base learners. The code is available at https://github.com/jiangfeng0278/NRSEL.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] A Neighborhood Rough Sets-Based Attribute Reduction Method Using Lebesgue and Entropy Measures
    Sun, Lin
    Wang, Lanying
    Xu, Jiucheng
    Zhang, Shiguang
    ENTROPY, 2019, 21 (02)
  • [3] WalkNAR: A neighborhood rough sets-based attribute reduction approach using random walk
    Li, Haibo
    Xiong, Wuyang
    Li, Yanbin
    Xie, Xiaojun
    APPLIED INTELLIGENCE, 2024, : 7099 - 7117
  • [4] Rough sets-based prediction model for increasing safety of thermal power plants
    Brtka, Vladimir
    Makitan, Vesna
    Radovanovic, Ljiljana
    Zivkovic, Zoran
    Momcilovic, Oliver
    ENERGY SOURCES PART B-ECONOMICS PLANNING AND POLICY, 2019, 14 (03) : 67 - 79
  • [5] Rough sets-based image processing for deinterlacing
    Jeon, Gwanggil
    Jeong, Jechang
    SIGNAL PROCESSING FOR IMAGE ENHANCEMENT AND MULTIMEDIA PROCESSING, 2008, : 227 - 239
  • [6] The application of the new discretization method of fault diagnosis based on rough sets
    Lu Peng
    Wang Xi-huai
    Xiao Jian-mei
    2009 THIRD INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION, VOL 3, PROCEEDINGS, 2009, : 388 - 391
  • [7] Rough Noise-Filtered Easy Ensemble for software Fault Prediction
    Riaz, Saman
    Arshad, Ali
    Jiao, Licheng
    IEEE ACCESS, 2018, 6 : 46886 - 46899
  • [8] Fault diagnosis method for nuclear power plant based on decision tree and neighborhood rough sets
    Mu, Yu
    Xia, Hong
    Liu, Yong-Kuo
    Yuanzineng Kexue Jishu/Atomic Energy Science and Technology, 2011, 45 (01): : 44 - 47
  • [9] Granular Ball Fuzzy Neighborhood Rough Sets-Based Feature Selection via Multiobjective Mayfly Optimization
    Sun, Lin
    Liang, Hanbo
    Ding, Weiping
    Xu, Jiucheng
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2024, 32 (11) : 6112 - 6124
  • [10] A rough sets based pruning method for bagging ensemble
    MIAO DuoqianWANG RuizhiDUAN QiguoLIU JimingDepartment of Computer Science and TechnologyTongji UniversityShanghai PRChinaComputer Science DepartmentHong Kong Baptist UniversityKowloon TongHong Kong SAR
    重庆邮电大学学报(自然科学版), 2008, (03) : 372 - 378