A neighborhood rough sets-based ensemble method, with application to software fault prediction

Cited by: 0
Authors
Jiang, Feng [1 ]
Hu, Qiang [1 ]
Yang, Zhiyong [1 ]
Liu, Jinhuan [2 ]
Du, Junwei [2 ]
Affiliations
[1] Qingdao Univ Sci & Technol, Coll Informat Sci & Technol, Qingdao 266061, Peoples R China
[2] Qingdao Univ Sci & Technol, Sch Data Sci, Qingdao 266061, Peoples R China
Keywords
Ensemble learning; Software fault prediction; Neighborhood rough sets; Reduct; Neighborhood approximate reduct; Imbalanced data; SYSTEM;
DOI
10.1016/j.eswa.2024.125919
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Software fault prediction (SFP) aims to detect fault-prone software modules, which is beneficial for allocating software testing resources and improving software quality. Recently, ensemble learning (EL)-based SFP methods have attracted much attention. Although many EL algorithms have been applied to SFP, they are still insufficient to generate multiple accurate and diverse base learners. Therefore, this paper presents a multi-modal EL algorithm (called NRSEL) based on neighborhood rough sets. In NRSEL, the technique of neighborhood approximate reduct (NAR) is used to implement the perturbation of attribute space, and the bootstrap sampling technique is used to implement the perturbation of sample space. As a novel technique for the perturbation of attribute space, NAR stems from the concept of approximate reduct in rough sets. We also consider the application of NRSEL to SFP, and employ a hybrid scheme (called SMOTE-NRSEL) to handle the problem of imbalanced data in SFP. We compare SMOTE-NRSEL with existing EL algorithms using 20 public datasets. Experimental results indicate that SMOTE-NRSEL is effective for SFP. Compared with the baseline algorithms, on average, SMOTE-NRSEL improves the AUC, F1-score, and MCC by 3.09%, 3.18%, and 7.5%, respectively. Moreover, the results of three statistical tests (including the paired t-test, Friedman test, and Nemenyi test) indicate that SMOTE-NRSEL is significantly better than the baseline algorithms in most cases. This paper shows that NAR is a good choice for the perturbation of attribute space. With the help of NAR and the multi-modal perturbation strategy based on it, SMOTE-NRSEL can generate accurate and diverse base learners. The code is available at https://github.com/jiangfeng0278/NRSEL.
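The multi-modal perturbation idea in the abstract (perturb the sample space by bootstrap sampling, perturb the attribute space per base learner, then combine the learners) can be illustrated with a minimal sketch. It is not the authors' NRSEL implementation: a random attribute subset stands in for the neighborhood approximate reduct, a nearest-centroid classifier stands in for the unspecified base learner, SMOTE is omitted, and all names (`NearestCentroid`, `train_ensemble`, `predict`, `X_toy`, `y_toy`) are hypothetical.

```python
import random
from collections import Counter


class NearestCentroid:
    """Tiny stand-in base learner (an assumption; the abstract names none)."""

    def fit(self, X, y):
        sums, counts = {}, Counter(y)
        for row, label in zip(X, y):
            acc = sums.setdefault(label, [0.0] * len(row))
            for j, value in enumerate(row):
                acc[j] += value
        # One centroid per class: the mean of that class's training rows.
        self.centroids = {c: [s / counts[c] for s in acc] for c, acc in sums.items()}
        return self

    def predict_one(self, row):
        def sq_dist(label):
            return sum((a - b) ** 2 for a, b in zip(row, self.centroids[label]))
        return min(self.centroids, key=sq_dist)


def train_ensemble(X, y, n_learners=15, n_attrs=2, seed=0):
    """Train base learners, each on a bootstrap sample and an attribute subset."""
    rng = random.Random(seed)
    ensemble = []
    for _ in range(n_learners):
        # Sample-space perturbation: draw len(X) rows with replacement.
        rows = [rng.randrange(len(X)) for _ in range(len(X))]
        # Attribute-space perturbation: a RANDOM subset, used here only as a
        # placeholder for the neighborhood approximate reduct (NAR).
        attrs = sorted(rng.sample(range(len(X[0])), n_attrs))
        Xb = [[X[i][j] for j in attrs] for i in rows]
        yb = [y[i] for i in rows]
        ensemble.append((attrs, NearestCentroid().fit(Xb, yb)))
    return ensemble


def predict(ensemble, row):
    """Combine base learners by majority vote (an assumed combination rule)."""
    votes = Counter(m.predict_one([row[j] for j in attrs]) for attrs, m in ensemble)
    return votes.most_common(1)[0][0]


# Toy data: label 1 marks a "fault-prone" module, label 0 a clean one.
X_toy = [
    [0.0, 0.5, 1.0], [1.0, 0.0, 0.5], [0.5, 1.0, 0.0], [0.2, 0.3, 0.4],
    [9.0, 10.0, 11.0], [10.0, 9.5, 9.0], [11.0, 10.5, 10.0], [9.5, 9.0, 10.5],
]
y_toy = [0, 0, 0, 0, 1, 1, 1, 1]
toy_ensemble = train_ensemble(X_toy, y_toy)
```

Calling `predict(toy_ensemble, row)` on a new three-attribute row returns the majority label. Because every learner sees both different rows and different attributes, the base learners disagree more than they would under bootstrap sampling alone, which is the diversity argument the abstract makes for combining the two perturbations.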
Pages: 17