Feature Selection Techniques to Counter Class Imbalance Problem for Aging Related Bug Prediction

Cited by: 14
Authors
Kumar, Lov [1]
Sureka, Ashish [2]
Affiliations
[1] Thapar Univ, Patiala, Punjab, India
[2] Ashoka Univ, Sonepat, Haryana, India
Keywords
Aging Related Bugs; Imbalance Learning; Empirical Software Engineering; Feature Selection Techniques; Machine Learning; Predictive Modeling; Software Maintenance; Source Code Metrics; Classification; Complexity
DOI
10.1145/3172871.3172872
Chinese Library Classification
TP31 [Computer Software]
Discipline classification code
081202; 0835
Abstract
Aging-Related Bugs (ARBs) occur in long-running systems due to error conditions caused by the accumulation of problems such as memory leakage or unreleased files and locks. Aging-Related Bugs are hard to discover during software testing and are also challenging to replicate. Automatic identification and prediction of aging-related fault-prone files and classes in an object-oriented system can help the software quality assurance team optimize its testing efforts. In this paper, we present a study on the application of static source code metrics and machine learning techniques to predict aging-related bugs. We conduct a series of experiments on publicly available datasets from two large open-source software systems: Linux and MySQL. Class imbalance and high dimensionality are the two main technical challenges in building effective predictors for aging-related bugs. We investigate the application of five different feature selection techniques (OneR, Information Gain, Gain Ratio, ReliefF and Symmetric Uncertainty) for dimensionality reduction and five different strategies (Random Under-sampling, Random Over-sampling, SMOTE, SMOTEBoost and RUSBoost) to counter the effect of class imbalance in our proposed machine-learning-based solution approach. Experimental results reveal that the random under-sampling approach performs best, followed by RUSBoost, in terms of the mean AUC metric. A statistical significance test demonstrates that there is a significant difference between the performance of the various feature selection techniques. Experimental results show that Gain Ratio and ReliefF perform best in comparison to the other strategies to address the class imbalance problem. We infer from the statistical significance test that there is no difference between the performance of the five different learning algorithms.
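Two of the techniques named in the abstract, Gain Ratio feature scoring and random under-sampling, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names are hypothetical, the code assumes each metric has already been discretized into a small set of values, and under-sampling is taken here to mean reducing every class to the size of the rarest one.

```python
import math
import random
from collections import Counter


def entropy(values):
    """Shannon entropy (in bits) of a sequence of discrete values."""
    total = len(values)
    return -sum((n / total) * math.log2(n / total) for n in Counter(values).values())


def information_gain(feature, labels):
    """Reduction in label entropy after splitting on a discrete feature."""
    total = len(labels)
    groups = {}
    for value, label in zip(feature, labels):
        groups.setdefault(value, []).append(label)
    remainder = sum(len(g) / total * entropy(g) for g in groups.values())
    return entropy(labels) - remainder


def gain_ratio(feature, labels):
    """Information gain normalized by the entropy of the feature itself."""
    split_info = entropy(feature)
    if split_info <= 0:
        return 0.0  # a constant feature carries no information
    return information_gain(feature, labels) / split_info


def random_under_sample(rows, labels, seed=0):
    """Randomly drop rows so each class keeps as many rows as the rarest class."""
    rng = random.Random(seed)
    by_class = {}
    for row, label in zip(rows, labels):
        by_class.setdefault(label, []).append(row)
    target = min(len(members) for members in by_class.values())
    sampled = []
    for label, members in by_class.items():
        sampled.extend((row, label) for row in rng.sample(members, target))
    rng.shuffle(sampled)
    return [row for row, _ in sampled], [label for _, label in sampled]
```

In a pipeline of the kind the abstract describes, one would typically rank the metrics by `gain_ratio`, keep the top-scoring ones, and apply `random_under_sample` to the training split only, so that the test split keeps its original class distribution.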
Pages: 11