Tool to Handle Imbalancing Problem in Software Defect Prediction Using Oversampling Methods

Cited by: 0
Authors
Malhotra, Ruchika [1]
Kamal, Shine [1]
Affiliations
[1] Delhi Technol Univ, Dept Software Engn, Bawana Rd, Delhi 110042, India
Keywords
Defect prediction; Imbalanced data; Machine learning techniques; Oversampling methods; Object oriented metrics; CLASSIFICATION; SMOTE;
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Data imbalance is becoming a common problem in fields such as defect prediction, change prediction, oil spills, and medical diagnosis. Various methods have been developed to handle imbalanced datasets in order to improve the accuracy of prediction models. Many studies have addressed defect prediction on imbalanced datasets, but most of them use the SMOTE oversampling method to handle the imbalanced data problem. Many other oversampling methods help to deal with the imbalance problem and remain unexplored, particularly in the field of software defect prediction. This study develops a tool that implements three of those unexplored oversampling methods, namely ADASYN, SPIDER and Safe-Level-SMOTE. Furthermore, we analyze their performance in comparison with the traditional SMOTE method. The performance of the oversampling methods is evaluated by applying three machine learning techniques for defect prediction using object-oriented metrics. The results are evaluated using two open source defect datasets. The result analysis showed that the prediction error decreased and the performance of the machine learning techniques improved when datasets balanced with the three oversampling methods were used. Further, all three methods outperformed SMOTE, and the SPIDER oversampling method performed best in the majority of cases.
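For readers unfamiliar with the SMOTE baseline named in the abstract, the following is a minimal sketch of its core idea: a synthetic minority sample is placed at a random point on the line segment between a minority sample and one of its k nearest minority neighbours. This is not the authors' tool. The function name `smote_sketch`, its parameters, and the toy metric values are hypothetical and chosen only for illustration; ADASYN, SPIDER and Safe-Level-SMOTE differ in how they choose where to generate samples and are not shown.

```python
import math
import random


def smote_sketch(minority, n_new, k=5, seed=0):
    """Create n_new synthetic minority samples by SMOTE-style interpolation.

    minority: list of equal-length feature vectors (lists of floats).
    Illustrative sketch only; the name and signature are assumptions.
    """
    if len(minority) < 2:
        raise ValueError("need at least two minority samples")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)  # cannot have more neighbours than samples
    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(len(minority))
        base = minority[i]
        # Indices of the k nearest minority neighbours of the chosen sample.
        neighbours = sorted(
            (j for j in range(len(minority)) if j != i),
            key=lambda j: math.dist(base, minority[j]),
        )[:k]
        other = minority[rng.choice(neighbours)]
        gap = rng.random()  # interpolation factor in [0, 1)
        synthetic.append([b + gap * (o - b) for b, o in zip(base, other)])
    return synthetic


# Hypothetical toy data: two metric values per defective (minority) class.
defective = [[1.0, 2.0], [1.5, 2.5], [2.0, 2.0], [1.2, 3.0]]
new_points = smote_sketch(defective, n_new=6, k=2)
```

Because each synthetic point is an interpolation between two existing minority samples, it always lies within the range already spanned by the minority class.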
Pages: 906 - 912
Number of pages: 7