Support Vector based Oversampling Technique for Handling Class Imbalance in Software Defect Prediction

被引:1
|
作者
Malhotra, Ruchika [1 ]
Agrawal, Vaibhav [1 ]
Pal, Vedansh [1 ]
Agarwal, Tushar [1 ]
机构
[1] Delhi Technol Univ, Dept Comp Sci, New Delhi, India
关键词
Defect Prediction; Support Vector; Naive Bayes; imbalmiced data; oversampling; SMOTE; SI MSMOTE; ADASEV;
D O I
10.1109/Confluence51648.2021.9377068
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The importance of software defect prediction has risen significantly' over the past decade and is an inseparable part of software quality. Owing to the drawbacks of traditional techniques used for software quality assurance, machine learning algorithms are used to detect the defect in software modules. By the means of this study, we present a support vector based oversampling technique as a part of the software defect procedure and compare it with two other oversampling, techniques namely Synthetic minority oversampling technique and Adaptive Synthetic oversampling technique. For the purpose of this study, we chose 5 datasets from the PROMISE and AEEEM repositories. After extracting a subset of attributes from the original dataset through Linear discriminant Analysis, we utilize the improved oversampled dataset to train a support vector machine classifier and a Na ve Bayesian classifier. The proposed Support Vector based oversampling technique along with Linear Discriminant Analysis performs better than the other techniques for the performance evaluation metric of Fmeasure score and the area under receiver operating characteristic curve and the consistency of result is maintained.
引用
收藏
页码:1078 / 1083
页数:6
相关论文
共 50 条
  • [1] COSTE: Complexity-based OverSampling TEchnique to alleviate the class imbalance problem in software defect prediction
    Feng, Shuo
    Keung, Jacky
    Yu, Xiao
    Xiao, Yan
    Bennin, Kwabena Ebo
    Kabir, Md Alamgir
    Zhang, Miao
    [J]. INFORMATION AND SOFTWARE TECHNOLOGY, 2021, 129
  • [2] An Ensemble Oversampling Model for Class Imbalance Problem in Software Defect Prediction
    Huda, Shamsul
    Liu, Kevin
    Abdelrazek, Mohamed
    Ibrahim, Amani
    Alyahya, Sultan
    Al-Dossari, Hmood
    Ahmad, Shafiq
    [J]. IEEE ACCESS, 2018, 6 : 24184 - 24195
  • [3] MAHAKIL: Diversity Based Oversampling Approach to Alleviate the Class Imbalance Issue in Software Defect Prediction
    Benni, Kwabena Ebo
    Keung, Jacky
    Phannachitta, Passakorn
    Monden, Akito
    Mensah, Solomon
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2018, 44 (06) : 534 - 550
  • [4] Adaptive Centre-Weighted Oversampling for Class Imbalance in Software Defect Prediction
    Zhao, Qi
    Yan, Xuefeng
    Zhou, Yong
    [J]. 2018 IEEE INT CONF ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, UBIQUITOUS COMPUTING & COMMUNICATIONS, BIG DATA & CLOUD COMPUTING, SOCIAL COMPUTING & NETWORKING, SUSTAINABLE COMPUTING & COMMUNICATIONS, 2018, : 223 - 230
  • [5] Genetic algorithm-based oversampling approach to prune the class imbalance issue in software defect prediction
    C. Arun
    C. Lakshmi
    [J]. Soft Computing, 2022, 26 : 12915 - 12931
  • [6] Genetic algorithm-based oversampling approach to prune the class imbalance issue in software defect prediction
    Arun, C.
    Lakshmi, C.
    [J]. SOFT COMPUTING, 2022, 26 (23) : 12915 - 12931
  • [7] MAHAKIL: Diversity based Oversampling Approach to Alleviate the Class Imbalance Issue in Software Defect Prediction Extended Abstract
    Bennin, Kwabena E.
    Keung, Jacky
    Phannachitta, Passakorn
    Monden, Akito
    Mensah, Solomon
    [J]. PROCEEDINGS 2018 IEEE/ACM 40TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE), 2018, : 699 - 699
  • [8] A Comprehensive Analysis of Synthetic Minority Oversampling Technique (SMOTE) for handling class imbalance
    Elreedy, Dina
    Atiya, Amir F.
    [J]. INFORMATION SCIENCES, 2019, 505 : 32 - 64
  • [9] Handling Class-Imbalance with KNN (Neighbourhood) Under-Sampling for Software Defect Prediction
    Goyal, Somya
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2022, 55 (03) : 2023 - 2064
  • [10] Handling Class-Imbalance with KNN (Neighbourhood) Under-Sampling for Software Defect Prediction
    Somya Goyal
    [J]. Artificial Intelligence Review, 2022, 55 : 2023 - 2064