Prospect theory-based oversampling for software defect prediction

Authors
Xu, Biao [1, 2]
Yan, Yuanting [1, 2]
Zhang, Yiwen [1, 2]
Affiliations
[1] Key Laboratory of Intelligent Computing and Signal Processing, Ministry of Education, Anhui University, Hefei 230601, China
[2] School of Computer Science and Technology, Anhui University, Hefei 230601, China
Funding
National Natural Science Foundation of China
DOI
10.13196/j.cims.2023.BPM06
Chinese Library Classification
V [Aeronautics, Astronautics]
Discipline classification codes
08; 0825
Abstract
In software defect prediction, data difficulty factors affect prediction performance more strongly than class imbalance does. However, most existing oversampling methods ignore the data difficulty factors inherent in software project datasets when addressing class imbalance, which leads to poor prediction performance. To address this problem, a Prospect theory-based Over Sampling algorithm (POS) for software defect prediction is proposed. POS evaluates the learning difficulty of each minority sample by considering the influence of homogeneous (same-class) and heterogeneous (different-class) samples within its local neighborhood. Specifically, POS uses a gravity-based strategy to construct homogeneous gains and heterogeneous losses that characterize the prospect value of each minority sample, and it amplifies the heterogeneous losses when calculating the sampling weights of minority samples. This reduces the risk of introducing data difficulty factors, improves the quality of synthetic samples, and in turn improves prediction performance. Experimental results on the NASA datasets show that POS outperforms the comparison algorithms in terms of AUC, balance, and G-mean. © 2024 CIMS. All rights reserved.
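The abstract gives no formulas, so the following Python sketch is only one plausible reading of the idea, not the authors' POS implementation. It assumes a k-nearest-neighbor neighborhood, an inverse-square "gravity" pull with unit masses, the standard Kahneman-Tversky value function with the commonly quoted parameters (alpha = beta = 0.88, loss aversion lam = 2.25), a shift-and-normalize step to turn prospect values into sampling weights, and SMOTE-style interpolation for synthesis. All function names and parameters are hypothetical.

```python
import numpy as np


def prospect_weights(X, y, minority=1, k=5, alpha=0.88, beta=0.88,
                     lam=2.25, eps=1e-12):
    """Illustrative sampling weights for minority samples (assumed formulas)."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y)
    min_idx = np.flatnonzero(y == minority)
    values = np.empty(len(min_idx))
    for pos, i in enumerate(min_idx):
        dist = np.linalg.norm(X - X[i], axis=1)
        dist[i] = np.inf                       # exclude the sample itself
        nn = np.argsort(dist)[:k]              # local neighborhood
        pull = 1.0 / (dist[nn] ** 2 + eps)     # gravity-like pull, unit masses
        pull = pull / pull.sum()               # relative influence in [0, 1]
        gain = pull[y[nn] == minority].sum()   # homogeneous neighbors
        loss = pull[y[nn] != minority].sum()   # heterogeneous neighbors
        # Prospect value: losses are amplified by lam > 1 (loss aversion).
        values[pos] = gain ** alpha - lam * loss ** beta
    # Assumption: shift values to be positive, then normalize to probabilities.
    shifted = values - values.min() + eps
    return min_idx, shifted / shifted.sum()


def oversample(X, y, n_new, minority=1, k=5, seed=0):
    """Draw seeds by prospect weight and interpolate toward minority neighbors.

    Requires at least two minority samples.
    """
    rng = np.random.default_rng(seed)
    X = np.asarray(X, dtype=float)
    y = np.asarray(y)
    min_idx, w = prospect_weights(X, y, minority=minority, k=k)
    X_min = X[min_idx]
    seeds = rng.choice(len(min_idx), size=n_new, p=w)
    synthetic = np.empty((n_new, X.shape[1]))
    for row, s in enumerate(seeds):
        d = np.linalg.norm(X_min - X_min[s], axis=1)
        d[s] = np.inf
        mates = np.argsort(d)[:min(k, len(X_min) - 1)]
        mate = X_min[rng.choice(mates)]
        # SMOTE-style point on the segment between the seed and its neighbor.
        synthetic[row] = X_min[s] + rng.random() * (mate - X_min[s])
    return synthetic
```

Under these assumptions, a minority sample whose neighborhood is dominated by majority samples receives a strongly negative prospect value and therefore a near-zero chance of being chosen as a seed, which matches the abstract's stated goal of avoiding synthetic samples in difficult regions.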
Pages: 2822-2831