Combat with Class Overlapping in Software Defect Prediction Using Neighbourhood Metric

被引:0
|
作者
Gupta S. [1 ]
Richa [2 ]
Kumar R. [3 ,4 ]
Jain K.L. [3 ,4 ]
机构
[1] School of Computer Science Engineering, Vellore Institute of Technology, Chennai
[2] Department of Computer science and Engineering, Birla Institute of Technology, Mesra, Ranchi
[3] School of Electronics Engineering, Vellore Institute of Technology, Chennai
[4] School of Computer & Communication Engineering, Manipal University Jaipur, Jaipur
关键词
AUC; Class imbalance; Class overlap; G-mean; Recall; Software defect prediction;
D O I
10.1007/s42979-023-02082-8
中图分类号
学科分类号
摘要
The characteristics of data is a open problem which has been tended perceived in data analysis in machine learning research from last decades. The researcher defined some measures to identify the characteristics of the dataset by applying data complexity measures to find the fitness for purpose. The presence of class overlapping in data-sets, significantly affect performance of the classifiers. Data complexity measures provide quantitative insight in quality of the data set and overlapping existent in it. Machine learning techniques are also utilized by several researchers on healthcare datasets in software defect prediction. In this paper, our aim is to evaluates the effectiveness of new overlap measure: Near Enemy Ratio, and its effect on complexity measures and performance of the classifier. The new ration is based on nearest instances to the target instance. The experimental result offers insights in usefulness of the method and help us decide whether this solution should be applied on a particular data-set or not. © 2023, The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd.
引用
收藏
相关论文
共 50 条
  • [41] Software defect prediction using regression via classification
    Bibi, S.
    Tsoumakas, G.
    Stamelos, I.
    Vlahavas, I
    2006 IEEE INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS, VOLS 1-3, 2006, : 330 - +
  • [42] Software Defect Prediction Using Random Forest Algorithm
    Soe, Yan Naung
    Santosa, Paulus Insap
    Hartanto, Rudy
    2018 12TH SOUTH EAST ASIAN TECHNICAL UNIVERSITY CONSORTIUM (SYMPOSIUM SEATUC 2018): ENGINEERING EDUCATION AND RESEARCH FOR SUSTAINABLE DEVELOPMENT, 2018,
  • [43] Software defect prediction using global and local models
    Suhag, Vikas
    Dubey, Sanjay Kumar
    Sharma, Bhupendra Kumar
    INTERNATIONAL JOURNAL OF SYSTEM ASSURANCE ENGINEERING AND MANAGEMENT, 2024, 15 (08) : 4003 - 4017
  • [44] A Survey on Software Defect Prediction Using Deep Learning
    Akimova, Elena N.
    Bersenev, Alexander Yu
    Deikov, Artem A.
    Kobylkin, Konstantin S.
    Konygin, Anton, V
    Mezentsev, Ilya P.
    Misilov, Vladimir E.
    MATHEMATICS, 2021, 9 (11)
  • [45] Software Defect Density Prediction Using Deep Learning
    Alghanim, Firas
    Azzeh, Mohammad
    El-Hassan, Ammar
    Qattous, Hazem
    IEEE ACCESS, 2022, 10 : 114629 - 114641
  • [46] Software defect prediction using learning to rank approach
    Ali Bou Nassif
    Manar Abu Talib
    Mohammad Azzeh
    Shaikha Alzaabi
    Rawan Khanfar
    Ruba Kharsa
    Lefteris Angelis
    Scientific Reports, 13
  • [47] Software Defect Prediction Using Augmented Bayesian Networks
    Muthukumaran, K.
    Srinivas, Suri
    Malapati, Aruna
    Neti, Lalita Bhanu Murthy
    PROCEEDINGS OF THE EIGHTH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND PATTERN RECOGNITION (SOCPAR 2016), 2018, 614 : 279 - 293
  • [48] Software Defect Prediction using Convolutional Neural Network
    Wongpheng, Kittisak
    Visutsak, Porawat
    35TH INTERNATIONAL TECHNICAL CONFERENCE ON CIRCUITS/SYSTEMS, COMPUTERS AND COMMUNICATIONS (ITC-CSCC 2020), 2020, : 240 - 243
  • [49] Class Imbalance Reduction (CIR): A Novel Approach to Software Defect Prediction in the Presence of Class Imbalance
    Bejjanki, Kiran Kumar
    Gyani, Jayadev
    Gugulothu, Narsimha
    SYMMETRY-BASEL, 2020, 12 (03):
  • [50] Influence Analysis Method of Class Imbalance on Software Defect Prediction Model Stability and Prediction Performance
    Zhang Y.-M.
    Zhi S.-L.
    Jiang S.-J.
    Yuan G.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2023, 51 (08): : 2076 - 2087