Analysis of the Performance of Learners for Change Prediction Using Imbalanced Data

被引:0
|
作者
Bansal, Ankita [1 ]
Modi, Kanika [1 ]
Jain, Roopal [1 ]
机构
[1] NSIT, Div IT, Delhi, India
关键词
Software change prediction; Sampling; Change prone classes; Imbalanced learning; Object-oriented metrics; K-fold cross validation; CLASSIFICATION;
D O I
10.1007/978-981-13-1819-1_33
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Software change prediction is important to economically schedule allocation of resources during various phases of software maintenance and testing. Furthermore, exact characterization of progress inclined and non-change inclined classes is significant in beginning times of programming advancement life cycle since that helps with creating financially savvy quality programming for real-time use. A good prediction model should predict both the change and non-change prone classes with high accuracy. However, most practical datasets have underrepresented information and serious class appropriation skews. Due to imbalanced data, the minority classes are not predicted accurately causing poor planning of resources. Popular operating systems like Android get updated very fast. In the current scenario, it is essential to recognize change prone and non-change prone classes with precision in newer versions of such software that are updated very frequently. In this paper, we give a complete survey of various machine learning models to predict change prone classes algorithms using sampling technologies like resampling and spreadsubsampling on six open source datasets having imbalanced data. The experimental result of the study advocates that resampling technique consistently and significantly improves the performance of all the models.
引用
收藏
页码:345 / 359
页数:15
相关论文
共 50 条
  • [1] An empirical study for software change prediction using imbalanced data
    Ruchika Malhotra
    Megha Khanna
    [J]. Empirical Software Engineering, 2017, 22 : 2806 - 2851
  • [2] An empirical study for software change prediction using imbalanced data
    Malhotra, Ruchika
    Khanna, Megha
    [J]. EMPIRICAL SOFTWARE ENGINEERING, 2017, 22 (06) : 2806 - 2851
  • [3] A Performance Analysis of Classifiers on Imbalanced Data
    Garcia, Nathan F.
    Strzoda, Romulo A.
    Lucca, Giancarlo
    Borges, Eduardo N.
    [J]. ICEIS: PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS - VOL 1, 2022, : 602 - 609
  • [4] EWT-SMOTE to improve default prediction performance in imbalanced data: Analysis of Chinese data
    Zhou, Ying
    Lin, Xia
    Chi, Guotai
    Jin, Peng
    Li, Mengtong
    [J]. JOURNAL OF FORECASTING, 2024, 43 (03) : 615 - 643
  • [5] An empirical study of the classification performance of learners on imbalanced and noisy software quality data
    Seiffert, Chris
    Khoshgoftaar, Taghi M.
    Van Hulse, Jason
    Folleco, Andres
    [J]. IRI 2007: PROCEEDINGS OF THE 2007 IEEE INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION, 2007, : 651 - +
  • [6] Consumer Bankruptcy Prediction Using Balanced and Imbalanced Data
    Brygala, Magdalena
    [J]. RISKS, 2022, 10 (02)
  • [7] Performance Evaluations of Supervised Learners on Imbalanced Datasets
    Bulut, Faruk
    [J]. 2016 ELECTRIC ELECTRONICS, COMPUTER SCIENCE, BIOMEDICAL ENGINEERINGS' MEETING (EBBT), 2016,
  • [8] Rail Break Prediction and Cause Analysis Using Imbalanced In-Service Train Data
    Zeng, Cheng
    Huang, Jinsong
    Wang, Hongrui
    Xie, Jiawei
    Huang, Shan
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
  • [9] Using Hybridized techniques for Prediction of Software Maintainability using Imbalanced data
    Malhotra, Ruchika
    Lata, Kusum
    [J]. PROCEEDINGS OF THE CONFLUENCE 2020: 10TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING, 2020, : 787 - 792
  • [10] Output Thresholding for Ensemble Learners and Imbalanced Big Data
    Johnson, Justin M.
    Khoshgoftaar, Taghi M.
    [J]. 2021 IEEE 33RD INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2021), 2021, : 1449 - 1454