Analysis of the Performance of Learners for Change Prediction Using Imbalanced Data

Cited by: 0
Authors
Bansal, Ankita [1 ]
Modi, Kanika [1 ]
Jain, Roopal [1 ]
Affiliations
[1] NSIT, Div IT, Delhi, India
Keywords
Software change prediction; Sampling; Change prone classes; Imbalanced learning; Object-oriented metrics; K-fold cross validation; CLASSIFICATION;
DOI
10.1007/978-981-13-1819-1_33
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Software change prediction is important for economically scheduling the allocation of resources during the various phases of software maintenance and testing. Furthermore, accurate classification of change prone and non-change prone classes is important in the early phases of the software development life cycle, since it helps in developing cost-effective, high-quality software for real-time use. A good prediction model should predict both the change prone and the non-change prone classes with high accuracy. However, most practical datasets contain underrepresented data and severe class distribution skews. Because of this imbalance, the minority classes are not predicted accurately, which leads to poor resource planning. Popular operating systems such as Android are updated very frequently, so it is essential to identify change prone and non-change prone classes precisely in newer versions of such software. In this paper, we present a comprehensive study of various machine learning algorithms for predicting change prone classes, using sampling techniques such as resampling and spread subsampling on six open source datasets with imbalanced data. The experimental results of the study indicate that the resampling technique consistently and significantly improves the performance of all the models.
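The abstract pairs resampling with k-fold cross validation but this record does not say how the two were combined. As a rough illustration only, the sketch below shows one common arrangement: random oversampling of the smaller class, applied to the training portion of each fold so that the held-out fold keeps its original class distribution. The function names, the choice of plain random oversampling, and the default of five folds are all assumptions made for this example; they are not taken from the paper.

```python
import random


def oversample_minority(rows, labels, rng):
    """Randomly duplicate rows of smaller classes until every class
    has as many rows as the largest class (hypothetical helper)."""
    by_class = {}
    for row, label in zip(rows, labels):
        by_class.setdefault(label, []).append(row)
    target = max(len(members) for members in by_class.values())
    out_rows, out_labels = [], []
    for label, members in by_class.items():
        # Draw extra copies with replacement to reach the target size.
        extra = [rng.choice(members) for _ in range(target - len(members))]
        for row in members + extra:
            out_rows.append(row)
            out_labels.append(label)
    return out_rows, out_labels


def k_fold_indices(n, k, rng):
    """Shuffle the indices 0..n-1 and deal them into k near-equal folds."""
    idx = list(range(n))
    rng.shuffle(idx)
    return [idx[i::k] for i in range(k)]


def balanced_training_folds(rows, labels, k=5, seed=0):
    """Yield (train_rows, train_labels, test_rows, test_labels) per fold.

    Only the training part is resampled; the test fold is left untouched
    so that evaluation reflects the original, imbalanced distribution.
    """
    rng = random.Random(seed)
    folds = k_fold_indices(len(rows), k, rng)
    for i, test_idx in enumerate(folds):
        train_idx = [j for f, fold in enumerate(folds) if f != i for j in fold]
        train_rows, train_labels = oversample_minority(
            [rows[j] for j in train_idx],
            [labels[j] for j in train_idx],
            rng,
        )
        test_rows = [rows[j] for j in test_idx]
        test_labels = [labels[j] for j in test_idx]
        yield train_rows, train_labels, test_rows, test_labels
```

Each yielded training set can be passed to any classifier. Resampling inside the loop, rather than once before splitting, avoids placing duplicated copies of the same row in both the training and the test fold.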
Pages: 345-359
Page count: 15