A study of dealing class imbalance problem with machine learning methods for code smell severity detection using PCA-based feature selection technique

Authors
Rajwant Singh Rao
Seema Dewangan
Alok Mishra
Manjari Gupta
Affiliations
[1] Guru Ghasidas Vishwavidyalaya, Department of Computer Science and Information Technology
[2] Norwegian University of Science and Technology, Faculty of Engineering
[3] Banaras Hindu University, (Computer Science), DST
Source
Scientific Reports, 2023, 13 (01)
DOI
Not available
Abstract
Detecting code smells can help reduce maintenance costs and raise source code quality. Code smells help developers and researchers understand several types of design flaws. High-severity code smells can cause significant problems for software and make a system harder to maintain. Assessing the severity of detected code smells is therefore essential, because it allows refactoring efforts to be prioritized. Class imbalance makes code smell severity detection more difficult. In this study, four code smell severity datasets (Data class, God class, Feature envy, and Long method) are used to detect code smell severity. To address class imbalance, the Synthetic Minority Oversampling Technique (SMOTE) is applied. The relevant features of each dataset are chosen using a feature selection technique based on principal component analysis (PCA). Severity is determined using five machine learning techniques: K-nearest neighbor, Random forest, Decision tree, Multi-layer Perceptron, and Logistic Regression. A severity accuracy score of 0.99 was obtained with the Random forest and Decision tree approaches on the Long method code smell. The models are compared on accuracy and three other performance measures (Precision, Recall, and F-measure). Performance with and without SMOTE is also compared and presented. The results are promising and may pave the way for further studies in this area.
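The class-balancing step named in the abstract can be illustrated with a minimal sketch of the SMOTE idea: each synthetic sample is placed at a random point on the line segment between a minority-class sample and one of its nearest minority-class neighbors. The abstract does not say which implementation the authors used, so this is not their code. The function names `smote` and `balance`, the parameters `k` and `seed`, and the toy data are all assumptions made for illustration, and the base sample is picked at random here rather than by iterating over every minority sample as in the original SMOTE formulation.

```python
import math
import random


def smote(minority, n_new, k=5, seed=0):
    """Create n_new synthetic points by interpolating between minority samples."""
    if len(minority) < 2:
        raise ValueError("need at least two minority samples to interpolate")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)
    synthetic = []
    for _ in range(n_new):
        # Pick a random minority sample as the base point (a simplification).
        i = rng.randrange(len(minority))
        base = minority[i]
        # Rank the other minority samples by Euclidean distance to the base.
        others = [p for j, p in enumerate(minority) if j != i]
        others.sort(key=lambda p: math.dist(base, p))
        neighbor = rng.choice(others[:k])
        # Place the new point at a random position on the segment base -> neighbor.
        gap = rng.random()
        synthetic.append([b + gap * (n - b) for b, n in zip(base, neighbor)])
    return synthetic


def balance(X, y, k=5, seed=0):
    """Oversample every class up to the size of the largest class."""
    by_class = {}
    for row, label in zip(X, y):
        by_class.setdefault(label, []).append(list(row))
    target = max(len(rows) for rows in by_class.values())
    X_out, y_out = [], []
    for label, rows in by_class.items():
        extra = []
        if len(rows) < target:
            extra = smote(rows, target - len(rows), k=k, seed=seed)
        for row in rows + extra:
            X_out.append(row)
            y_out.append(label)
    return X_out, y_out


# Hypothetical toy data: two metric values per code element, severity labels 0 and 1.
X = [[1.0, 2.0], [1.5, 1.8], [1.2, 2.2], [0.9, 1.9],
     [1.1, 2.1], [1.3, 2.4], [5.0, 8.0], [6.0, 9.0]]
y = [0, 0, 0, 0, 0, 0, 1, 1]

X_bal, y_bal = balance(X, y)
print(y_bal.count(0), y_bal.count(1))  # prints "6 6"
```

In a pipeline like the one the abstract describes, oversampling of this kind would normally be applied to the training split only, so that synthetic samples do not leak into the data used to report accuracy, precision, recall, and F-measure.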
Related papers
  • [1] A study of dealing class imbalance problem with machine learning methods for code smell severity detection using PCA-based feature selection technique
    Rao, Rajwant Singh
    Dewangan, Seema
    Mishra, Alok
    Gupta, Manjari
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [2] Rank-based univariate feature selection methods on machine learning classifiers for code smell detection
    Shivani Jain
    Anju Saha
    Evolutionary Intelligence, 2022, 15 : 609 - 638
  • [3] Rank-based univariate feature selection methods on machine learning classifiers for code smell detection
    Jain, Shivani
    Saha, Anju
    EVOLUTIONARY INTELLIGENCE, 2022, 15 (01) : 609 - 638
  • [4] Improving Code Smell Detection by Reducing Dimensionality Using Ensemble Feature Selection and Machine Learning
    Nandini A.
    Singh R.
    Rathee A.
    SN Computer Science, 5 (6)
  • [5] Machine Learning-Based Methods for Code Smell Detection: A Survey
    Yadav, Pravin Singh
    Rao, Rajwant Singh
    Mishra, Alok
    Gupta, Manjari
    APPLIED SCIENCES-BASEL, 2024, 14 (14)
  • [6] A Study on Facial Expression Change Detection Using Machine Learning Methods with Feature Selection Technique
    Sung, Sang-Ha
    Kim, Sangjin
    Park, Byung-Kwon
    Kang, Do-Young
    Sul, Sunhae
    Jeong, Jaehyun
    Kim, Sung-Phil
    MATHEMATICS, 2021, 9 (17)
  • [7] Improving performance with hybrid feature selection and ensemble machine learning techniques for code smell detection
    Jain, Shivani
    Saha, Anju
    SCIENCE OF COMPUTER PROGRAMMING, 2021, 212
  • [8] Comparison of Machine Learning Methods for Code Smell Detection Using Reduced Features
    Karaduzovic-Hadziabdic, Kanita
    Spahic, Rialda
    2018 3RD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2018 : 670 - 672
  • [9] Phishing detection based on machine learning and feature selection methods
    Almseidin M.
    Abu Zuraiq A.M.
    Al-kasassbeh M.
    Alnidami N.
    International Journal of Interactive Mobile Technologies, 2019, 13 (12) : 71 - 183
  • [10] Automatic detection of Feature Envy and Data Class code smells using machine learning
    Skipina, Milica
    Slivka, Jelena
    Luburic, Nikola
    Kovacevic, Aleksandar
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 243