Enhanced thyroid disease prediction using ensemble machine learning: a high-accuracy approach with feature selection and class balancing

被引:0
|
作者
Md. Rezaul Islam [1 ]
Aniruddha Islam Chowdhury [2 ]
Sharmin Shama [3 ]
Md. Masudul Hasan Lamyea [3 ]
机构
[1] Shahjalal University of Science and Technology,Department of Computer Science and Engineering
[2] Bangabandhu Sheikh Mujibur Rahman Digital University,Department of Educational Technology and Engineering
[3] Dhaka International University,Department of Computer Science and Engineering
来源
关键词
Thyroid Disease Prediction; Machine Learning Algorithms; Data Visualization; Class balancing techniques; XGBoost Algorithm; Confusion Matrices; Etc;
D O I
10.1007/s44163-025-00225-9
中图分类号
学科分类号
摘要
Thyroid disorders are increasingly prevalent, making early detection crucial for reducing mortality and complications. Accurate prediction of disease progression and understanding the interplay of clinical features are essential for effective diagnosis and treatment. Our study addresses these challenges by employing a standard machine learning model, enhanced with comprehensive clinical feature analysis and an ensemble learning technique. By leveraging machine learning, we can identify key risk factors and improve diagnostic accuracy. To achieve optimal prediction outcomes, we evaluated seventeen machine learning models and implemented an Ensemble ML classifier using a hard voting strategy. Class balancing techniques, particularly random oversampling, significantly improved classification performance. Our experimental results demonstrate that the proposed model outperforms existing methods, achieving 100% sensitivity and 99.72% accuracy using the XGBoost algorithm and SelectKBest feature selection. By addressing feature reduction and high class-imbalance, the ensemble ML classifier with hard voting proves more effective in handling classification challenges.
引用
收藏
相关论文
共 50 条
  • [31] Optimizing IoT Intrusion Detection Using Balanced Class Distribution, Feature Selection, and Ensemble Machine Learning Techniques
    Musthafa, Muhammad Bisri
    Huda, Samsul
    Kodera, Yuta
    Ali, Md. Arshad
    Araki, Shunsuke
    Mwaura, Jedidah
    Nogami, Yasuyuki
    SENSORS, 2024, 24 (13)
  • [32] Thyroid Disease Prediction Using Machine Learning Approaches
    Chaubey, Gyanendra
    Bisen, Dhananjay
    Arjaria, Siddharth
    Yadav, Vibhash
    NATIONAL ACADEMY SCIENCE LETTERS-INDIA, 2021, 44 (03): : 233 - 238
  • [33] Thyroid Disease Prediction Using Machine Learning Approaches
    Gyanendra Chaubey
    Dhananjay Bisen
    Siddharth Arjaria
    Vibhash Yadav
    National Academy Science Letters, 2021, 44 : 233 - 238
  • [34] A Supervised Machine Learning Approach using Different Feature Selection Techniques on Voice Datasets for Prediction of Parkinson's Disease
    Aich, Satyabrata
    Kim, Hee-Cheol
    Younga, Kim
    Hui, Kueh Lee
    Al-Absi, Ahmed Abdulhakim
    Sain, Mangal
    2019 21ST INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY (ICACT): ICT FOR 4TH INDUSTRIAL REVOLUTION, 2019, : 1116 - 1121
  • [35] Enhanced detection of diabetes mellitus using novel ensemble feature engineering approach and machine learning model
    Rustam, Furqan
    Al-Shamayleh, Ahmad Sami
    Shafique, Rahman
    Obregon, Silvia Aparicio
    Iglesias, Ruben Calderon
    Gonzalez, J. Pablo Miramontes
    Ashraf, Imran
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [36] Feature Selection Stability and Accuracy of Prediction Models for Genomic Prediction of Residual Feed Intake in Pigs Using Machine Learning
    Piles, Miriam
    Bergsma, Rob
    Gianola, Daniel
    Gilbert, Helene
    Tusell, Llibertat
    FRONTIERS IN GENETICS, 2021, 12
  • [37] Prediction of functional outcomes of schizophrenia with genetic biomarkers using a bagging ensemble machine learning method with feature selection
    Eugene Lin
    Chieh-Hsin Lin
    Hsien-Yuan Lane
    Scientific Reports, 11
  • [38] Prediction of functional outcomes of schizophrenia with genetic biomarkers using a bagging ensemble machine learning method with feature selection
    Lin, Eugene
    Lin, Chieh-Hsin
    Lane, Hsien-Yuan
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [39] Prediction of Radiation Induced Lymphedema for Head & Neck Cancer Patients Using Ensemble Feature Selection and Machine Learning
    Teo, P.
    Rogacki, K.
    Gopalakrishnan, M.
    Mittal, B.
    Das, I.
    Abazeed, M.
    Gentile, M.
    MEDICAL PHYSICS, 2022, 49 (06) : E266 - E267
  • [40] Prediction of Heart Disease using an Ensemble Learning Approach
    Alshehri G.A.
    Alharbi H.M.
    International Journal of Advanced Computer Science and Applications, 2023, 14 (08) : 1089 - 1097