Early Thyroid Risk Prediction by Data Mining and Ensemble Classifiers

被引:6
|
作者
Alshayeji, Mohammad H. [1 ]
机构
[1] Kuwait Univ, Coll Engn & Petr, Dept Comp Engn, POB 5969, Safat 13060, Kuwait
来源
关键词
machine learning; thyroid; data mining; ensemble model; feature engineering; SMOTE;
D O I
10.3390/make5030061
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Thyroid disease is among the most prevalent endocrinopathies worldwide. As the thyroid gland controls human metabolism, thyroid illness is a matter of concern for human health. To save time and reduce error rates, an automatic, reliable, and accurate thyroid identification machine-learning (ML) system is essential. The proposed model aims to address existing work limitations such as the lack of detailed feature analysis, visualization, improvement in prediction accuracy, and reliability. Here, a public thyroid illness dataset containing 29 clinical features from the University of California, Irvine ML repository was used. The clinical features helped us to build an ML model that can predict thyroid illness by analyzing early symptoms and replacing the manual analysis of these attributes. Feature analysis and visualization facilitate an understanding of the role of features in thyroid prediction tasks. In addition, the overfitting problem was eliminated by 5-fold cross-validation and data balancing using the synthetic minority oversampling technique (SMOTE). Ensemble learning ensures prediction model reliability owing to the involvement of multiple classifiers in the prediction decisions. The proposed model achieved 99.5% accuracy, 99.39% sensitivity, and 99.59% specificity with the boosting method which is applicable to real-time computer-aided diagnosis (CAD) systems to ease diagnosis and promote early treatment.
引用
收藏
页码:1195 / 1213
页数:19
相关论文
共 50 条
  • [1] Thyroid prediction using ensemble data mining techniques
    Yadav D.C.
    Pal S.
    [J]. International Journal of Information Technology, 2022, 14 (3) : 1273 - 1283
  • [2] A similarity evaluation technique for data mining with an ensemble of classifiers
    Puuronen, S
    Terziyan, V
    [J]. 11TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATION, PROCEEDINGS, 2000, : 1155 - 1159
  • [3] Mining Smart Learning Analytics Data Using Ensemble Classifiers
    Kausar, Samina
    Oyelere, Solomon Sunday
    Salal, Yass Khudheir
    Hussain, Sadiq
    Cifci, Mehmet Akif
    Hilcenko, Slavoljub
    Iqbal, Muhammad Shahid
    Zhu Wenhao
    Xu Huahu
    [J]. INTERNATIONAL JOURNAL OF EMERGING TECHNOLOGIES IN LEARNING, 2020, 15 (12) : 81 - 102
  • [4] DATA MINING CLASSIFIERS COMPARISON FOR SEISMIC HAZARD PREDICTION
    Sneha
    Abhari, Abdolreza
    Ding, Chen
    [J]. COMMUNICATIONS AND NETWORKING SYMPOSIUM (CNS 2018), 2018,
  • [5] Early Prediction of Parkinson's Disease (PD) Using Ensemble Classifiers
    Anisha, C. D.
    Arulanand, N.
    [J]. 2020 INTERNATIONAL CONFERENCE ON INNOVATIVE TRENDS IN INFORMATION TECHNOLOGY (ICITIIT), 2020,
  • [6] Cardiovascular Risk Prediction Method Based on Test Analysis and Data Mining Ensemble System
    Xu, Shan
    Shi, Haoyue
    Duan, Xiaohui
    Zhu, Tiangang
    Wu, Peihua
    Liu, Dongyue
    [J]. PROCEEDINGS OF 2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA ANALYSIS (ICBDA), 2016, : 126 - 130
  • [7] Mining Concept-Drifting and Noisy Data Streams using Ensemble Classifiers
    Ouyang, Zhenzheng
    Zhou, Min
    Wang, Tao
    Wu, Quanyuan
    [J]. 2009 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, VOL IV, PROCEEDINGS, 2009, : 360 - +
  • [8] Effective Credit Risk Prediction Using Ensemble Classifiers With Model Explanation
    Aruleba, Idowu
    Sun, Yanxia
    [J]. IEEE ACCESS, 2024, 12 : 115015 - 115025
  • [9] Mining Battlefield Information Using Ensemble Classifiers
    Xu, Xiansheng
    Wang, Tao
    Ouyang, Zhenzheng
    [J]. PROCEEDINGS OF 2010 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY (ICCSIT 2010), VOL 8, 2010, : 506 - 509
  • [10] Predicting construction cost overruns using text mining, numerical data and ensemble classifiers
    Williams, Trefor P.
    Gong, Jie
    [J]. AUTOMATION IN CONSTRUCTION, 2014, 43 : 23 - 29