A machine learning based data modeling for medical diagnosis

Cited by: 8
Authors
Mahoto, Naeem Ahmed [1]
Shaikh, Asadullah [2]
Sulaiman, Adel [2]
Reshan, Mana Saleh Al [2]
Rajab, Adel [2]
Rajab, Khairan [2]
Affiliations
[1] Mehran Univ Engn & Technol, Dept Software Engn, Jamshoro 76062, Sindh, Pakistan
[2] Najran Univ, Coll Comp Sci & Informat Syst, Najran 61441, Saudi Arabia
Keywords
Machine learning; Medical data; Classification; Predictive models
DOI
10.1016/j.bspc.2022.104481
Chinese Library Classification number
R318 [Biomedical Engineering]
Discipline classification code
0831
Abstract
High-dimensional medical data makes prediction a complex and difficult task. This study builds predictive models for medical data using two datasets: a publicly available online dataset (Heart Disease data) and a real clinical dataset (Eye Infection data). A range of machine learning algorithms is applied in the modeling stage: Decision Tree, Multilayer Perceptron (MLP), Naive Bayes (NB), Random Forest, and Support Vector Machine. Bagging and voting ensemble methods are also applied on top of the base learners. Models are validated with both percentage splits and cross-validation, and evaluated with accuracy, precision, recall, and F-measure. The modeling method comprises two stages. The first stage uses the available features for prediction. The second stage uses only features selected on the basis of positive correlation. The same method is also applied to deep learning: a Convolutional Neural Network (CNN) is used so that its outcomes can be compared with those of the conventional machine learning algorithms. The experimental results show that the second stage yields better predictions. The experiments also indicate that percentage-split validation produces better predictive models, and that ensemble methods give marginally better outcomes than the base models. Using 80-20 splits in the second stage, NB outperformed the other algorithms on the Heart Disease data with the highest accuracy of 88.90%, and MLP obtained 97.50% accuracy on the Eye Infection data. However, the CNN model performed poorly due to the size of the datasets considered.
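The second-stage idea described in the abstract (keep only positively correlated features, validate on an 80-20 split, report accuracy, precision, recall, and F-measure) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes Pearson correlation between each feature and a binary class label, a threshold of zero, and selection performed on the training split only; the abstract specifies none of these. All function names and the toy data are hypothetical.

```python
import math
import random


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy) if sx and sy else 0.0


def positively_correlated(rows, labels, threshold=0.0):
    """Indices of feature columns whose correlation with the label exceeds threshold.

    Assumption: the threshold of 0.0 is illustrative, not taken from the paper.
    """
    return [j for j in range(len(rows[0]))
            if pearson([r[j] for r in rows], labels) > threshold]


def split_80_20(rows, labels, seed=0):
    """Shuffle indices with a fixed seed and return an 80-20 train/test split."""
    idx = list(range(len(rows)))
    random.Random(seed).shuffle(idx)
    cut = int(0.8 * len(idx))
    train, test = idx[:cut], idx[cut:]
    return ([rows[i] for i in train], [labels[i] for i in train],
            [rows[i] for i in test], [labels[i] for i in test])


def binary_metrics(y_true, y_pred):
    """Accuracy, precision, recall, and F-measure for labels in {0, 1}."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f_measure = (2 * precision * recall / (precision + recall)
                 if precision + recall else 0.0)
    return {"accuracy": (tp + tn) / len(y_true), "precision": precision,
            "recall": recall, "f_measure": f_measure}


# Hypothetical toy data: feature 0 rises with the label, feature 1 falls,
# and feature 2 is constant (so its correlation is treated as 0).
rows = [[1, 9, 5], [2, 8, 5], [3, 7, 5], [8, 2, 5], [9, 1, 5],
        [7, 3, 5], [2, 9, 5], [8, 1, 5], [1, 8, 5], [9, 2, 5]]
labels = [0, 0, 0, 1, 1, 1, 0, 1, 0, 1]

X_train, y_train, X_test, y_test = split_80_20(rows, labels)
selected = positively_correlated(X_train, y_train)  # computed on training data only
X_train_stage2 = [[r[j] for j in selected] for r in X_train]
X_test_stage2 = [[r[j] for j in selected] for r in X_test]
```

Any classifier can then be fitted on `X_train_stage2` and scored with `binary_metrics` on its predictions for `X_test_stage2`. Selecting features on the training split alone is a design choice of this sketch, made to keep test data out of the selection step.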
Pages: 15