A machine learning based data modeling for medical diagnosis

被引:8
|
作者
Mahoto, Naeem Ahmed [1 ]
Shaikh, Asadullah [2 ]
Sulaiman, Adel [2 ]
Reshan, Mana Saleh Al [2 ]
Rajab, Adel [2 ]
Rajab, Khairan [2 ]
机构
[1] Mehran Univ Engn & Technol, Dept Software Engn, Jamshoro 76062, Sindh, Pakistan
[2] Najran Univ, Coll Comp Sci & Informat Syst, Najran 61441, Saudi Arabia
关键词
Machine learning; Medical data; Classification; Predictive models; CLASSIFICATION;
D O I
10.1016/j.bspc.2022.104481
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
High-dimensional medical data makes prediction a complex and difficult task. This study aims at modeling predictive models for medical data. Two datasets of medical data are applied in the study - one online available dataset (Heart Disease data) and another real clinical dataset (Eye Infection Data). A wide range of machine learning algorithms are applied in the modeling stage: Decision Tree, Multilayer Perceptron, Naive Bayesian, Random Forest, and Support Vector Machine. Furthermore, bagging and voting ensemble methods have also been applied with base learners. Both split and cross-validation methods are adopted for the model validation, and well-established evaluation metrics such as accuracy, precision, recall, and F-measure have been considered as evaluation metrics for the predictive models. The method applied for the modeling is comprised of two stages. The first stage uses available features for the predictions. In the second stage, selected features based on positive correlation are used. The adopted method is also for deep learning, especially Convolutional Neural Network (CNN) is applied to analyze the outcomes compared to conventional machine learning algorithms. The experimental results reveal that better predictions are achieved in the second stage. Besides, experiments also indicate split percentage produces better predictive models, and marginally better outcomes are observed in the presence of ensemble methods in comparison with base models. NB outperformed other algorithms with the highest accuracy rate as 88.90%, and MLP obtained 97.50% accuracy for Heart Disease and Eye Infection data, respectively, using 80-20 splits in the second stage. However, the CNN model performed poorly due to the size of the considered datasets.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Medical image fusion based on machine learning for health diagnosis and monitoring of colorectal cancer
    Peng, Yifeng
    Deng, Haijun
    BMC MEDICAL IMAGING, 2024, 24 (01)
  • [42] Research related to the diagnosis of prostate cancer based on machine learning medical images: A review
    Chen, Xinyi
    Liu, Xiang
    Wu, Yuke
    Wang, Zhenglei
    Wang, Shuo Hong
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2024, 181
  • [43] Machine learning for medical imaging-based COVID-19 detection and diagnosis
    Rehouma, Rokaya
    Buchert, Michael
    Chen, Yi-Ping Phoebe
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2021, 36 (09) : 5085 - 5115
  • [44] Empowering Medical Diagnosis: A Machine Learning Approach for Symptom-Based Health Checker
    Aissaoui Ferhi, Leila
    Ben Amar, Manel
    Choubani, Fethi
    Bouallegue, Ridha
    MOBILE NETWORKS & APPLICATIONS, 2024, : 676 - 702
  • [45] Medical image fusion based on machine learning for health diagnosis and monitoring of colorectal cancer
    Yifeng Peng
    Haijun Deng
    BMC Medical Imaging, 24
  • [46] Research related to the diagnosis of prostate cancer based on machine learning medical images: A review
    Chen, Xinyi
    Liu, Xiang
    Wu, Yuke
    Wang, Zhenglei
    Wang, Shuo Hong
    International Journal of Medical Informatics, 2024, 181
  • [47] Designing an Artificial Immune System-Based Machine Learning Classifier for Medical Diagnosis
    Cheng, Hui-Ping
    Lin, Zheng-Sheng
    Hsiao, Hsiao-Fen
    Tseng, Ming-Lang
    INFORMATION COMPUTING AND APPLICATIONS, 2010, 6377 : 333 - +
  • [48] Fault diagnosis for automotive assembly based on optical coordinate data and machine learning
    Zeng, Xuan
    GLOBAL INTELLIGENCE INDUSTRY CONFERENCE (GIIC 2018), 2018, 10835
  • [49] Data Preparation Step for Automated Diagnosis based on HRV Analysis and Machine Learning
    Timothy, Vincentius
    Prihatmanto, Ary Setijadi
    Rhee, Kyung-Hyune
    PROCEEDINGS OF THE 2016 6TH INTERNATIONAL CONFERENCE ON SYSTEM ENGINEERING AND TECHNOLOGY (ICSET), 2016, : 142 - 148
  • [50] Machine Learning Based Preemptive Diagnosis of Lung Cancer Using Clinical Data
    Olatunji, Sunday O.
    Alansari, Aisha
    Alkhorasani, Heba
    Alsubaii, Meelaf
    Sakloua, Rasha
    Alzah-Rani, Reem
    Alsaleem, Yasmeen
    Alassaf, Reem
    Farooqui, Mehwash
    Ahmed, Mohammed Imran Basheer
    2022 7TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND MACHINE LEARNING APPLICATIONS (CDMA 2022), 2022, : 115 - 120