Nonclinical Features in Predictive Modeling of Cardiovascular Diseases: A Machine Learning Approach

被引:11
|
作者
Sajid, Mirza Rizwan [1 ]
Muhammad, Noryanti [1 ]
Zakaria, Roslinazairimah [1 ]
Shahbaz, Ahmad [2 ]
Bukhari, Syed Ahmad Chan [3 ]
Kadry, Seifedine [4 ]
Suresh, A. [5 ]
机构
[1] Univ Malaysia Pahang, Ctr Math Sci, Coll Comp & Appl Sci, Kuantan 26300, Pahang Darul Ma, Malaysia
[2] Punjab Inst Cardiol, Lahore 54000, Pakistan
[3] St Johns Univ, Collins Coll Profess Studies, Div Comp Sci Math & Sci, New York, NY 11439 USA
[4] Noroff Univ Coll, Fac Appl Comp & Technol, Kristiansand, Norway
[5] SRM Inst Sci & Technol, Dept Comp Sci & Engn, Kattankulathur 603203, Tamilnadu, India
关键词
Nonclinical features; Cardiovascular diseases; Machine learning algorithms; Risk prediction models; Cost-effective model; 10-YEAR RISK; HEALTH-CARE; BIG DATA; VALIDATION; MULTIPLE; PAKISTAN; SET;
D O I
10.1007/s12539-021-00423-w
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background In the broader healthcare domain, the prediction bears more value than an explanation considering the cost of delays in its services. There are various risk prediction models for cardiovascular diseases (CVDs) in the literature for early risk assessment. However, the substantial increase in CVDs-related mortality is challenging global health systems, especially in developing countries. This situation allows researchers to improve CVDs prediction models using new features and risk computing methods. This study aims to assess nonclinical features that can be easily available in any healthcare systems, in predicting CVDs using advanced and flexible machine learning (ML) algorithms. Methods A gender-matched case-control study was conducted in the largest public sector cardiac hospital of Pakistan, and the data of 460 subjects were collected. The dataset comprised of eight nonclinical features. Four supervised ML algorithms were used to train and test the models to predict the CVDs status by considering traditional logistic regression (LR) as the baseline model. The models were validated through the train-test split (70:30) and tenfold cross-validation approaches. Results Random forest (RF), a nonlinear ML algorithm, performed better than other ML algorithms and LR. The area under the curve (AUC) of RF was 0.851 and 0.853 in the train-test split and tenfold cross-validation approach, respectively. The nonclinical features yielded an admissible accuracy (minimum 71%) through the LR and ML models, exhibiting its predictive capability in risk estimation. Conclusion The satisfactory performance of nonclinical features reveals that these features and flexible computational methodologies can reinforce the existing risk prediction models for better healthcare services.
引用
收藏
页码:201 / 211
页数:11
相关论文
共 50 条
  • [1] Nonclinical Features in Predictive Modeling of Cardiovascular Diseases: A Machine Learning Approach
    Mirza Rizwan Sajid
    Noryanti Muhammad
    Roslinazairimah Zakaria
    Ahmad Shahbaz
    Syed Ahmad Chan Bukhari
    Seifedine Kadry
    A. Suresh
    [J]. Interdisciplinary Sciences: Computational Life Sciences, 2021, 13 : 201 - 211
  • [2] Machine Learning Predictive Outcomes Modeling in Inflammatory Bowel Diseases
    Javaid, Aamir
    Shahab, Omer
    Adorno, William
    Fernandes, Philip
    May, Eve
    Syed, Sana
    [J]. INFLAMMATORY BOWEL DISEASES, 2022, 28 (06) : 819 - 829
  • [3] Machine Learning-Based Predictive Models for Detection of Cardiovascular Diseases
    Ogunpola, Adedayo
    Saeed, Faisal
    Basurra, Shadi
    Albarrak, Abdullah M.
    Qasem, Sultan Noman
    [J]. DIAGNOSTICS, 2024, 14 (02)
  • [4] Predictive Modelling of Diseases Based on a Network and Machine Learning Approach
    Tuan-Truong Quang
    Nghia Le
    Bac Le
    [J]. RECENT CHALLENGES IN INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2022, 2022, 1716 : 641 - 654
  • [5] Predictive modeling for wine authenticity using a machine learning approach
    Costa, Nattane Luiza da
    Valentin, Leonardo A.
    Castro, Inar Alves
    Barbosa, Rommel Melgaco
    [J]. ARTIFICIAL INTELLIGENCE IN AGRICULTURE, 2021, 5 : 157 - 162
  • [6] Predictive modeling of wildfires: A new dataset and machine learning approach
    Oulad Sayad, Younes
    Mousannif, Hajar
    Al Moatassime, Hassan
    [J]. FIRE SAFETY JOURNAL, 2019, 104 : 130 - 146
  • [7] Corporate governance and innovation: a predictive modeling approach using machine learning
    de Pilla, Leonardo Henrique Lima
    Silveira, Elaine Barbosa Couto
    Caldieraro, Fabio
    Peci, Alketa
    Aggarwal, Ishani
    [J]. R & D MANAGEMENT, 2024,
  • [8] New Machine Learning based Approach for Predictive Modeling on Spatial Data
    Gangappa, M.
    Mai, C. Kiran
    Sammulal, P.
    [J]. 2017 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2017, : 1654 - 1659
  • [9] Recent developments in modeling, imaging, and monitoring of cardiovascular diseases using machine learning
    Moradi, Hamed
    Al-Hourani, Akram
    Concilia, Gianmarco
    Khoshmanesh, Farnaz
    Nezami, Farhad R.
    Needham, Scott
    Baratchi, Sara
    Khoshmanesh, Khashayar
    [J]. BIOPHYSICAL REVIEWS, 2023, 15 (01) : 19 - 33
  • [10] Prognostic Modeling in Cardiovascular Diseases: Unraveling Survival Odds Through Machine Learning
    Hossain, Al-Amin
    Ahamed, Imtiaj Uddin
    Das Gupta, Uchchas
    Anika, Ayvee Nusreen
    Ahamed, Imam Uddin
    [J]. 2024 IEEE INTERNATIONAL CONFERENCE ON ADVANCED SYSTEMS AND EMERGENT TECHNOLOGIES, ICASET 2024, 2024,