Using machine learning to predict cardiovascular risk using self-reported questionnaires: Findings from the 45 and Up Study

被引:2
|
作者
Wang, Hongkuan [1 ]
Tucker, William J. [2 ]
Jonnagaddala, Jitendra [3 ]
Schutte, Aletta E. [3 ,4 ]
Jalaludin, Bin [3 ,5 ]
Rye, Kerry-Anne [2 ]
Liaw, Siaw-Teng [6 ]
Wong, Raymond K. [1 ,9 ]
Ong, Kwok Leung [2 ,7 ,8 ]
机构
[1] Univ New South Wales, Sch Comp Sci & Engn, Sydney, NSW, Australia
[2] Univ New South Wales, Sch Biomed Sci, Sydney, NSW, Australia
[3] Univ New South Wales, Sch Populat Hlth, Sydney, NSW, Australia
[4] George Inst Global Hlth, Sydney, NSW, Australia
[5] Univ New South Wales, Ingham Inst Appl Med Res, Sydney, Australia
[6] Univ New South Wales, WHO, Sch Populat Hlth, Collaborating Ctr ehlth, Sydney, NSW, Australia
[7] Univ Sydney, NHMRC Clin Trials Ctr, Med Fdn Bldg,92-94 Parramatta Rd, Camperdown, NSW 2050, Australia
[8] Room 134,Med Fdn Bldg,92-94 Parramatta Rd, Camperdown, NSW 2050, Australia
[9] Univ New South Wales, Sch Comp Sci & Engn, Sydney, NSW 2052, Australia
基金
英国医学研究理事会;
关键词
Cardiovascular disease; Classification; Machine learning; Risk prediction; Survey; BODY-MASS INDEX; SOCIAL DETERMINANTS; POPULATION; DISEASES; MODELS;
D O I
10.1016/j.ijcard.2023.05.030
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background: Machine learning has been shown to outperform traditional statistical methods for risk prediction model development. We aimed to develop machine learning-based risk prediction models for cardiovascular mortality and hospitalisation for ischemic heart disease (IHD) using self-reported questionnaire data.Methods: The 45 and Up Study was a retrospective population-based study in New South Wales, Australia (2005-2009). Self-reported healthcare survey data on 187,268 participants without a history of cardiovascular disease was linked to hospitalisation and mortality data. We compared different machine learning algorithms, including traditional classification methods (support vector machine (SVM), neural network, random forest and logistic regression) and survival methods (fast survival SVM, Cox regression and random survival forest).Results: A total of 3687 participants experienced cardiovascular mortality and 12,841 participants had IHD-related hospitalisation over a median follow-up of 10.4 years and 11.6 years respectively. The best model for cardiovascular mortality was a Cox survival regression with L1 penalty at a re-sampled case/non-case ratio of 0.3 achieved by under-sampling of the non-cases. This model had the Uno's and Harrel's concordance indexes of 0.898 and 0.900 respectively. The best model for IHD hospitalisation was a Cox survival regression with L1 penalty at a re-sampled case/non-case ratio of 1.0 with Uno's and Harrel's concordance indexes of 0.711 and 0.718 respectively.Conclusion: Machine learning-based risk prediction models developed using self-reported questionnaire data had good prediction performance. These models may have the potential to be used in initial screening tests to identify high-risk individuals before undergoing costly investigation.
引用
收藏
页码:149 / 156
页数:8
相关论文
共 50 条
  • [41] Machine Learning Analyses For Determination Of Self-reported Knee Function From Biomechanical Variables
    Hogg, Jennifer Ann
    Liang, Yu
    Wu, Dalei
    Milburn, NIcholas
    Wilkerson, Gary
    MEDICINE & SCIENCE IN SPORTS & EXERCISE, 2022, 54 (09) : 646 - 646
  • [42] Can machine learning predict cardiac risk using mammography?
    Lip, Gerald
    O'Regan, Declan P.
    EUROPEAN HEART JOURNAL-CARDIOVASCULAR IMAGING, 2024, 25 (04) : 467 - 468
  • [43] Using machine learning to predict risk of opioid overdose in Medicare
    Lo-Ciganic, Weihsuan Jenny
    Huang, James L.
    Zhang, Hao H.
    Weiss, Jeremy C.
    Wu, Yonghui
    Kwoh, Chian K.
    Donohue, Julie M.
    Cochran, Jerry
    Gordon, Adam J.
    Malone, Daniel C.
    Kuza, Courtney C.
    Gellad, Walid F.
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2019, 28 : 230 - 230
  • [44] Machine-learning-based models to predict cardiovascular risk using oculomics and clinic variables in KNHANES
    Zhang, Yuqi
    Li, Sijin
    Wu, Weijie
    Zhao, Yanqing
    Han, Jintao
    Tong, Chao
    Luo, Niansang
    Zhang, Kun
    BIODATA MINING, 2024, 17 (01)
  • [45] Self-Reported Inmate Conduct: Using Static and Dynamic Factors to Predict Inmate Recidivism
    Worthy, Lisa Degiorgio
    PRISON JOURNAL, 2017, 97 (01): : 79 - 93
  • [46] Estimation of Cardiovascular Risk from Self-Reported Knowledge of Risk Factors: Insights from the Minnesota Heart Survey
    Duval, Sue
    Van't Hof, Jeremy R.
    Steffen, Lyn M.
    Luepker, Russell, V
    CLINICAL EPIDEMIOLOGY, 2020, 12 : 41 - 49
  • [47] A Risk Model to Predict Atrial Fibrillation in Diabetes Using Machine Learning: The ACCORD Study
    Offerman, Erik J.
    Harirforoosh, Sarah
    Fan, Wenjun
    Wong, Nathan D.
    Donaldson, David M.
    CIRCULATION, 2023, 148
  • [48] A multimodal approach using fundus images and text meta-data in a machine learning classifier with embeddings to predict years with self-reported diabetes - An exploratory analysis
    Carrillo-Larco, Rodrigo M.
    Bravo-Rocca, Gusseppe
    Castillo-Cara, Manuel
    Xu, Xiaolin
    Bernabe-Ortiz, Antonio
    PRIMARY CARE DIABETES, 2024, 18 (03) : 327 - 332
  • [49] Self-reported symptoms of sleep-disordered breathing and risk of cardiovascular diseases: Observational and Mendelian randomization findings
    Titova, Olga E.
    Yuan, Shuai
    Baron, John A.
    Lindberg, Eva
    Michaelsson, Karl
    Larsson, Susanna C.
    JOURNAL OF SLEEP RESEARCH, 2022, 31 (06)
  • [50] Risk prediction of cardiovascular disease using machine learning classifiers
    Pal, Madhumita
    Parija, Smita
    Panda, Ganapati
    Dhama, Kuldeep
    Mohapatra, Ranjan K.
    OPEN MEDICINE, 2022, 17 (01): : 1100 - 1113