Using machine learning to predict cardiovascular risk using self-reported questionnaires: Findings from the 45 and Up Study

被引:2
|
作者
Wang, Hongkuan [1 ]
Tucker, William J. [2 ]
Jonnagaddala, Jitendra [3 ]
Schutte, Aletta E. [3 ,4 ]
Jalaludin, Bin [3 ,5 ]
Rye, Kerry-Anne [2 ]
Liaw, Siaw-Teng [6 ]
Wong, Raymond K. [1 ,9 ]
Ong, Kwok Leung [2 ,7 ,8 ]
机构
[1] Univ New South Wales, Sch Comp Sci & Engn, Sydney, NSW, Australia
[2] Univ New South Wales, Sch Biomed Sci, Sydney, NSW, Australia
[3] Univ New South Wales, Sch Populat Hlth, Sydney, NSW, Australia
[4] George Inst Global Hlth, Sydney, NSW, Australia
[5] Univ New South Wales, Ingham Inst Appl Med Res, Sydney, Australia
[6] Univ New South Wales, WHO, Sch Populat Hlth, Collaborating Ctr ehlth, Sydney, NSW, Australia
[7] Univ Sydney, NHMRC Clin Trials Ctr, Med Fdn Bldg,92-94 Parramatta Rd, Camperdown, NSW 2050, Australia
[8] Room 134,Med Fdn Bldg,92-94 Parramatta Rd, Camperdown, NSW 2050, Australia
[9] Univ New South Wales, Sch Comp Sci & Engn, Sydney, NSW 2052, Australia
基金
英国医学研究理事会;
关键词
Cardiovascular disease; Classification; Machine learning; Risk prediction; Survey; BODY-MASS INDEX; SOCIAL DETERMINANTS; POPULATION; DISEASES; MODELS;
D O I
10.1016/j.ijcard.2023.05.030
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background: Machine learning has been shown to outperform traditional statistical methods for risk prediction model development. We aimed to develop machine learning-based risk prediction models for cardiovascular mortality and hospitalisation for ischemic heart disease (IHD) using self-reported questionnaire data.Methods: The 45 and Up Study was a retrospective population-based study in New South Wales, Australia (2005-2009). Self-reported healthcare survey data on 187,268 participants without a history of cardiovascular disease was linked to hospitalisation and mortality data. We compared different machine learning algorithms, including traditional classification methods (support vector machine (SVM), neural network, random forest and logistic regression) and survival methods (fast survival SVM, Cox regression and random survival forest).Results: A total of 3687 participants experienced cardiovascular mortality and 12,841 participants had IHD-related hospitalisation over a median follow-up of 10.4 years and 11.6 years respectively. The best model for cardiovascular mortality was a Cox survival regression with L1 penalty at a re-sampled case/non-case ratio of 0.3 achieved by under-sampling of the non-cases. This model had the Uno's and Harrel's concordance indexes of 0.898 and 0.900 respectively. The best model for IHD hospitalisation was a Cox survival regression with L1 penalty at a re-sampled case/non-case ratio of 1.0 with Uno's and Harrel's concordance indexes of 0.711 and 0.718 respectively.Conclusion: Machine learning-based risk prediction models developed using self-reported questionnaire data had good prediction performance. These models may have the potential to be used in initial screening tests to identify high-risk individuals before undergoing costly investigation.
引用
收藏
页码:149 / 156
页数:8
相关论文
共 50 条
  • [1] Using machine learning to predict cardiovascular risk using self-reported questionnaires: A right, but a long way to go
    Shamia, David
    Zahger, Doron
    INTERNATIONAL JOURNAL OF CARDIOLOGY, 2023, 389
  • [2] Using machine learning to retrospectively predict self-reported gambling problems in Quebec
    Murch, W. Spencer
    Kairouz, Sylvia
    Dauphinais, Sophie
    Picard, Elyse
    Costes, Jean-Michel
    French, Martin
    ADDICTION, 2023, 118 (08) : 1569 - 1578
  • [3] Predicting the 10-year risk of cataract surgery using machine learning techniques on questionnaire data: findings from the 45 and Up Study
    Wang, Wei
    Han, Xiaotong
    Zhang, Jiaqing
    Shang, Xianwen
    Ha, Jason
    Liu, Zhenzhen
    Zhang, Lei
    Luo, Lixia
    He, Mingguang
    BRITISH JOURNAL OF OPHTHALMOLOGY, 2022, 106 (11) : 1503 - 1507
  • [4] Cardiovascular findings in self-reported healthy elderly - The Elite Seniors Study
    Nair, B
    Hughes, J
    Basta, M
    Hardy, D
    Crooks, R
    Finucane, P
    Fletcher, P
    Silberberg, J
    AUSTRALIAN AND NEW ZEALAND JOURNAL OF MEDICINE, 1996, 26 (03): : 363 - 367
  • [5] Association Between Self-Reported Sleep Quality And Gait In Young Adults: A Study Using Machine Learning
    Boolani, Ali
    Huang, Haikun
    Johnson, Ronald
    Yu, Lap-Fai
    Jansen, Erica
    Martin, Rebecca
    Yager, Chelsea C.
    Jacobson, Bert
    Martin, Joel
    MEDICINE & SCIENCE IN SPORTS & EXERCISE, 2022, 54 (09) : 187 - 187
  • [6] Addressing Disparities Between Physician Rated and Self-Reported Dyspnea Using Machine Learning
    Mkrtchyan, Karapet
    Lohn, Borena
    Qazi, Anser
    Ma, Shujie
    Heinrich, Erica
    PHYSIOLOGY, 2024, 39
  • [7] Application of learning analytics to study the accuracy of self-reported working patterns in self-regulated learning questionnaires
    Uguina Gadella, Lucia
    Estevez-Ayres, Iria
    Arias Fisteus, Jesus
    Delgado-Kloos, Carlos
    PROCEEDINGS OF THE 2020 IEEE GLOBAL ENGINEERING EDUCATION CONFERENCE (EDUCON 2020), 2020, : 1201 - 1205
  • [8] Toward a Model to Predict Cardiovascular Disease Risk Using a Machine Learning Approach
    Slime, Khaoula
    Maizate, Abderrahim
    Hassouni, Larbi
    Mouine, Najat
    IAENG International Journal of Computer Science, 2024, 51 (05) : 519 - 527
  • [9] Suicide attempt risk predicts inconsistent self-reported suicide attempts: A machine learning approach using longitudinal data
    Haghish, E. F.
    Czajkowski, Nikolai
    Walby, Fredrik A.
    Qin, Ping
    Laeng, Bruno
    JOURNAL OF AFFECTIVE DISORDERS, 2024, 355 : 495 - 504
  • [10] Self-reported menstrual status and cardiovascular risk factors: The CARDIA Study
    Schreiner, PJ
    Matthews, KA
    Lewis, CE
    McCreath, HE
    Hilner, JE
    CIRCULATION, 2003, 107 (07) : E7029 - E7030