Using machine learning to predict cardiovascular risk using self-reported questionnaires: Findings from the 45 and Up Study

被引:2
|
作者
Wang, Hongkuan [1 ]
Tucker, William J. [2 ]
Jonnagaddala, Jitendra [3 ]
Schutte, Aletta E. [3 ,4 ]
Jalaludin, Bin [3 ,5 ]
Rye, Kerry-Anne [2 ]
Liaw, Siaw-Teng [6 ]
Wong, Raymond K. [1 ,9 ]
Ong, Kwok Leung [2 ,7 ,8 ]
机构
[1] Univ New South Wales, Sch Comp Sci & Engn, Sydney, NSW, Australia
[2] Univ New South Wales, Sch Biomed Sci, Sydney, NSW, Australia
[3] Univ New South Wales, Sch Populat Hlth, Sydney, NSW, Australia
[4] George Inst Global Hlth, Sydney, NSW, Australia
[5] Univ New South Wales, Ingham Inst Appl Med Res, Sydney, Australia
[6] Univ New South Wales, WHO, Sch Populat Hlth, Collaborating Ctr ehlth, Sydney, NSW, Australia
[7] Univ Sydney, NHMRC Clin Trials Ctr, Med Fdn Bldg,92-94 Parramatta Rd, Camperdown, NSW 2050, Australia
[8] Room 134,Med Fdn Bldg,92-94 Parramatta Rd, Camperdown, NSW 2050, Australia
[9] Univ New South Wales, Sch Comp Sci & Engn, Sydney, NSW 2052, Australia
基金
英国医学研究理事会;
关键词
Cardiovascular disease; Classification; Machine learning; Risk prediction; Survey; BODY-MASS INDEX; SOCIAL DETERMINANTS; POPULATION; DISEASES; MODELS;
D O I
10.1016/j.ijcard.2023.05.030
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background: Machine learning has been shown to outperform traditional statistical methods for risk prediction model development. We aimed to develop machine learning-based risk prediction models for cardiovascular mortality and hospitalisation for ischemic heart disease (IHD) using self-reported questionnaire data.Methods: The 45 and Up Study was a retrospective population-based study in New South Wales, Australia (2005-2009). Self-reported healthcare survey data on 187,268 participants without a history of cardiovascular disease was linked to hospitalisation and mortality data. We compared different machine learning algorithms, including traditional classification methods (support vector machine (SVM), neural network, random forest and logistic regression) and survival methods (fast survival SVM, Cox regression and random survival forest).Results: A total of 3687 participants experienced cardiovascular mortality and 12,841 participants had IHD-related hospitalisation over a median follow-up of 10.4 years and 11.6 years respectively. The best model for cardiovascular mortality was a Cox survival regression with L1 penalty at a re-sampled case/non-case ratio of 0.3 achieved by under-sampling of the non-cases. This model had the Uno's and Harrel's concordance indexes of 0.898 and 0.900 respectively. The best model for IHD hospitalisation was a Cox survival regression with L1 penalty at a re-sampled case/non-case ratio of 1.0 with Uno's and Harrel's concordance indexes of 0.711 and 0.718 respectively.Conclusion: Machine learning-based risk prediction models developed using self-reported questionnaire data had good prediction performance. These models may have the potential to be used in initial screening tests to identify high-risk individuals before undergoing costly investigation.
引用
收藏
页码:149 / 156
页数:8
相关论文
共 50 条
  • [21] Using deep learning to predict cardiovascular magnetic resonance findings from echocardiography videos
    Sahashi, Y.
    Ouyang, D.
    Kwan, A.
    EUROPEAN HEART JOURNAL, 2024, 45
  • [22] Monitoring aerobic capacity in cancer survivors using self-reported questionnaires: criterion validity and responsiveness
    Weemaes, Anouk T. R.
    Meijer, Renske
    Beelen, Milou
    van Hooff, Martijn
    Weijenberg, Matty P.
    Lenssen, Antoine F.
    van de Poll-franse, Lonneke V.
    Savelberg, Hans H. C. M.
    Schep, Goof
    JOURNAL OF PATIENT-REPORTED OUTCOMES, 2023, 7 (01)
  • [23] An exploration into self-reported inactivity behaviours of adults with an intellectual disability using physical activity questionnaires
    Lynch, L.
    McCarron, M.
    McCallion, P.
    Burke, E.
    JOURNAL OF INTELLECTUAL DISABILITY RESEARCH, 2024, 68 (12) : 1396 - 1407
  • [24] Monitoring aerobic capacity in cancer survivors using self-reported questionnaires: criterion validity and responsiveness
    Anouk T.R. Weemaes
    Renske Meijer
    Milou Beelen
    Martijn van Hooff
    Matty P. Weijenberg
    Antoine F. Lenssen
    Lonneke V. van de Poll-Franse
    Hans H.C.M. Savelberg
    Goof Schep
    Journal of Patient-Reported Outcomes, 7
  • [25] Association between Self-reported Sleep Quality and Single-task Gait in Young Adults: A Study Using Machine Learning
    Martin, Joel
    Huang, Haikun
    Johnson, Ronald
    Yu, Lap-Fai
    Jansen, Erica
    Martin, Rebecca
    Yager, Chelsea
    Boolani, Ali
    SLEEP SCIENCE, 2023, 16 (04) : 399 - 407
  • [26] Using self-reported data to predict expenditures for the health care of older people
    Pacala, JT
    Boult, C
    Urdangarin, C
    McCaffrey, D
    JOURNAL OF THE AMERICAN GERIATRICS SOCIETY, 2003, 51 (05) : 609 - 614
  • [27] Using machine learning to ace cardiovascular risk tests
    Bell, James R.
    Figtree, Gemma A.
    Drummond, Grant R.
    CARDIOVASCULAR RESEARCH, 2020, 116 (14) : 2173 - 2174
  • [28] Patients' Self-Reported Adherence to Cardiovascular Medication Using Electronic Monitors as Comparators
    Zeller, Andreas
    Ramseier, Esther
    Teagtmeyer, Anne
    Battegay, Edouard
    HYPERTENSION RESEARCH, 2008, 31 (11) : 2037 - 2043
  • [29] Patients' Self-Reported Adherence to Cardiovascular Medication Using Electronic Monitors as Comparators
    Andreas Zeller
    Esther Ramseier
    Anne Teagtmeyer
    Edouard Battegay
    Hypertension Research, 2008, 31 : 2037 - 2043
  • [30] Prevalence of Self-reported Cardiovascular Risk Factors among Saudi Physicians: A Comparative Study
    Al Alwan, Ibrahim
    Badri, Motasim
    Al-Ghamdi, Maram
    Aljarbou, Alanoud
    Alotaibi, Hessa
    Tamim, Hani
    INTERNATIONAL JOURNAL OF HEALTH SCIENCES-IJHS, 2013, 7 (01): : 3 - 13