Development of machine learning-based models to predict 10-year risk of cardiovascular disease: a prospective cohort study

被引:9
|
作者
You, Jia [1 ,2 ]
Guo, Yu [1 ,2 ]
Kang, Ju-Jiao [1 ,2 ]
Wang, Hui-Fu [1 ,2 ]
Yang, Ming [1 ,2 ]
Feng, Jian-Feng [1 ,2 ,3 ,4 ,5 ,6 ]
Yu, Jin-Tai [1 ,2 ]
Cheng, Wei [1 ,2 ,3 ,6 ,7 ]
机构
[1] Fudan Univ, Inst Sci & Technol Brain Inspired Intelligence, State Key Lab Med Neurobiol, Dept Neurol,Huashan Hosp, Shanghai, Peoples R China
[2] Fudan Univ, MOE Frontiers Ctr Brain Sci, Shanghai, Peoples R China
[3] Fudan Univ, Minist Educ, Key Lab Computat Neurosci & Brain Inspired Intelli, Shanghai, Peoples R China
[4] Fudan Univ, Zhangjiang Fudan Int Innovat Ctr, Shanghai, Peoples R China
[5] Fudan Univ, Sch Data Sci, Shanghai, Peoples R China
[6] Zhejiang Normal Univ, Fudan ISTBI ZJNU Algorithm Ctr Brain inspired Inte, Jinhua, Zhejiang, Peoples R China
[7] Fudan Univ, Shanghai Med Coll, Zhongshan Hosp Immunotherapy Technol Transfer Ctr, Shanghai, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Stroke; Cerebrovascular Disorders; CORONARY-HEART-DISEASE; PRIMARY-CARE; VALIDATION; DERIVATION; SCORE; ASSOCIATION; REGRESSION; UPDATE; ONSET; AGE;
D O I
10.1136/svn-2023-002332
中图分类号
R74 [神经病学与精神病学];
学科分类号
摘要
BackgroundPrevious prediction algorithms for cardiovascular diseases (CVD) were established using risk factors retrieved largely based on empirical clinical knowledge. This study sought to identify predictors among a comprehensive variable space, and then employ machine learning (ML) algorithms to develop a novel CVD risk prediction model. MethodsFrom a longitudinal population-based cohort of UK Biobank, this study included 473 611 CVD-free participants aged between 37 and 73 years old. We implemented an ML-based data-driven pipeline to identify predictors from 645 candidate variables covering a comprehensive range of health-related factors and assessed multiple ML classifiers to establish a risk prediction model on 10-year incident CVD. The model was validated through a leave-one-center-out cross-validation. ResultsDuring a median follow-up of 12.2 years, 31 466 participants developed CVD within 10 years after baseline visits. A novel UK Biobank CVD risk prediction (UKCRP) model was established that comprised 10 predictors including age, sex, medication of cholesterol and blood pressure, cholesterol ratio (total/high-density lipoprotein), systolic blood pressure, previous angina or heart disease, number of medications taken, cystatin C, chest pain and pack-years of smoking. Our model obtained satisfied discriminative performance with an area under the receiver operating characteristic curve (AUC) of 0.762 +/- 0.010 that outperformed multiple existing clinical models, and it was well-calibrated with a Brier Score of 0.057 +/- 0.006. Further, the UKCRP can obtain comparable performance for myocardial infarction (AUC 0.774 +/- 0.011) and ischaemic stroke (AUC 0.730 +/- 0.020), but inferior performance for haemorrhagic stroke (AUC 0.644 +/- 0.026). ConclusionML-based classification models can learn expressive representations from potential high-risked CVD participants who may benefit from earlier clinical decisions.
引用
收藏
页码:475 / 485
页数:11
相关论文
共 50 条
  • [1] Development and Validation of Machine Learning-Based Race-Specific Models to Predict 10-Year Risk of Heart Failure A Multicohort Analysis
    Segar, Matthew W.
    Jaeger, Byron C.
    Patel, Kershaw, V
    Nambi, Vijay
    Ndumele, Chiadi E.
    Correa, Adolfo
    Butler, Javed
    Chandra, Alvin
    Ayers, Colby
    Rao, Shreya
    Lewis, Alana A.
    Raffield, Laura M.
    Rodriguez, Carlos J.
    Michos, Erin D.
    Ballantyne, Christie M.
    Hall, Michael E.
    Mentz, Robert J.
    de Lemos, James A.
    Pandey, Ambarish
    CIRCULATION, 2021, 143 (24) : 2370 - 2383
  • [2] MACHINE LEARNING ALGORITHMS TO PREDICT 10-YEAR ATHEROSCLEROTIC CARDIOVASCULAR RISK IN A CONTEMPORARY, COMMUNITY-BASED HISTORICAL COHORT
    Medina-Inojosa, Jose
    Shelly, Michal
    Attia, Zachi Itzhak
    Noseworthy, Peter
    Kapa, Suraj
    Friedman, Paul
    Lopez-Jimenez, Francisco
    JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2020, 75 (11) : 2027 - 2027
  • [3] A machine learning-based model to predict the 15-year risk for cardiovascular disease in a cohort of people living with HIV
    Muccini, C.
    Masci, C.
    Corso, F.
    Galli, L.
    Poli, A.
    Ranzenigo, M.
    Monardo, R.
    Paganoni, A. M.
    Castagna, A.
    Leva, F.
    HIV MEDICINE, 2021, 22 : 143 - 144
  • [4] Healthy sleep pattern reduce the risk of cardiovascular disease: A 10-year prospective cohort study
    Zhong, Qingqing
    Qin, Zhongshu
    Wang, Xiaowei
    Lan, Jian
    Zhu, Tingping
    Xiao, Xiao
    Su, Li
    Pei, Pei
    Long, Jianxiong
    Zhou, Lifang
    SLEEP MEDICINE, 2023, 105 : 53 - 60
  • [5] Is fatty liver in diabetes a risk factor for HCC and cardiovascular disease? Prospective 10-year cohort study
    Seike, Masataka
    Honda, Koichi
    Oribe, Junya
    Endo, Mizuki
    Yoshihara, Mie
    Tokoro, Masanori
    Iwao, Masao
    Syo, Hiroki
    Murakami, Kazunari
    HEPATOLOGY, 2014, 60 : 620A - 620A
  • [6] IMPACT OF DIET ON 10-YEAR ABSOLUTE CARDIOVASCULAR RISK: A PROSPECTIVE POPULATION-BASED COHORT STUDY
    Kjeldsen, E. W.
    Thomassen, J. Q.
    Rasmussen, K. L.
    Nordestgaard, B. G.
    Tybjaerg-Hansen, A.
    Frikke-Schmidt, R.
    ATHEROSCLEROSIS, 2022, 355 : E64 - E65
  • [7] Development and Validation of Machine Learning-based Race-specific Models to Predict 10year Risk of Heart Failure: A Multi-cohort Analysis
    Segar, Matthew W.
    Jaeger, Byron
    Patel, Kershaw V.
    Nambi, Vijay
    Ndumele, Chiadi E.
    Correa, Adolfo
    Butler, Javed
    Chandra, Alvin
    Ayers, Colby
    Raffield, Laura M.
    Rodriguez, Carlos J.
    Michos, Erin D.
    Ballantyne, Christie M.
    Hall, Michael E.
    Mentz, Robert J.
    De Lemos, James A.
    Pandey, Ambarish
    CIRCULATION, 2020, 142
  • [8] Development and validation of 10-year risk prediction models of cardiovascular disease in Chinese type 2 diabetes mellitus patients in primary care using interpretable machine learning-based methods
    Dong, Weinan
    Wan, Eric Yuk Fai
    Fong, Daniel Yee Tak
    Tan, Kathryn Choon-Beng
    Tsui, Wendy Wing-Sze
    Hui, Eric Ming-Tung
    Chan, King Hong
    Fung, Colman Siu Cheung
    Lam, Cindy Lo Kuen
    DIABETES OBESITY & METABOLISM, 2024, 26 (09): : 3969 - 3987
  • [9] Elevated lipoprotein(a) levels predict cardiovascular disease in type 2 diabetes mellitus: a 10-year prospective cohort study
    Lim, Tae-Seok
    Yun, Jae-Seung
    Cha, Seon-Ah
    Song, Ki-Ho
    Yoo, Ki-Dong
    Ahn, Yu-Bae
    Park, Yong-Moon
    Ko, Seung-Hyun
    KOREAN JOURNAL OF INTERNAL MEDICINE, 2016, 31 (06): : 1110 - 1119
  • [10] Utility of Framingham general cardiovascular disease risk score for predicting 10-year cardiovascular risk in an inner Mongolian population: A prospective cohort study
    Peng, Hao
    Jiao, Yang
    Zeng, Qinghua
    Li, Hongmei
    Zhang, Mingzhi
    Wang, Aili
    Zhang, Yonghong
    INTERNATIONAL JOURNAL OF CARDIOLOGY, 2014, 172 (01) : 274 - 275