Stroke risk prediction using machine learning: a prospective cohort study of 0.5 million Chinese adults

被引:31
|
作者
Chun, Matthew [1 ,2 ,3 ]
Clarke, Robert [1 ,2 ]
Cairns, Benjamin J. [1 ,2 ,4 ]
Clifton, David [3 ,5 ]
Bennett, Derrick [1 ,2 ]
Chen, Yiping [1 ,2 ,4 ]
Guo, Yu [6 ]
Pei, Pei [6 ]
Lv, Jun [7 ]
Yu, Canqing [7 ]
Yang, Ling [1 ,2 ]
Li, Liming [7 ]
Chen, Zhengming [4 ]
Zhu, Tingting [3 ]
机构
[1] Univ Oxford, Nuffield Dept Populat Hlth, Clin Trial Serv Unit, Oxford, England
[2] Univ Oxford, Nuffield Dept Populat Hlth, Epidemiol Studies, Oxford, England
[3] Univ Oxford, Dept Engn Sci, Oxford, England
[4] Univ Oxford, Med Res Council, Populat Hlth Res Unit, Oxford, England
[5] Oxford Suzhou Ctr Adv Res, Suzhou, Peoples R China
[6] Chinese Acad Med Sci, Beijing, Peoples R China
[7] Peking Univ, Sch Publ Hlth, Dept Epidemiol & Biostat, Hlth Sci Ctr, Beijing, Peoples R China
基金
英国惠康基金; 英国医学研究理事会;
关键词
stroke; cardiovascular diseases; machine learning; risk assessment; China; PRIMARY PREVENTION; INCIDENT STROKE; VALIDATION; DISEASE; POPULATION; DERIVATION; STATEMENT; KADOORIE; PROFILE; SCORE;
D O I
10.1093/jamia/ocab068
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective: To compare Cox models, machine learning (ML), and ensemble models combining both approaches, for prediction of stroke risk in a prospective study of Chinese adults. Materials and Methods: We evaluated models for stroke risk at varying intervals of follow-up (<9 years, 0-3 years, 3-6 years, 6-9 years) in 503 842 adults without prior history of stroke recruited from 10 areas in China in 2004-2008. Inputs included sociodemographic factors, diet, medical history, physical activity, and physical measurements. We compared discrimination and calibration of Cox regression, logistic regression, support vector machines, random survival forests, gradient boosted trees (GBT), and multilayer perceptrons, benchmarking performance against the 2017 Framingham Stroke Risk Profile. We then developed an ensemble approach to identify individuals at high risk of stroke (>10% predicted 9-yr stroke risk) by selectively applying either a GBT or Cox model based on individual-level characteristics. Results: For 9-yr stroke risk prediction, GBT provided the best discrimination (AUROC: 0.833 in men, 0.836 in women) and calibration, with consistent results in each interval of follow-up. The ensemble approach yielded incrementally higher accuracy (men: 76%, women: 80%), specificity (men: 76%, women: 81%), and positive predictive value (men: 26%, women: 24%) compared to any of the single-model approaches. Discussion and Conclusion: Among several approaches, an ensemble model combining both GBT and Cox models achieved the best performance for identifying individuals at high risk of stroke in a contemporary study of Chinese adults. The results highlight the potential value of expanding the use of ML in clinical practice.
引用
收藏
页码:1719 / 1727
页数:9
相关论文
共 50 条
  • [21] Solid fuels for cooking and tobacco use and risk of major chronic liver disease mortality: a prospective cohort study of 0.5 million Chinese adults
    Chan, Ka Hung
    Bennett, Derrick A.
    Kurmi, Om P.
    Yang, Ling
    Chen, Yiping
    Lv, Jun
    Guo, Yu
    Bian, Zheng
    Yu, Canqing
    Chen, Xiaofang
    Dong, Caixia
    Li, Liming
    Chen, Zhengming
    Lam, Kin Bong Hubert
    INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 2020, 49 (01) : 45 - 55
  • [22] Self-Reported Primary Cooking Fuels Use and Risk of Chronic Digestive Diseases: A Prospective Cohort Study of 0.5 Million Chinese Adults
    Wen, Qiaorui
    Liu, Tanxin
    Yu, Yuelin
    Zhang, Yunjing
    Yang, Yingzi
    Zheng, Rongshou
    Li, Liming
    Chen, Ru
    Wang, Shengfeng
    ENVIRONMENTAL HEALTH PERSPECTIVES, 2023, 131 (04)
  • [23] Type 2 Diabetes and Risk of Incident Cancer in China: A Prospective Study Among 0.5 Million Chinese Adults
    Pan, Xiong-Fei
    He, Meian
    Yu, Canqing
    Lv, Jun
    Guo, Yu
    Bian, Zheng
    Yang, Ling
    Chen, Yiping
    Wu, Tangchun
    Chen, Zhengming
    Pan, An
    Li, Liming
    AMERICAN JOURNAL OF EPIDEMIOLOGY, 2018, 187 (07) : 1380 - 1391
  • [24] Associations of egg consumption with cardiovascular disease in a cohort study of 0.5 million Chinese adults
    Qin, Chenxi
    Lv, Jun
    Guo, Yu
    Bian, Zheng
    Si, Jiahui
    Yang, Ling
    Chen, Yiping
    Zhou, Yonglin
    Zhang, Hao
    Liu, Jianjun
    Chen, Junshi
    Chen, Zhengming
    Yu, Canqing
    Li, Liming
    HEART, 2018, 104 (21) : 1756 - 1763
  • [25] Sleep duration and risk of stroke and coronary heart disease: a 9-year community-based prospective study of 0.5 million Chinese adults
    Chen, Yiping
    Kartsonaki, Christiana
    Clarke, Robert
    Guo, Yu
    Du, Huaidong
    Yu, Canqing
    Yang, Ling
    Pei, Pei
    Stevens, Rebecca
    Burgess, Sushila
    Hua, Yujie
    Chen, Junshi
    Lv, Jun
    Li, Liming
    Chen, Zhengming
    BMC NEUROLOGY, 2023, 23 (01)
  • [26] Sleep duration and risk of stroke and coronary heart disease: a 9-year community-based prospective study of 0.5 million Chinese adults
    Yiping Chen
    Christiana Kartsonaki
    Robert Clarke
    Yu Guo
    Huaidong Du
    Canqing Yu
    Ling Yang
    Pei Pei
    Rebecca Stevens
    Sushila Burgess
    Yujie Hua
    Junshi Chen
    Jun Lv
    Liming Li
    Zhengming Chen
    BMC Neurology, 23
  • [27] Chronic hepatitis B virus infection and risk of chronic kidney disease: a population-based prospective cohort study of 0.5 million Chinese adults
    Si, Jiahui
    Yu, Canqing
    Guo, Yu
    Bian, Zheng
    Qin, Chenxi
    Yang, Ling
    Chen, Yiping
    Yin, Li
    Li, Hui
    Lan, Jian
    Chen, Junshi
    Chen, Zhengming
    Lv, Jun
    Li, Liming
    BMC MEDICINE, 2018, 16
  • [28] Chronic hepatitis B virus infection and risk of chronic kidney disease: a population-based prospective cohort study of 0.5 million Chinese adults
    Jiahui Si
    Canqing Yu
    Yu Guo
    Zheng Bian
    Chenxi Qin
    Ling Yang
    Yiping Chen
    Li Yin
    Hui Li
    Jian Lan
    Junshi Chen
    Zhengming Chen
    Jun Lv
    Liming Li
    BMC Medicine, 16
  • [29] Association of anxiety with cardiovascular disease in a Chinese cohort of 0.5 million adults
    Wu, Man
    Zhu, Yunqing
    Lv, Jun
    Guo, Yu
    Yang, Ling
    Chen, Yiping
    Tang, Wei
    Xiang, Shengping
    Sun, Xiaohui
    Chen, Junshi
    Chen, Zhengming
    Yu, Canqing
    Li, Liming
    JOURNAL OF AFFECTIVE DISORDERS, 2022, 315 : 291 - 296
  • [30] Urban and rural disparities in stroke prediction using machine learning among Chinese older adults
    Zhu, Jingjing
    Lin, Luotao
    Si, Lei
    Zhao, Hailei
    Song, Hualing
    Xu, Xianglong
    SCIENTIFIC REPORTS, 2025, 15 (01):