Stroke risk prediction using machine learning: a prospective cohort study of 0.5 million Chinese adults

Cited by: 29
Authors
Chun, Matthew [1, 2, 3]
Clarke, Robert [1, 2]
Cairns, Benjamin J. [1, 2, 4]
Clifton, David [3, 5]
Bennett, Derrick [1, 2]
Chen, Yiping [1, 2, 4]
Guo, Yu [6]
Pei, Pei [6]
Lv, Jun [7]
Yu, Canqing [7]
Yang, Ling [1, 2]
Li, Liming [7]
Chen, Zhengming [4]
Zhu, Tingting [3]
Affiliations
[1] Univ Oxford, Nuffield Dept Populat Hlth, Clin Trial Serv Unit, Oxford, England
[2] Univ Oxford, Nuffield Dept Populat Hlth, Epidemiol Studies, Oxford, England
[3] Univ Oxford, Dept Engn Sci, Oxford, England
[4] Univ Oxford, Med Res Council, Populat Hlth Res Unit, Oxford, England
[5] Oxford Suzhou Ctr Adv Res, Suzhou, Peoples R China
[6] Chinese Acad Med Sci, Beijing, Peoples R China
[7] Peking Univ, Sch Publ Hlth, Dept Epidemiol & Biostat, Hlth Sci Ctr, Beijing, Peoples R China
Funding
Wellcome Trust (UK); Medical Research Council (UK)
Keywords
stroke; cardiovascular diseases; machine learning; risk assessment; China; PRIMARY PREVENTION; INCIDENT STROKE; VALIDATION; DISEASE; POPULATION; DERIVATION; STATEMENT; KADOORIE; PROFILE; SCORE;
DOI
10.1093/jamia/ocab068
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Objective: To compare Cox models, machine learning (ML), and ensemble models combining both approaches, for prediction of stroke risk in a prospective study of Chinese adults.
Materials and Methods: We evaluated models for stroke risk at varying intervals of follow-up (<9 years, 0-3 years, 3-6 years, 6-9 years) in 503 842 adults without prior history of stroke recruited from 10 areas in China in 2004-2008. Inputs included sociodemographic factors, diet, medical history, physical activity, and physical measurements. We compared discrimination and calibration of Cox regression, logistic regression, support vector machines, random survival forests, gradient boosted trees (GBT), and multilayer perceptrons, benchmarking performance against the 2017 Framingham Stroke Risk Profile. We then developed an ensemble approach to identify individuals at high risk of stroke (>10% predicted 9-yr stroke risk) by selectively applying either a GBT or Cox model based on individual-level characteristics.
Results: For 9-yr stroke risk prediction, GBT provided the best discrimination (AUROC: 0.833 in men, 0.836 in women) and calibration, with consistent results in each interval of follow-up. The ensemble approach yielded incrementally higher accuracy (men: 76%, women: 80%), specificity (men: 76%, women: 81%), and positive predictive value (men: 26%, women: 24%) compared to any of the single-model approaches.
Discussion and Conclusion: Among several approaches, an ensemble model combining both GBT and Cox models achieved the best performance for identifying individuals at high risk of stroke in a contemporary study of Chinese adults. The results highlight the potential value of expanding the use of ML in clinical practice.
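The abstract reports accuracy, specificity, and positive predictive value for flagging individuals whose predicted 9-year stroke risk exceeds 10%. As a rough illustration of how such figures can be derived from predicted risks and observed outcomes, here is a minimal Python sketch. It is not the authors' code: the function names and the toy data are hypothetical, and the step that chooses between a GBT and a Cox model for each individual is not shown, since the abstract does not describe it in detail.

```python
def flag_high_risk(predicted_risks, threshold=0.10):
    """Flag individuals whose predicted risk is strictly above the threshold."""
    return [risk > threshold for risk in predicted_risks]


def screening_metrics(flagged, had_stroke):
    """Compute accuracy, specificity, and PPV from binary flags and outcomes."""
    pairs = list(zip(flagged, had_stroke))
    tp = sum(1 for f, y in pairs if f and y)          # flagged, had stroke
    fp = sum(1 for f, y in pairs if f and not y)      # flagged, no stroke
    tn = sum(1 for f, y in pairs if not f and not y)  # not flagged, no stroke
    fn = sum(1 for f, y in pairs if not f and y)      # not flagged, had stroke
    total = tp + fp + tn + fn
    return {
        "accuracy": (tp + tn) / total if total else None,
        "specificity": tn / (tn + fp) if (tn + fp) else None,
        "ppv": tp / (tp + fp) if (tp + fp) else None,
    }


# Toy data (made up for illustration; not from the study).
risks = [0.02, 0.15, 0.30, 0.08, 0.12, 0.05, 0.01, 0.22]
outcomes = [False, False, True, False, True, False, False, False]

flags = flag_high_risk(risks)
metrics = screening_metrics(flags, outcomes)
print(metrics)
```

Positive predictive value depends strongly on how common the outcome is, so a modest PPV can sit alongside a much higher specificity when strokes are relatively rare in the cohort.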
Pages: 1719-1727
Page count: 9
Related papers
50 in total
  • [1] Tea consumption and risk of stroke in Chinese adults: a prospective cohort study of 0.5 million men and women
    Tian, Tian
    Lv, Jun
    Jin, Guangfu
    Yu, Canqing
    Guo, Yu
    Bian, Zheng
    Yang, Ling
    Chen, Yiping
    Shen, Hongbing
    Chen, Zhengming
    Hu, Zhibin
    Li, Liming
    [J]. AMERICAN JOURNAL OF CLINICAL NUTRITION, 2020, 111 (01) : 197 - 206
  • [2] Chronic hepatitis B virus infection and risk of stroke types: a prospective cohort study of 0.5 million Chinese adults
    Hamilton, Elizabeth
    Yang, Ling
    Millwood, Iona
    Chen, Zhengming
    [J]. JOURNAL OF HEPATOLOGY, 2023, 78 : S74 - S74
  • [3] Development, validation and comparison of multivariable risk scores for prediction of total stroke and stroke types in Chinese adults: a prospective study of 0.5 million adults
    Chun, Matthew
    Clarke, Robert
    Zhu, Tingting
    Clifton, David
    Bennett, Derrick A.
    Chen, Yiping
    Guo, Yu
    Pei, Pei
    Lv, Jun
    Yu, Canqing
    Yang, Ling
    Li, Liming
    Chen, Zhengming
    Cairns, Benjamin J.
    [J]. STROKE AND VASCULAR NEUROLOGY, 2022, 7 (04) : 328 - 336
  • [4] Prediction and clinical utility of a liver cancer risk model in Chinese adults: A prospective cohort study of 0.5 million people
    Yu, Chengxiao
    Song, Ci
    Lv, Jun
    Zhu, Meng
    Yu, Canqing
    Guo, Yu
    Yang, Ling
    Chen, Yiping
    Chen, Zhengming
    Jiang, Tao
    Ma, Hongxia
    Jin, Guangfu
    Shen, Hongbing
    Hu, Zhibin
    Li, Liming
    [J]. INTERNATIONAL JOURNAL OF CANCER, 2021, 148 (12) : 2924 - 2934
  • [5] Development and validation of risk prediction model for lung cancer in Chinese populations: A prospective cohort study of 0.5 million adults
    Zhu, Meng
    Song, Ci
    Ma, Hongxia
    Shen, Hongbing
    [J]. GENETIC EPIDEMIOLOGY, 2020, 44 (05) : 517 - 518
  • [6] Association of Major Depressive Episodes With Stroke Risk in a Prospective Study of 0.5 Million Chinese Adults
    Sun, Jie
    Ma, Hongxia
    Yu, Canqing
    Lv, Jun
    Guo, Yu
    Bian, Zheng
    Yang, Ling
    Chen, Yiping
    Shen, Hongbing
    Chen, Zhengming
    Hu, Zhibin
    Li, Liming
    [J]. STROKE, 2016, 47 (09) : 2203 - 2208
  • [7] Habitual Tea Consumption and Risk of Fracture in 0.5 Million Chinese Adults: A Prospective Cohort Study
    Shen, Qian
    Yu, Canqing
    Guo, Yu
    Bian, Zheng
    Zhu, Nanbo
    Yang, Ling
    Chen, Yiping
    Luo, Guojin
    Li, Jianguo
    Qin, Yulu
    Chen, Junshi
    Chen, Zhengming
    Lv, Jun
    Li, Liming
    [J]. NUTRIENTS, 2018, 10 (11)
  • [8] Adiposity and risk of ischaemic and haemorrhagic stroke in 0.5 million Chinese men and women: a prospective cohort study
    Chen, Zhengming
    Iona, Andri
    Parish, Sarah
    Chen, Yiping
    Guo, Yu
    Bragg, Fiona
    Yang, Ling
    Bian, Zheng
    Holmes, Michael V.
    Lewington, Sarah
    Lacey, Ben
    Gao, Ruqin
    Liu, Fang
    Zhang, Zengzhi
    Chen, Junshi
    Walters, Robin G.
    Collins, Rory
    Clarke, Robert
    Peto, Richard
    Li, Liming
    [J]. LANCET GLOBAL HEALTH, 2018, 6 (06) : E630 - E640
  • [9] Association between tea consumption and risk of cancer: a prospective cohort study of 0.5 million Chinese adults
    Li, Xinyi
    Yu, Canqing
    Guo, Yu
    Bian, Zheng
    Shen, Zewei
    Yang, Ling
    Chen, Yiping
    Wei, Yongyue
    Zhang, Hao
    Qiu, Zhe
    Chen, Junshi
    Chen, Feng
    Chen, Zhengming
    Lv, Jun
    Li, Liming
    [J]. EUROPEAN JOURNAL OF EPIDEMIOLOGY, 2019, 34 (08) : 753 - 763