Predicting adverse outcomes due to diabetes complications with machine learning using administrative health data

被引:46
|
作者
Ravaut, Mathieu [1 ,2 ]
Sadeghi, Hamed [1 ]
Leung, Kin Kwan [1 ]
Volkovs, Maksims [1 ]
Kornas, Kathy [3 ]
Harish, Vinyas [3 ,4 ]
Watson, Tristan [3 ,5 ]
Lewis, Gary F. [6 ,7 ]
Weisman, Alanna [8 ,9 ]
Poutanen, Tomi [1 ]
Rosella, Laura [3 ,5 ,10 ,11 ,12 ]
机构
[1] Layer 6 AI, Toronto, ON, Canada
[2] Univ Toronto, Dept Comp Sci, Toronto, ON, Canada
[3] Univ Toronto, Dalla Lana Sch Publ Hlth, Toronto, ON, Canada
[4] Univ Toronto, Temerty Fac Med, MD PhD Program, Toronto, ON, Canada
[5] ICES, Toronto, ON, Canada
[6] Univ Toronto, Temerty Fac Med, Dept Med, Toronto, ON, Canada
[7] Univ Toronto, Temerty Fac Med, Dept Physiol, Toronto, ON, Canada
[8] Mt Sinai Hosp, Lunenfeld Tanenbaum Res Inst, Toronto, ON, Canada
[9] Univ Toronto, Temerty Fac Med, Div Endocrinol & Metab, Toronto, ON, Canada
[10] Vector Inst, Toronto, ON, Canada
[11] Trillium Hlth Partners, Inst Better Hlth, Mississauga, ON, Canada
[12] Univ Toronto, Temerty Fac Med, Dept Lab Med & Pathol, Toronto, ON, Canada
基金
加拿大健康研究院;
关键词
CARDIOVASCULAR-DISEASE; MAJOR COMPLICATIONS; RISK; COSTS; POPULATION; CARE; EPIDEMIOLOGY; EXPLANATIONS; PREVALENCE; PREVENTION;
D O I
10.1038/s41746-021-00394-8
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Across jurisdictions, government and health insurance providers hold a large amount of data from patient interactions with the healthcare system. We aimed to develop a machine learning-based model for predicting adverse outcomes due to diabetes complications using administrative health data from the single-payer health system in Ontario, Canada. A Gradient Boosting Decision Tree model was trained on data from 1,029,366 patients, validated on 272,864 patients, and tested on 265,406 patients. Discrimination was assessed using the AUC statistic and calibration was assessed visually using calibration plots overall and across population subgroups. Our model predicting three-year risk of adverse outcomes due to diabetes complications (hyper/hypoglycemia, tissue infection, retinopathy, cardiovascular events, amputation) included 700 features from multiple diverse data sources and had strong discrimination (average test AUC = 77.7, range 77.7-77.9). Through the design and validation of a high-performance model to predict diabetes complications adverse outcomes at the population level, we demonstrate the potential of machine learning and administrative health data to inform health planning and healthcare resource allocation for diabetes management.
引用
下载
收藏
页数:12
相关论文
共 50 条
  • [1] Predicting adverse outcomes due to diabetes complications with machine learning using administrative health data
    Mathieu Ravaut
    Hamed Sadeghi
    Kin Kwan Leung
    Maksims Volkovs
    Kathy Kornas
    Vinyas Harish
    Tristan Watson
    Gary F. Lewis
    Alanna Weisman
    Tomi Poutanen
    Laura Rosella
    npj Digital Medicine, 4
  • [2] Predicting common maternal postpartum complications: leveraging health administrative data and machine learning
    Betts, K. S.
    Kisely, S.
    Alati, R.
    BJOG-AN INTERNATIONAL JOURNAL OF OBSTETRICS AND GYNAECOLOGY, 2019, 126 (06) : 702 - 709
  • [3] Predicting the risk of diabetes complications using machine learning and social administrative data in a country with ethnic inequities in health: Aotearoa New Zealand
    Nhung Nghiem
    Nick Wilson
    Jeremy Krebs
    Truyen Tran
    BMC Medical Informatics and Decision Making, 24 (1)
  • [4] Correction: Predicting the risk of diabetes complications using machine learning and social administrative data in a country with ethnic inequities in health: Aotearoa New Zealand
    Nhung Nghiem
    Nick Wilson
    Jeremy Krebs
    Truyen Tran
    BMC Medical Informatics and Decision Making, 24 (1)
  • [5] Predicting 1-year mortality of patients with diabetes mellitus in Kazakhstan based on administrative health data using machine learning
    Aidar Alimbayev
    Gulnur Zhakhina
    Arnur Gusmanov
    Yesbolat Sakko
    Sauran Yerdessov
    Iliyar Arupzhanov
    Ardak Kashkynbayev
    Amin Zollanvari
    Abduzhappar Gaipov
    Scientific Reports, 13
  • [6] Predicting 1-year mortality of patients with diabetes mellitus in Kazakhstan based on administrative health data using machine learning
    Alimbayev, Aidar
    Zhakhina, Gulnur
    Gusmanov, Arnur
    Sakko, Yesbolat
    Yerdessov, Sauran
    Arupzhanov, Iliyar
    Kashkynbayev, Ardak
    Zollanvari, Amin
    Gaipov, Abduzhappar
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [7] A Machine Learning Approach to Predicting Diabetes Complications
    Jian, Yazan
    Pasquier, Michel
    Sagahyroon, Assim
    Aloul, Fadi
    HEALTHCARE, 2021, 9 (12)
  • [8] Predicting complications of diabetes mellitus using advanced machine learning algorithms
    Ljubic, Branimir
    Hai, Ameen Abdel
    Stanojevic, Marija
    Diaz, Wilson
    Polimac, Daniel
    Pavlovski, Martin
    Obradovic, Zoran
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2020, 27 (09) : 1343 - 1351
  • [9] Predicting Health Outcomes Using Machine Learning in Pediatric Heart Transplantation Using UNOS Data
    Killian, M. O.
    Tian, S.
    Xing, A.
    Gupta, D.
    He, Z.
    JOURNAL OF HEART AND LUNG TRANSPLANTATION, 2023, 42 (04): : S22 - S22
  • [10] PREDICTING THE RISK OF STROKE USING MACHINE LEARNING ON A LARGE ADMINISTRATIVE HEALTH DATABASE
    Ghiani, M.
    Maywald, U.
    Wilke, T.
    VALUE IN HEALTH, 2022, 25 (12) : S14 - S14