Predicting adverse outcomes due to diabetes complications with machine learning using administrative health data

被引:46
|
作者
Ravaut, Mathieu [1 ,2 ]
Sadeghi, Hamed [1 ]
Leung, Kin Kwan [1 ]
Volkovs, Maksims [1 ]
Kornas, Kathy [3 ]
Harish, Vinyas [3 ,4 ]
Watson, Tristan [3 ,5 ]
Lewis, Gary F. [6 ,7 ]
Weisman, Alanna [8 ,9 ]
Poutanen, Tomi [1 ]
Rosella, Laura [3 ,5 ,10 ,11 ,12 ]
机构
[1] Layer 6 AI, Toronto, ON, Canada
[2] Univ Toronto, Dept Comp Sci, Toronto, ON, Canada
[3] Univ Toronto, Dalla Lana Sch Publ Hlth, Toronto, ON, Canada
[4] Univ Toronto, Temerty Fac Med, MD PhD Program, Toronto, ON, Canada
[5] ICES, Toronto, ON, Canada
[6] Univ Toronto, Temerty Fac Med, Dept Med, Toronto, ON, Canada
[7] Univ Toronto, Temerty Fac Med, Dept Physiol, Toronto, ON, Canada
[8] Mt Sinai Hosp, Lunenfeld Tanenbaum Res Inst, Toronto, ON, Canada
[9] Univ Toronto, Temerty Fac Med, Div Endocrinol & Metab, Toronto, ON, Canada
[10] Vector Inst, Toronto, ON, Canada
[11] Trillium Hlth Partners, Inst Better Hlth, Mississauga, ON, Canada
[12] Univ Toronto, Temerty Fac Med, Dept Lab Med & Pathol, Toronto, ON, Canada
基金
加拿大健康研究院;
关键词
CARDIOVASCULAR-DISEASE; MAJOR COMPLICATIONS; RISK; COSTS; POPULATION; CARE; EPIDEMIOLOGY; EXPLANATIONS; PREVALENCE; PREVENTION;
D O I
10.1038/s41746-021-00394-8
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Across jurisdictions, government and health insurance providers hold a large amount of data from patient interactions with the healthcare system. We aimed to develop a machine learning-based model for predicting adverse outcomes due to diabetes complications using administrative health data from the single-payer health system in Ontario, Canada. A Gradient Boosting Decision Tree model was trained on data from 1,029,366 patients, validated on 272,864 patients, and tested on 265,406 patients. Discrimination was assessed using the AUC statistic and calibration was assessed visually using calibration plots overall and across population subgroups. Our model predicting three-year risk of adverse outcomes due to diabetes complications (hyper/hypoglycemia, tissue infection, retinopathy, cardiovascular events, amputation) included 700 features from multiple diverse data sources and had strong discrimination (average test AUC = 77.7, range 77.7-77.9). Through the design and validation of a high-performance model to predict diabetes complications adverse outcomes at the population level, we demonstrate the potential of machine learning and administrative health data to inform health planning and healthcare resource allocation for diabetes management.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Predicting return to work after traumatic brain injury using machine learning and administrative data
    Van Deynse, Helena
    Cools, Wilfried
    De Deken, Viktor-Jan
    Depreitere, Bart
    Hubloue, Ives
    Kimpe, Eva
    Moens, Maarten
    Pien, Karen
    Tisseghem, Ellen
    Van Belleghem, Griet
    Putman, Koen
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2023, 178
  • [22] Predicting Diabetes Diseases Using Mixed Data and Supervised Machine Learning Algorithms
    Daanouni, Othmane
    Cherradi, Bouchaib
    Tmiri, Amal
    4TH INTERNATIONAL CONFERENCE ON SMART CITY APPLICATIONS (SCA' 19), 2019,
  • [23] PREDICTING SPINE SURGERY COMPLICATIONS USING MACHINE LEARNING
    Hoda, Mohamad
    El Saddik, Abdulmotaleb
    Wai, Eugene
    Phan, Philippe
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2019, : 49 - 53
  • [24] Using machine learning for predicting outcomes in ACLF
    Tonon, Marta
    Moreau, Richard
    LIVER INTERNATIONAL, 2022, 42 (11) : 2354 - 2355
  • [25] Intrahospital transfers and adverse patient outcomes: An analysis of administrative health data
    Blay, Nicole
    Roche, Michael
    Duffield, Christine
    Xu, Xiaoyue
    JOURNAL OF CLINICAL NURSING, 2017, 26 (23-24) : 4927 - 4935
  • [26] Identification of postoperative complications using electronic health record data and machine learning
    Bronsert, Michael
    Singh, Abhinav B.
    Henderson, William G.
    Hammermeister, Karl
    Meguid, Robert A.
    Colborn, Kathryn L.
    AMERICAN JOURNAL OF SURGERY, 2020, 220 (01): : 114 - 119
  • [27] Applying machine learning approaches for predicting obesity risk using US health administrative claims database
    Choong, Casey
    Brnabic, Alan
    Chinthammit, Chanadda
    Ravuri, Meena
    Terrell, Kendra
    Kan, Hong
    BMJ OPEN DIABETES RESEARCH & CARE, 2024, 12 (05)
  • [28] Using administrative data to assess health care outcomes
    Ayanian, JZ
    EUROPEAN HEART JOURNAL, 1999, 20 (23) : 1689 - 1691
  • [29] Risk assessment for adverse neonatal outcomes in pregnancies complicated by gestational diabetes using machine learning
    Houri, Ohad
    Gil, Yotam
    Krispin, Eyal
    Hadar, Eran
    Wiznitzer, Arnon
    Chen, Rony
    Komem, Daphna
    AMERICAN JOURNAL OF OBSTETRICS AND GYNECOLOGY, 2019, 220 (01) : S277 - S277
  • [30] Predicting neonatal respiratory distress syndrome and hypoglycaemia prior to discharge: Leveraging health administrative data and machine learning
    Betts, Kim S.
    Kisely, Steve
    Alati, Rosa
    JOURNAL OF BIOMEDICAL INFORMATICS, 2021, 114