Predicting preterm birth using auto-ML frameworks: a large observational study using electronic inpatient discharge data

被引:3
|
作者
Kong, Deming [1 ]
Tao, Ye [1 ]
Xiao, Haiyan [1 ]
Xiong, Huini [1 ]
Wei, Weizhong [1 ]
Cai, Miao [2 ]
机构
[1] Huazhong Univ Sci & Technol, Wuhan Childrens Hosp, Wuhan Maternal & Child Healthcare Hosp, Tongji Med Coll, Wuhan, Hubei, Peoples R China
[2] Sun Yat Sen Univ, Sch Publ Hlth, Dept Epidemiol, Guangzhou, Guangdong, Peoples R China
来源
FRONTIERS IN PEDIATRICS | 2024年 / 12卷
关键词
preterm birth; machine learning; administrative data; China; autoML; MORTALITY;
D O I
10.3389/fped.2024.1330420
中图分类号
R72 [儿科学];
学科分类号
100202 ;
摘要
Background To develop and compare different AutoML frameworks and machine learning models to predict premature birth.Methods The study used a large electronic medical record database to include 715,962 participants who had the principal diagnosis code of childbirth. Three Automatic Machine Learning (AutoML) were used to construct machine learning models including tree-based models, ensembled models, and deep neural networks on the training sample (N = 536,971). The area under the curve (AUC) and training times were used to assess the performance of the prediction models, and feature importance was computed via permutation-shuffling.Results The H2O AutoML framework had the highest median AUC of 0.846, followed by AutoGluon (median AUC: 0.840) and Auto-sklearn (median AUC: 0.820), and the median training time was the lowest for H2O AutoML (0.14 min), followed by AutoGluon (0.16 min) and Auto-sklearn (4.33 min). Among different types of machine learning models, the Gradient Boosting Machines (GBM) or Extreme Gradient Boosting (XGBoost), stacked ensemble, and random forrest models had better predictive performance, with median AUC scores being 0.846, 0.846, and 0.842, respectively. Important features related to preterm birth included premature rupture of membrane (PROM), incompetent cervix, occupation, and preeclampsia.Conclusions Our study highlights the potential of machine learning models in predicting the risk of preterm birth using readily available electronic medical record data, which have significant implications for improving prenatal care and outcomes.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Prediction of Spontaneous Preterm Birth in At-risk Women Using Thrombospondin 1 from Cervicovaginal Fluid: A Prospective Observational Study
    Stubert, Johannes
    Gruendler, Kathleen
    Gerber, Bernd
    Richter, Dagmar-Ulrike
    Dieterich, Max
    GEBURTSHILFE UND FRAUENHEILKUNDE, 2021, 81 (09) : 1055 - 1064
  • [22] Factors associated with exclusive breastfeeding at hospital discharge: a study using data from the Georgian Birth Registry
    Marie Sigstad Lande
    Ingvild Hersoug Nedberg
    Erik Eik Anda
    International Breastfeeding Journal, 15
  • [23] Factors associated with exclusive breastfeeding at hospital discharge: a study using data from the Georgian Birth Registry
    Lande, Marie Sigstad
    Nedberg, Ingvild Hersoug
    Anda, Erik Eik
    INTERNATIONAL BREASTFEEDING JOURNAL, 2020, 15 (01)
  • [24] Predicting Progression from Gestational Diabetes to Impaired Glucose Tolerance Using Peridelivery Data: An Observational Study
    Bengtson, Angela M.
    Dice, Ana Lucia Espinosa
    Clark, Melissa A.
    Gutman, Roee
    Rouse, Dwight
    Werner, Erika
    AMERICAN JOURNAL OF PERINATOLOGY, 2024, 41 : e282 - e289
  • [25] Prediction of spontaneous preterm birth using supervised machine learning on metabolomic data: A case-cohort study
    Al Ghadban, Yasmina
    Du, Yuheng
    Charnock-Jones, D. Stephen
    Garmire, Lana X.
    Smith, Gordon C. S.
    Sovio, Ulla
    BJOG-AN INTERNATIONAL JOURNAL OF OBSTETRICS AND GYNAECOLOGY, 2024, 131 (07) : 908 - 916
  • [26] Predicting all-cause unplanned readmission within 30 days of discharge using electronic medical record data: A multi-centre study
    Sharmin, Sifat
    Meij, Johannes J.
    Zajac, Jeffrey D.
    Moodie, Alan Rob
    Maier, Andrea B.
    INTERNATIONAL JOURNAL OF CLINICAL PRACTICE, 2021, 75 (08)
  • [27] Electronic Cigarette Users' Perspective on the COVID-19 Pandemic: Observational Study Using Twitter Data
    Gao, Yankun
    Xie, Zidian
    Li, Dongmei
    JMIR PUBLIC HEALTH AND SURVEILLANCE, 2021, 7 (01): : 124 - 130
  • [28] Analgesic prescribing in patients with inflammatory arthritis in England: an observational study using electronic healthcare record data
    Scott, Ian C.
    Bailey, James
    White, Chris
    Mallen, Christian D.
    Muller, Sara
    RHEUMATOLOGY, 2022, 61 (08) : 3201 - 3211
  • [29] A Feasibility Study to Attribute Patients to Primary Interns on Inpatient Ward Teams Using Electronic Health Record Data
    Schumacher, Daniel J.
    Wu, Danny T. Y.
    Meganathan, Karthikeyan
    Li, Lezhi
    Kinnear, Benjamin
    Sall, Dana R.
    Holmboe, Eric
    Carraccio, Carol
    van der Vleuten, Cees
    Busari, Jamiu
    Kelleher, Matthew
    Schauer, Daniel
    Warm, Eric
    ACADEMIC MEDICINE, 2019, 94 (09) : 1376 - 1383
  • [30] Association of Final Discharge Blood Pressure with Post-discharge Outcomes Using Electronic Medical Record Data: a Retrospective Study
    Maleka Khambaty
    Nishant Sahni
    Journal of General Internal Medicine, 2019, 34 : 1373 - 1375