Comparison of Machine Learning Methods and Conventional Logistic Regressions for Predicting Gestational Diabetes Using Routine Clinical Data: A Retrospective Cohort Study

被引:55
|
作者
Ye, Yunzhen [1 ,2 ]
Xiong, Yu [1 ,2 ]
Zhou, Qiongjie [1 ,2 ]
Wu, Jiangnan [1 ]
Li, Xiaotian [1 ,2 ,3 ,4 ]
Xiao, Xirong [1 ,2 ]
机构
[1] Fudan Univ, Obstet & Gynecol Hosp, Shanghai, Peoples R China
[2] Shanghai Key Lab Female Reprod Endocrine Related, Shanghai, Peoples R China
[3] Shanghai Key Lab Birth Defects, Shanghai, Peoples R China
[4] Fudan Univ, Inst Biochem Sci, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
RISK; HYPERGLYCEMIA; PREGNANCY; MELLITUS;
D O I
10.1155/2020/4168340
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background. Gestational diabetes mellitus (GDM) contributes to adverse pregnancy and birth outcomes. In recent decades, extensive research has been devoted to the early prediction of GDM by various methods. Machine learning methods are flexible prediction algorithms with potential advantages over conventional regression.Objective. The purpose of this study was to use machine learning methods to predict GDM and compare their performance with that of logistic regressions.Methods. We performed a retrospective, observational study including women who attended their routine first hospital visits during early pregnancy and had Down's syndrome screening at 16-20 gestational weeks in a tertiary maternity hospital in China from 2013.1.1 to 2017.12.31. A total of 22,242 singleton pregnancies were included, and 3182 (14.31%) women developed GDM. Candidate predictors included maternal demographic characteristics and medical history (maternal factors) and laboratory values at early pregnancy. The models were derived from the first 70% of the data and then validated with the next 30%. Variables were trained in different machine learning models and traditional logistic regression models. Eight common machine learning methods (GDBT, AdaBoost, LGB, Logistic, Vote, XGB, Decision Tree, and Random Forest) and two common regressions (stepwise logistic regression and logistic regression with RCS) were implemented to predict the occurrence of GDM. Models were compared on discrimination and calibration metrics.Results. In the validation dataset, the machine learning and logistic regression models performed moderately (AUC 0.59-0.74). Overall, the GBDT model performed best (AUC 0.74, 95% CI 0.71-0.76) among the machine learning methods, with negligible differences between them. Fasting blood glucose, HbA1c, triglycerides, and BMI strongly contributed to GDM. A cutoff point for the predictive value at 0.3 in the GBDT model had a negative predictive value of 74.1% (95% CI 69.5%-78.2%) and a sensitivity of 90% (95% CI 88.0%-91.7%), and the cutoff point at 0.7 had a positive predictive value of 93.2% (95% CI 88.2%-96.1%) and a specificity of 99% (95% CI 98.2%-99.4%).Conclusion. In this study, we found that several machine learning methods did not outperform logistic regression in predicting GDM. We developed a model with cutoff points for risk stratification of GDM.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Predicting the risk of gestational diabetes using clinical data with machine learning: a predictive model study
    Kadambi, Adesh
    Fulcher, Isabel
    Venkatesh, Kartik
    Schor, Jonathan S.
    Clapp, Mark A.
    Wen, Timothy
    [J]. AMERICAN JOURNAL OF OBSTETRICS & GYNECOLOGY MFM, 2023, 5 (07)
  • [2] Comparison of logistic regression and machine learning methods for predicting postoperative delirium in elderly patients: A retrospective study
    Song, Yu-Xiang
    Yang, Xiao-Dong
    Luo, Yun-Gen
    Ouyang, Chun-Lei
    Yu, Yao
    Ma, Yu-Long
    Li, Hao
    Lou, Jing-Sheng
    Liu, Yan-Hong
    Chen, Yi-Qiang
    Cao, Jiang-Bei
    Mi, Wei-Dong
    [J]. CNS NEUROSCIENCE & THERAPEUTICS, 2023, 29 (01) : 158 - 167
  • [3] Comparison of logistic regression with machine learning methods for the prediction of fetal growth abnormalities: a retrospective cohort study
    Stefan Kuhle
    Bryan Maguire
    Hongqun Zhang
    David Hamilton
    Alexander C. Allen
    K. S. Joseph
    Victoria M. Allen
    [J]. BMC Pregnancy and Childbirth, 18
  • [4] Comparison of logistic regression with machine learning methods for the prediction of fetal growth abnormalities: a retrospective cohort study
    Kuhle, Stefan
    Maguire, Bryan
    Zhang, Hongqun
    Hamilton, David
    Allen, Alexander C.
    Joseph, K. S.
    Allen, Victoria M.
    [J]. BMC PREGNANCY AND CHILDBIRTH, 2018, 18
  • [5] Comparison of Conventional Logistic Regression and Machine Learning Methods for Predicting Delayed Cerebral Ischemia After Aneurysmal Subarachnoid Hemorrhage: A Multicentric Observational Cohort Study
    Hu, Ping
    Li, Yuntao
    Liu, Yangfan
    Guo, Geng
    Gao, Xu
    Su, Zhongzhou
    Wang, Long
    Deng, Gang
    Yang, Shuang
    Qi, Yangzhi
    Xu, Yang
    Ye, Liguo
    Sun, Qian
    Nie, Xiaohu
    Sun, Yanqi
    Li, Mingchang
    Zhang, Hongbo
    Chen, Qianxue
    [J]. FRONTIERS IN AGING NEUROSCIENCE, 2022, 14
  • [6] Multicenter Comparison Of Conventional Regression And Machine Learning Methods For Predicting Clinical Deterioration On The Wards
    Churpek, M. M.
    Winslow, C. J.
    Meltzer, D. O.
    Kattan, M.
    Edelson, D. P.
    [J]. AMERICAN JOURNAL OF RESPIRATORY AND CRITICAL CARE MEDICINE, 2015, 191
  • [7] Multicenter Comparison of Machine Learning Methods and Conventional Regression for Predicting Clinical Deterioration on the Wards
    Churpek, Matthew M.
    Yuen, Trevor C.
    Winslow, Christopher
    Meltzer, David O.
    Kattan, Michael W.
    Edelson, Dana P.
    [J]. CRITICAL CARE MEDICINE, 2016, 44 (02) : 368 - 374
  • [8] Comparison Between Statistical Model and Machine Learning Methods for Predicting the Risk of Renal Function Decline Using Routine Clinical Data in Health Screening
    Cao, Xia
    Lin, Yanhui
    Yang, Binfang
    Li, Ying
    Zhou, Jiansong
    [J]. RISK MANAGEMENT AND HEALTHCARE POLICY, 2022, 15 : 817 - 826
  • [9] Predicting recurrent gestational diabetes mellitus using artificial intelligence models: a retrospective cohort study
    Chen, Min
    Xu, Weijiao
    Guo, Yanni
    Yan, Jianying
    [J]. ARCHIVES OF GYNECOLOGY AND OBSTETRICS, 2024, 310 (03) : 1621 - 1630
  • [10] Comparison of Statistical Logistic Regression and RandomForest Machine Learning Techniques in Predicting Diabetes
    Daghistani, Tahani
    Alshammari, Riyad
    [J]. JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2020, 11 (02) : 78 - 83