Machine Learning Improves Prediction Over Logistic Regression on Resected Colon Cancer Patients

Cited by: 19
Authors
Leonard, Grey [1]
South, Charles [2]
Balentine, Courtney [1, 3, 4]
Porembka, Matthew [1]
Mansour, John [1]
Wang, Sam [1]
Yopp, Adam [1]
Polanco, Patricio [1]
Zeh, Herbert [1]
Augustine, Mathew [1, 3]
Affiliations
[1] Univ Texas Southwestern Med Ctr Dallas, Dept Surg, Dallas, TX 75390 USA
[2] Southern Methodist Univ, Dept Stat Sci, Dallas, TX USA
[3] VA North Texas Healthcare Syst, Dallas, TX USA
[4] UTSW Surg Ctr Outcomes Implementat & Novel Interv, Dallas, TX USA
Keywords
Colon cancer; Prediction; Machine learning; Outcomes; Risk; READMISSION; COMPLICATIONS; MODEL; RISK; MORTALITY; COLECTOMY; ADULTS; COST;
DOI
10.1016/j.jss.2022.01.012
Chinese Library Classification number
R61 [Operative surgery];
Abstract
Introduction: Despite advances, readmission and mortality rates for surgical patients with colon cancer remain high. Prediction models using regression techniques allow for risk stratification to aid periprocedural care. Technological advances have enabled large datasets to be analyzed using machine learning (ML) algorithms. A national database of colon cancer patients was selected to determine whether ML methods better predict outcomes following surgery compared to conventional methods.
Methods: Surgical colon cancer patients were identified using the 2013 National Cancer Database (NCDB). The negative outcome was defined as a composite of 30-d unplanned readmission and 30- and 90-d mortality. ML models, including Random Forest and XGBoost, were built and compared with conventional logistic regression. To account for the unbalanced outcome, a synthetic minority oversampling technique (SMOTE) was implemented and the resulting data were used with XGBoost.
Results: Analysis included 528,060 patients. The negative outcome occurred in 11.6% of patients. Model building utilized 30 variables. The primary metric for model comparison was area under the curve (AUC). In comparison to logistic regression (AUC 0.730, 95% CI: 0.725-0.735), AUCs for ML algorithms ranged between 0.748 and 0.757, with the Random Forest model (AUC 0.757, 95% CI: 0.752-0.762) outperforming XGBoost (AUC 0.756, 95% CI: 0.751-0.761) and XGBoost using SMOTE data (AUC 0.748, 95% CI: 0.743-0.753).
Conclusions: We show that a large registry of surgical colon cancer patients can be utilized to build ML models that improve outcome prediction, with differing discriminative ability across models. These results reveal the potential of these methods to enhance risk prediction, leading to improved strategies to mitigate those risks. (c) 2022 Elsevier Inc. All rights reserved.
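The abstract compares models by AUC with a 95% CI but does not say how the intervals were computed. As an illustration only, the sketch below computes AUC from the rank-sum (Mann-Whitney) statistic and a percentile bootstrap interval in plain Python. The function names `auc` and `bootstrap_auc_ci`, the bootstrap approach, and the default of 1000 resamples are assumptions made for this example, not the authors' method.

```python
import random


def auc(labels, scores):
    """AUC via the Mann-Whitney rank-sum statistic; tied scores share an average rank."""
    n = len(labels)
    n_pos = sum(labels)
    n_neg = n - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("AUC needs at least one positive and one negative label")
    order = sorted(range(n), key=lambda i: scores[i])
    ranks = [0.0] * n
    i = 0
    while i < n:
        # Extend j to the end of the group of tied scores starting at i.
        j = i
        while j + 1 < n and scores[order[j + 1]] == scores[order[i]]:
            j += 1
        avg_rank = (i + j) / 2 + 1  # 1-based average rank of the tied group
        for k in range(i, j + 1):
            ranks[order[k]] = avg_rank
        i = j + 1
    rank_sum_pos = sum(r for r, y in zip(ranks, labels) if y == 1)
    return (rank_sum_pos - n_pos * (n_pos + 1) / 2) / (n_pos * n_neg)


def bootstrap_auc_ci(labels, scores, n_boot=1000, alpha=0.05, seed=0):
    """Return (point estimate, lower, upper) using a percentile bootstrap (hypothetical choice)."""
    point = auc(labels, scores)  # also validates that both classes are present
    rng = random.Random(seed)
    n = len(labels)
    stats = []
    while len(stats) < n_boot:
        idx = [rng.randrange(n) for _ in range(n)]
        ys = [labels[i] for i in idx]
        if 0 < sum(ys) < n:  # skip resamples that contain only one class
            stats.append(auc(ys, [scores[i] for i in idx]))
    stats.sort()
    lower = stats[int((alpha / 2) * n_boot)]
    upper = stats[min(n_boot - 1, int((1 - alpha / 2) * n_boot))]
    return point, lower, upper
```

Here `labels` would hold 1 for the composite negative outcome and 0 otherwise, and `scores` would hold each model's predicted probabilities on held-out patients. Calling `bootstrap_auc_ci` once per model gives intervals that can be compared side by side, as the Results do.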
Pages: 181-193
Number of pages: 13