Machine Learning Improves Prediction Over Logistic Regression on Resected Colon Cancer Patients

被引:19
|
作者
Leonard, Grey [1 ]
South, Charles [2 ]
Balentine, Courtney [1 ,3 ,4 ]
Porembka, Matthew [1 ]
Mansour, John [1 ]
Wang, Sam [1 ]
Yopp, Adam [1 ]
Polanco, Patricio [1 ]
Zeh, Herbert [1 ]
Augustine, Mathew [1 ,3 ]
机构
[1] Univ Texas Southwestern Med Ctr Dallas, Dept Surg, Dallas, TX 75390 USA
[2] Southern Methodist Univ, Dept Stat Sci, Dallas, TX USA
[3] VA North Texas Healthcare Syst, Dallas, TX USA
[4] UTSW Surg Ctr Outcomes Implementat & Novel Interv, Dallas, TX USA
关键词
Colon cancer; Prediction; Machine learning; Outcomes; Risk; READMISSION; COMPLICATIONS; MODEL; RISK; MORTALITY; COLECTOMY; ADULTS; COST;
D O I
10.1016/j.jss.2022.01.012
中图分类号
R61 [外科手术学];
学科分类号
摘要
Introduction: Despite advances, readmission and mortality rates for surgical patients with colon cancer remain high. Prediction models using regression techniques allows for risk stratification to aid periprocedural care. Technological advances have enabled large data to be analyzed using machine learning (ML) algorithms. A national database of colon cancer patients was selected to determine whether ML methods better predict outcomes following surgery compared to conventional methods. Methods: Surgical colon cancer patients were identified using the 2013 National Cancer Database (NCDB). The negative outcome was defined as a composite of 30-d unplanned readmission and 30-and 90-d mortality. ML models, including Random Forest and XGBoost, were built and compared with conventional logistic regression. For the ac-counting of unbalanced outcomes, a synthetic minority oversampling technique (SMOTE) was implemented and applied using XGBoost. Results: Analysis included 528,060 patients. The negative outcome occurred in 11.6% of patients. Model building utilized 30 variables. The primary metric for model comparison was area under the curve (AUC). In comparison to logistic regression (AUC 0.730, 95% CI: 0.725-0.735), AUC's for ML algorithms ranged between 0.748 and 0.757, with the Random Forest model (AUC 0.757, 95% CI: 0.752-0.762) outperforming XGBoost (AUC 0.756, 95% CI: 0.751-0.761) and XGBoost using SMOTE data (AUC 0.748, 95% CI: 0.743-0.753). Conclusions: We show that a large registry of surgical colon cancer patients can be utilized to build ML models to improve outcome prediction with differential discriminative ability. These results reveal the potential of these methods to enhance risk prediction, leading to improved strategies to mitigate those risks. (c) 2022 Elsevier Inc. All rights reserved.
引用
收藏
页码:181 / 193
页数:13
相关论文
共 50 条
  • [21] Research on Logistic Regression Algorithm of Breast Cancer Diagnose Data by Machine Learning
    Lei, Liu
    2018 INTERNATIONAL CONFERENCE ON ROBOTS & INTELLIGENT SYSTEM (ICRIS 2018), 2018, : 157 - 160
  • [22] Explainable Machine Learning Approach to Prediction of Prolonged Intesive Care Unit Stay in Adult Spinal Deformity Patients: Machine Learning Outperforms Logistic Regression
    Zaidat, Bashar
    Kurapatti, Mark
    Gal, Jonathan S.
    Cho, Samuel K.
    Kim, Jun S.
    GLOBAL SPINE JOURNAL, 2024,
  • [24] Machine learning versus regression for prediction of sporadic pancreatic cancer
    Chen, Wansu
    Zhou, Botao
    Jeon, Christie Y.
    Xie, Fagen
    Lin, Yu-Chen
    Butler, Rebecca K.
    Zhou, Yichen
    Luong, Tiffany Q.
    Lustigova, Eva
    Pisegna, Joseph R.
    Wu, Bechien U.
    PANCREATOLOGY, 2023, 23 (04) : 396 - 402
  • [25] A Comparison of Machine Learning and Logistic Regression Performance for the Prediction of In-Hospital Mortality Among Acute Biliary Pancreatitis Patients
    Luthra, Anjuli K.
    Porter, Kyle
    Hinton, Alice
    Krishna, Somashekar G.
    AMERICAN JOURNAL OF GASTROENTEROLOGY, 2019, 114 : S48 - S49
  • [26] Explainable Machine Learning for Improving Logistic Regression Models
    Yang, Yimin
    Wu, Min
    2021 IEEE 19TH INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS (INDIN), 2021,
  • [27] Prediction of Gastrointestinal Bleeding Hospitalization Risk in Hemodialysis: Machine Learning vs. Logistic Regression
    Lama, Suman Kumar
    Larkin, John W.
    Chaudhuri, Sheetal
    Jiao, Yue
    Winter, Anke
    StaussGrabo, Manuela
    Usvyat, Len A.
    Hymes, Jeffrey L.
    Maddux, Franklin W.
    Wheeler, David C.
    Stenvinkel, Peter
    Floege, Jurgen
    JOURNAL OF THE AMERICAN SOCIETY OF NEPHROLOGY, 2024, 35 (10):
  • [28] Machine learning did not beat logistic regression in time series prediction for severe asthma exacerbations
    Anne A. H. de Hond
    Ilse M. J. Kant
    Persijn J. Honkoop
    Andrew D. Smith
    Ewout W. Steyerberg
    Jacob K. Sont
    Scientific Reports, 12
  • [29] Machine learning and regular logistic regression predictive models for prediction of difficult laryngoscopies: a prospective cohort
    Regueira, Stephanie L.
    de Carvalho, Clistenes C.
    de Souza, Ana Beatriz S.
    Rego, Caroline O.
    Ramos, Isabella B.
    da Silva, Danielle M.
    ANESTHESIA AND ANALGESIA, 2021, 132 (5S_SUPPL): : 54 - 54
  • [30] Prediction of Neurodevelopmental Outcomes in Very Preterm Infants: Comparing Machine Learning Methods to Logistic Regression
    Afifi, Jehier
    Ahmad, Tahani
    Guida, Alessandro
    Vincer, Michael John
    Stewart, Samuel Alan
    CHILDREN-BASEL, 2024, 11 (12):