Machine Learning Improves Prediction Over Logistic Regression on Resected Colon Cancer Patients

被引:19
|
作者
Leonard, Grey [1 ]
South, Charles [2 ]
Balentine, Courtney [1 ,3 ,4 ]
Porembka, Matthew [1 ]
Mansour, John [1 ]
Wang, Sam [1 ]
Yopp, Adam [1 ]
Polanco, Patricio [1 ]
Zeh, Herbert [1 ]
Augustine, Mathew [1 ,3 ]
机构
[1] Univ Texas Southwestern Med Ctr Dallas, Dept Surg, Dallas, TX 75390 USA
[2] Southern Methodist Univ, Dept Stat Sci, Dallas, TX USA
[3] VA North Texas Healthcare Syst, Dallas, TX USA
[4] UTSW Surg Ctr Outcomes Implementat & Novel Interv, Dallas, TX USA
关键词
Colon cancer; Prediction; Machine learning; Outcomes; Risk; READMISSION; COMPLICATIONS; MODEL; RISK; MORTALITY; COLECTOMY; ADULTS; COST;
D O I
10.1016/j.jss.2022.01.012
中图分类号
R61 [外科手术学];
学科分类号
摘要
Introduction: Despite advances, readmission and mortality rates for surgical patients with colon cancer remain high. Prediction models using regression techniques allows for risk stratification to aid periprocedural care. Technological advances have enabled large data to be analyzed using machine learning (ML) algorithms. A national database of colon cancer patients was selected to determine whether ML methods better predict outcomes following surgery compared to conventional methods. Methods: Surgical colon cancer patients were identified using the 2013 National Cancer Database (NCDB). The negative outcome was defined as a composite of 30-d unplanned readmission and 30-and 90-d mortality. ML models, including Random Forest and XGBoost, were built and compared with conventional logistic regression. For the ac-counting of unbalanced outcomes, a synthetic minority oversampling technique (SMOTE) was implemented and applied using XGBoost. Results: Analysis included 528,060 patients. The negative outcome occurred in 11.6% of patients. Model building utilized 30 variables. The primary metric for model comparison was area under the curve (AUC). In comparison to logistic regression (AUC 0.730, 95% CI: 0.725-0.735), AUC's for ML algorithms ranged between 0.748 and 0.757, with the Random Forest model (AUC 0.757, 95% CI: 0.752-0.762) outperforming XGBoost (AUC 0.756, 95% CI: 0.751-0.761) and XGBoost using SMOTE data (AUC 0.748, 95% CI: 0.743-0.753). Conclusions: We show that a large registry of surgical colon cancer patients can be utilized to build ML models to improve outcome prediction with differential discriminative ability. These results reveal the potential of these methods to enhance risk prediction, leading to improved strategies to mitigate those risks. (c) 2022 Elsevier Inc. All rights reserved.
引用
收藏
页码:181 / 193
页数:13
相关论文
共 50 条
  • [1] Machine Learning Algorithms Improve Outcome Prediction Over Conventional Logistic Regression Using the NCDB on Resected Colon Cancer Patients
    Leonard, G.
    Polanco, P. M.
    Wang, S. C.
    Porembka, M.
    Yopp, A.
    Mansour, J. C.
    Zeh, H.
    South, C.
    Augustine, M.
    ANNALS OF SURGICAL ONCOLOGY, 2020, 27 (SUPPL 1) : S43 - S43
  • [2] Advancing Breast Cancer Prediction using Logistic Regression and Machine Learning Techniques
    Bhuria, Ruchika
    Gill, Kanwarpartap Singh
    Malhotra, Sonal
    Singh, Mukesh
    2ND INTERNATIONAL CONFERENCE ON SUSTAINABLE COMPUTING AND SMART SYSTEMS, ICSCSS 2024, 2024, : 1374 - 1377
  • [3] Machine learning with in silico analysis markedly improves survival prediction modeling in colon cancer patients
    Lee, Choong-Jae
    Baek, Bin
    Cho, Sang Hee
    Jang, Tae-Young
    Jeon, So-El
    Lee, Sunjae
    Lee, Hyunju
    Nam, Jeong-Seok
    CANCER MEDICINE, 2023, 12 (06): : 7603 - 7615
  • [4] Machine learning prediction of financial toxicity in patients with resected lung cancer
    Deboever, Nathaniel
    Al-Tashi, Qasem
    Eisenberg, Michael
    Hofstetter, Wayne
    Mehran, Reza
    Rice, David
    Roth, Jack
    Sepesi, Boris
    Swisher, Stephen
    Vaporciyan, Ara
    Walsh, Garrett
    Antonoff, Mara
    Wu, Jia
    Rajaram, Ravi
    CANCER RESEARCH, 2023, 83 (07)
  • [5] Accuracy of machine learning logistic regression in death prediction for patients of road traffic injury
    Somboon, Sirada
    Phunghassaporn, Naralin
    Tansawet, Amarit
    Lolak, Sermkiat
    ASIAN JOURNAL OF SURGERY, 2022, 45 (01) : 537 - 538
  • [6] A systematic review shows no performance benefit of machine learning over logistic regression for clinical prediction models
    Christodoulou, Evangelia
    Ma, Jie
    Collins, Gary S.
    Steyerberg, Ewout W.
    Verbakel, Jan Y.
    Van Calster, Ben
    JOURNAL OF CLINICAL EPIDEMIOLOGY, 2019, 110 : 12 - 22
  • [7] Machine learning versus logistic regression for the prediction of complications after pancreatoduodenectomy
    Ingwersen, Erik W.
    Stam, Wessel T.
    Meijs, Bono J. V.
    Roor, Joran
    Besselink, Marc G.
    Koerkamp, Bas Groot
    de Hingh, Ignace H. J. T.
    van Santvoort, Hjalmar C.
    Stommel, Martijn W. J.
    Daams, Freek
    SURGERY, 2023, 174 (03) : 435 - 440
  • [8] Heart Disease Prediction Using Logistic Regression Machine Learning Model
    Hrvat, Faris
    Spahic, Lemana
    Aleta, Amina
    MEDICON 2023 AND CMBEBIH 2023, VOL 1, 2024, 93 : 654 - 662
  • [9] Loan Repayment Prediction Using Logistic Regression Ensemble Learning With Machine Learning Algorithms
    Dinh, Thuan Nguyen
    Thanh, Binh Pham
    2022 9TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING & MACHINE INTELLIGENCE, ISCMI, 2022, : 79 - 85
  • [10] Prediction of Heart Disease using Decision Tree over Logistic Regression using Machine Learning with Improved Accuracy
    Raj, K. N. S. Shanmukha
    Thinakaran, K.
    CARDIOMETRY, 2022, (25): : 1514 - 1519