Machine learning versus regression for prediction of sporadic pancreatic cancer

被引:12
|
作者
Chen, Wansu [1 ,6 ]
Zhou, Botao [1 ]
Jeon, Christie Y. [2 ]
Xie, Fagen [1 ]
Lin, Yu-Chen [2 ]
Butler, Rebecca K. [1 ]
Zhou, Yichen [1 ]
Luong, Tiffany Q. [1 ]
Lustigova, Eva [1 ]
Pisegna, Joseph R. [3 ,4 ]
Wu, Bechien U. [5 ]
机构
[1] Kaiser Permanente Southern Calif Res & Evaluat, Pasadena, CA USA
[2] Cedars Sinai Med Ctr, Los Angeles, CA USA
[3] VA Greater Los Angeles Healthcare Syst, Div Gastroenterol & Hepatol, Los Angeles, CA USA
[4] UCLA, David Geffen Sch Med, Dept Med & Human Genet, Los Angeles, CA USA
[5] Southern Calif Permanente Med Grp, Ctr Pancreat Care, Los Angeles Med Ctr, Dept Gastroenterol, Los Angeles, CA USA
[6] Kaiser Permanente Southern Calif, Dept Res & Evaluat, 100 S Robles,2nd Floor, Pasadena, CA 91101 USA
基金
美国国家卫生研究院;
关键词
Risk prediction; Pancreatic cancer; Machine learning versus regression; Random survival forest; eXtreme gradient boosting; RANDOM SURVIVAL FORESTS; MODELS;
D O I
10.1016/j.pan.2023.04.009
中图分类号
R57 [消化系及腹部疾病];
学科分类号
摘要
Background/objectives: There is currently no widely accepted approach to identify patients at increased risk for sporadic pancreatic cancer (PC). We aimed to compare the performance of two machine-learning models with a regression-based model in predicting pancreatic ductal adenocarcinoma (PDAC), the most common form of PC. Methods: This retrospective cohort study consisted of patients 50-84 years of age enrolled in either Kaiser Permanente Southern California (KPSC, model training, internal validation) or the Veterans Affairs (VA, external testing) between 2008 and 2017. The performance of random survival forests (RSF) and eXtreme gradient boosting (XGB) models were compared to that of COX proportional hazards regression (COX). Heterogeneity of the three models were assessed. Results: The KPSC and the VA cohorts consisted of 1.8 and 2.7 million patients with 1792 and 4582 incident PDAC cases within 18 months, respectively. Predictors selected into all three models included age, abdominal pain, weight change, and glycated hemoglobin (A1c). Additionally, RSF selected change in alanine transaminase (ALT), whereas the XGB and COX selected the rate of change in ALT. The COX model appeared to have lower AUC (KPSC: 0.737, 95% CI 0.710-0.764; VA: 0.706, 0.699-0.714), compared to those of RSF (KPSC: 0.767, 0.744-0.791; VA: 0.731, 0.724-0.739) and XGB (KPSC: 0.779, 0.755-0.802; VA: 0.742, 0.735-0.750). Among patients with top 5% predicted risk from all three models (N 1/4 29,663), 117 developed PDAC, of which RSF, XGB and COX captured 84 (9 unique), 87 (4 unique), 87 (19 unique) cases, respectively. Conclusions: The three models complement each other, but each has unique contributions. & COPY; 2023 IAP and EPC. Published by Elsevier B.V. All rights reserved.
引用
收藏
页码:396 / 402
页数:7
相关论文
共 50 条
  • [41] Phytochemicals in Pancreatic Cancer Treatment: A Machine Learning Study
    Genc, Destina Ekingen
    Ozbek, Ozlem
    Oral, Burcu
    Yildirim, Ramazan
    Ercan, Nazar Ileri
    ACS OMEGA, 2023, 9 (01): : 413 - 421
  • [42] Machine learning to predict completion of treatment for pancreatic cancer
    Pasha, Shamsher A.
    Khalid, Abdullah
    Levy, Todd
    Demyan, Lyudmyla
    Hartman, Sarah
    Newman, Elliot
    Weiss, Matthew J.
    King, Daniel A.
    Zanos, Theodoros
    Melis, Marcovalerio
    JOURNAL OF SURGICAL ONCOLOGY, 2024,
  • [43] Predicted Prognosis of Patients with Pancreatic Cancer by Machine Learning
    Yokoyama, Seiya
    Hamada, Taiji
    Higashi, Michiyo
    Matsuo, Kei
    Maemura, Kosei
    Kurahara, Hiroshi
    Horinouchi, Michiko
    Hiraki, Tsubasa
    Sugimoto, Tomoyuki
    Akahane, Toshiaki
    Yonezawa, Suguru
    Kornmann, Marko
    Batra, Surinder K.
    Hollingsworth, Michael A.
    Tanimoto, Akihide
    CLINICAL CANCER RESEARCH, 2020, 26 (10) : 2411 - 2421
  • [44] Machine learning of clinical performance in a pancreatic cancer database
    Hayward, John
    Alvarez, Sergio A.
    Ruiz, Carolina
    Sullivan, Mary
    Tseng, Jennifer
    Whalen, Giles
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2010, 49 (03) : 187 - 195
  • [45] A machine learning based delta-radiomics process for early prediction of treatment response of pancreatic cancer
    Nasief, Haidy
    Zheng, Cheng
    Schott, Diane
    Hall, William
    Tsai, Susan
    Erickson, Beth
    Li, X. Allen
    NPJ PRECISION ONCOLOGY, 2019, 3 (1)
  • [46] Development and validation of a pancreatic cancer prediction model from electronic health records using machine learning
    Appelbaum, Limor
    Cambronero, Jose Pablo
    Pollick, Karla
    Silva, George
    Stevens, Jennifer P.
    Mamon, Harvey J.
    Kaplan, Irving D.
    Rinard, Martin
    JOURNAL OF CLINICAL ONCOLOGY, 2020, 38 (04)
  • [47] Machine learning-based prediction of surgical benefit in borderline resectable and locally advanced pancreatic cancer
    Zhang, Leiming
    Yu, Zehao
    Jin, Rong
    Yang, Xuanang
    Ying, Dongjian
    JOURNAL OF CANCER RESEARCH AND CLINICAL ONCOLOGY, 2023, 149 (13) : 11857 - 11871
  • [48] Machine learning-based prediction of surgical benefit in borderline resectable and locally advanced pancreatic cancer
    Leiming Zhang
    Zehao Yu
    Rong Jin
    Xuanang Yang
    Dongjian Ying
    Journal of Cancer Research and Clinical Oncology, 2023, 149 : 11857 - 11871
  • [49] Risk Prediction of Pancreatic Cancer in Patients With Recent-onset Hyperglycemia A Machine-learning Approach
    Chen, Wansu
    Butler, Rebecca K.
    Lustigova, Eva
    Chari, Suresh T.
    Maitra, Anirban
    Rinaudo, Jo A.
    Wu, Bechien U.
    JOURNAL OF CLINICAL GASTROENTEROLOGY, 2023, 57 (01) : 103 - 110
  • [50] A machine learning based delta-radiomics process for early prediction of treatment response of pancreatic cancer
    Haidy Nasief
    Cheng Zheng
    Diane Schott
    William Hall
    Susan Tsai
    Beth Erickson
    X. Allen Li
    npj Precision Oncology, 3