Performance of Statistical and Machine Learning Risk Prediction Models for Surveillance Benefits and Failures in Breast Cancer Survivors

被引:3
|
作者
Su, Yu-Ru [1 ]
Buist, Diana S. M. [1 ,4 ]
Lee, Janie M. [2 ,3 ]
Ichikawa, Laura [1 ]
Miglioretti, Diana L. [1 ]
Bowles, Erin J. Aiello [1 ]
Wernli, Karen J. [1 ]
Kerlikowske, Karla [5 ,6 ,7 ]
Tosteson, Anna [8 ,9 ]
Lowry, Kathryn P. [2 ,3 ]
Henderson, Louise M. [10 ]
Sprague, Brian L. [11 ,12 ]
Hubbard, Rebecca A. [13 ]
机构
[1] Kaiser Permanente WA, Kaiser Permanente Washington Hlth Res Inst, Seattle, WA USA
[2] Univ Washington, Dept Radiol, Seattle, WA USA
[3] Seattle Canc Care Alliance, Seattle, WA USA
[4] Univ Calif Davis, Dept Publ Hlth Sci, Div Biostat, Davis, CA USA
[5] Univ Calif San Francisco, Dept Med, San Francisco, CA USA
[6] Univ Calif San Francisco, Dept Epidemiol & Biostat, San Francisco, CA USA
[7] Univ Calif San Francisco, Gen Internal Med Sect, Dept Vet Affairs, San Francisco, CA USA
[8] Geisel Sch Med Dartmouth, Dartmouth Inst Hlth Policy & Clin Practice, Lebanon, NH USA
[9] Geisel Sch Med Dartmouth, Norris Cotton Canc Ctr, Lebanon, NH USA
[10] Univ N Carolina, Dept Radiol, Chapel Hill, NC USA
[11] Univ Vermont, Dept Surg, Burlington, VT USA
[12] Univ Vermont, Dept Radiol, Burlington, VT USA
[13] Univ Penn, Dept Biostat Epidemiol & Informat, Perelman Sch Med, Philadelphia, PA 19104 USA
关键词
ANNUAL HAZARD RATES; PERSONAL HISTORY; FOLLOW-UP; REGRESSION; WOMEN; RECURRENCE; SELECTION;
D O I
10.1158/1055-9965.EPI-22-0677
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
Background: Machine learning (ML) approaches facilitate risk prediction model development using high-dimensional predictors and higher-order interactions at the cost of model interpretability and transparency. We compared the relative predictive perfor-mance of statistical and ML models to guide modeling strategy selection for surveillance mammography outcomes in women with a personal history of breast cancer (PHBC).Methods: We cross-validated seven risk prediction models for two surveillance outcomes, failure (breast cancer within 12 months of a negative surveillance mammogram) and benefit (surveillance -detected breast cancer). We included 9,447 mammograms (495 failures, 1,414 benefits, and 7,538 nonevents) from years 1996 to 2017 using a 1:4 matched case-control samples of women with PHBC in the Breast Cancer Surveillance Consortium. We assessed model performance of conventional regression, regularized regres-sions (LASSO and elastic-net), and ML methods (random forests and gradient boosting machines) by evaluating their calibration and, among well-calibrated models, comparing the area under the receiver operating characteristic curve (AUC) and 95% confidence intervals (CI).Results: LASSO and elastic-net consistently provided well -calibrated predicted risks for surveillance failure and benefit. The AUCs of LASSO and elastic-net were both 0.63 (95% CI, 0.60-0.66) for surveillance failure and 0.66 (95% CI, 0.64-0.68) for surveillance benefit, the highest among well-calibrated models.Conclusions: For predicting breast cancer surveillance mam-mography outcomes, regularized regression outperformed other modeling approaches and balanced the trade-off between model flexibility and interpretability. Impact: Regularized regression may be preferred for developing risk prediction models in other contexts with rare outcomes, similar training sample sizes, and low-dimensional features.
引用
收藏
页码:561 / 571
页数:11
相关论文
共 50 条
  • [1] Evolution of Breast Cancer Recurrence Risk Prediction: A Systematic Review of Statistical and Machine Learning-Based Models
    El Haji, Hasna
    Souadka, Amine
    Patel, Bhavik N.
    Sbihi, Nada
    Ramasamy, Gokul
    Patel, Bhavika K.
    Ghogho, Mounir
    Banerjee, Imon
    [J]. JCO CLINICAL CANCER INFORMATICS, 2023, 7
  • [2] Evolution of Breast Cancer Recurrence Risk Prediction: A Systematic Review of Statistical and Machine Learning-Based Models
    El Haji, Hasna
    Souadka, Amine
    Patel, Bhavik N.
    Sbihi, Nada
    Ramasamy, Gokul
    Patel, Bhavika K.
    Ghogho, Mounir
    Banerjee, Imon
    [J]. JCO CLINICAL CANCER INFORMATICS, 2023, 7
  • [3] Machine learning-based models for the prediction of breast cancer recurrence risk
    Duo Zuo
    Lexin Yang
    Yu Jin
    Huan Qi
    Yahui Liu
    Li Ren
    [J]. BMC Medical Informatics and Decision Making, 23
  • [4] Machine learning-based models for the prediction of breast cancer recurrence risk
    Zuo, Duo
    Yang, Lexin
    Jin, Yu
    Qi, Huan
    Liu, Yahui
    Ren, Li
    [J]. BMC MEDICAL INFORMATICS AND DECISION MAKING, 2023, 23 (01)
  • [5] Breast Cancer Prediction using Machine Learning Models
    Iparraguirre-Villanueva, Orlando
    Epifania-Huerta, Andres
    Torres-Ceclen, Carmen
    Ruiz-Alvarado, John
    Cabanillas-Carbonell, Michael
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (02) : 610 - 620
  • [6] Machine learning models in breast cancer survival prediction
    Montazeri, Mitra
    Montazeri, Mohadeseh
    Montazeri, Mahdieh
    Beigzadeh, Amin
    [J]. TECHNOLOGY AND HEALTH CARE, 2016, 24 (01) : 31 - 42
  • [7] Lung Cancer Risk Prediction with Machine Learning Models
    Dritsas, Elias
    Trigka, Maria
    [J]. BIG DATA AND COGNITIVE COMPUTING, 2022, 6 (04)
  • [8] An Assessment of the Predictive Performance of Current Machine Learning-Based Breast Cancer Risk Prediction Models: Systematic Review
    Gao, Ying
    Li, Shu
    Jin, Yujing
    Zhou, Lengxiao
    Sun, Shaomei
    Xu, Xiaoqian
    Li, Shuqian
    Yang, Hongxi
    Zhang, Qing
    Wang, Yaogang
    [J]. JMIR PUBLIC HEALTH AND SURVEILLANCE, 2022, 8 (12):
  • [9] Machine learning techniques for personalized breast cancer risk prediction: comparison with the BCRAT and BOADICEA models
    Ming, Chang
    Viassolo, Valeria
    Probst-Hensch, Nicole
    Chappuis, Pierre O.
    Dinov, Ivo D.
    Katapodi, Maria C.
    [J]. BREAST CANCER RESEARCH, 2019, 21 (1)
  • [10] Machine learning techniques for personalized breast cancer risk prediction: comparison with the BCRAT and BOADICEA models
    Chang Ming
    Valeria Viassolo
    Nicole Probst-Hensch
    Pierre O. Chappuis
    Ivo D. Dinov
    Maria C. Katapodi
    [J]. Breast Cancer Research, 21