Multidimensional Machine Learning Personalized Prognostic Model in an Early Invasive Breast Cancer Population-Based Cohort in China: Algorithm Validation Study

被引:10
|
作者
Zhong, Xiaorong [1 ]
Luo, Ting [1 ]
Deng, Ling [2 ]
Liu, Pei [3 ]
Hu, Kejia [4 ]
Lu, Donghao [4 ]
Zheng, Dan [2 ]
Luo, Chuanxu [2 ]
Xie, Yuxin [1 ]
Li, Jiayuan [5 ]
He, Ping [1 ]
Pu, Tianjie [6 ]
Ye, Feng [6 ]
Bu, Hong [6 ]
Fu, Bo [3 ]
Zheng, Hong [2 ]
机构
[1] Sichuan Univ, West China Hosp, Canc Ctr, Dept Head Neck & Mammary Gland Oncol, Chengdu, Peoples R China
[2] Sichuan Univ, West China Hosp, Clin Res Ctr Breast, Lab Mol Diag Canc, 37 Guoxuexiang, Chengdu, Peoples R China
[3] Univ Elect Sci & Technol China, Big Data Res Ctr, Sch Comp Sci & Engn, Chengdu, Peoples R China
[4] Karolinska Inst, Dept Med Epidemiol & Biostat, Stockholm, Sweden
[5] Sichuan Univ, West China Sch Publ Hlth, Dept Epidemiol & Biostat, Chengdu, Peoples R China
[6] Sichuan Univ, West China Hosp, Lab Pathol, Chengdu, Peoples R China
关键词
breast cancer; prognosis; machine learning; prediction model; REAL-WORLD DATA; ADJUVANT ONLINE; STAGE; SURVIVAL; WOMEN; RECURRENCE; RISK; DECISIONS; THERAPY; PROGRAM;
D O I
10.2196/19069
中图分类号
R-058 [];
学科分类号
摘要
Background: Current online prognostic prediction models for breast cancer, such as Adjuvant! Online and PREDICT, are based on specific populations. They have been well validated and widely used in the United States and Western Europe; however, several validation attempts in non-European countries have revealed suboptimal predictions. Objective: We aimed to develop an advanced breast cancer prognosis model for disease progression, cancer-specific mortality, and all-cause mortality by integrating tumor, demographic, and treatment characteristics from a large breast cancer cohort in China. Methods: This study was approved by the Clinical Test and Biomedical Ethics Committee of West China Hospital, Sichuan University on May 17, 2012. Data collection for this project was started in May 2017 and ended in March 2019. Data on 5293 women diagnosed with stage I to III invasive breast cancer between 2000 and 2013 were collected. Disease progression, cancer-specific mortality, all-cause mortality, and the likelihood of disease progression or death within a 5-year period were predicted. Extreme gradient boosting was used to develop the prediction model. Model performance was assessed by calculating the area under the receiver operating characteristic curve (AUROC), and the model was calibrated and compared with PREDICT. Results: The training, test, and validation sets comprised 3276 (499 progressions, 202 breast cancer-specific deaths, and 261 all-cause deaths within 5-year follow-up), 1405 (211 progressions, 94 breast cancer-specific deaths, and 129 all-cause deaths), and 612 (109 progressions, 33 breast cancer-specific deaths, and 37 all-cause deaths) women, respectively. The AUROC values for disease progression, cancer-specific mortality, and all-cause mortality were 0.76, 0.88, and 0.82 for training set; 0.79, 0.80, and 0.83 for the test set; and 0.79, 0.84, and 0.88 for the validation set, respectively. Calibration analysis demonstrated good agreement between predicted and observed events within 5 years. Comparable AUROC and calibration results were confirmed in different age, residence status, and receptor status subgroups. Compared with PREDICT, our model showed similar AUROC and improved calibration values. Conclusions: Our prognostic model exhibits high discrimination and good calibration. It may facilitate prognosis prediction and clinical decision making for patients with breast cancer in China.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Population-based validation of the prognostic model ADJUVANT! for early breast cancer
    Olivotto, IA
    Bajdik, CD
    Ravdin, PM
    Speers, CH
    Coldman, AJ
    Norris, BD
    Davis, GJ
    Chia, SK
    Gelmon, KA
    JOURNAL OF CLINICAL ONCOLOGY, 2005, 23 (12) : 2716 - 2725
  • [2] A population-based validation of the prognostic model PREDICT for early breast cancer
    Wishart, G. C.
    Bajdik, C. D.
    Azzato, E. M.
    Dicks, E.
    Greenberg, D. C.
    Rashbass, J.
    Caldas, C.
    Pharoah, P. D. P.
    EJSO, 2011, 37 (05): : 411 - 417
  • [3] Prognostic Factors and a Model for Occult Breast Cancer: A Population-Based Cohort Study
    Zhang, Di
    Zhai, Jingtong
    Li, Lixi
    Wu, Yun
    Ma, Fei
    Xu, Binghe
    JOURNAL OF CLINICAL MEDICINE, 2022, 11 (22)
  • [4] Development and Validation of a Machine Learning Model for Predicting the Prognosis of Older Patients with Gastric Cancer: A Retrospective, Population-Based Cohort Study In China
    Huang, Ze-Ning
    Zhang, Xing-Qi
    Wu, Ju
    Zheng, Chang-Yue
    Xie, Rong-Zhen
    Wu, Ying-Xin
    Liu, Xiao-Dong
    Chen, Qi-Yue
    Xie, Jian-Wei
    Li, Ping
    Zheng, Chao-Hui
    Lin, Jian-Xian
    Zhou, Yanbing
    Huang, Chang-Ming
    SSRN,
  • [5] PREDICT PLUS: A POPULATION-BASED VALIDATION OF A PROGNOSTIC MODEL FOR EARLY BREAST CANCER THAT INCLUDES HER2
    Wishart, G. C.
    Bajdik, C. D.
    Dicks, E.
    Provenzano, E.
    Pharoah, P. D. P.
    BREAST, 2012, 21 (02): : 151 - 151
  • [6] Treatment patterns and outcomes in older women with early breast cancer: a population-based cohort study in China
    Xu Liu
    Dan Zheng
    Yanqi Wu
    Chuanxu Luo
    Yu Fan
    Xiaorong Zhong
    Hong Zheng
    BMC Cancer, 21
  • [7] Treatment patterns and outcomes in older women with early breast cancer: a population-based cohort study in China
    Liu, Xu
    Zheng, Dan
    Wu, Yanqi
    Luo, Chuanxu
    Fan, Yu
    Zhong, Xiaorong
    Zheng, Hong
    BMC CANCER, 2021, 21 (01)
  • [8] Predictive model of prognosis index for invasive micropapillary carcinoma of the breast based on machine learning: a SEER population-based study
    Zirong Jiang
    Yushuai Yu
    Xin Yu
    Mingyao Huang
    Qing Wang
    Kaiyan Huang
    Chuangui Song
    BMC Medical Informatics and Decision Making, 24 (1)
  • [9] Recalibration of the Gail model for predicting invasive breast cancer risk in Spanish women: a population-based cohort study
    Pastor-Barriuso, Roberto
    Ascunce, Nieves
    Ederra, Maria
    Erdozain, Nieves
    Murillo, Alberto
    Ales-Martinez, Jose E.
    Pollan, Marina
    BREAST CANCER RESEARCH AND TREATMENT, 2013, 138 (01) : 249 - 259
  • [10] Recalibration of the Gail model for predicting invasive breast cancer risk in Spanish women: a population-based cohort study
    Roberto Pastor-Barriuso
    Nieves Ascunce
    María Ederra
    Nieves Erdozáin
    Alberto Murillo
    José E. Alés-Martínez
    Marina Pollán
    Breast Cancer Research and Treatment, 2013, 138 : 249 - 259