Multidimensional Machine Learning Personalized Prognostic Model in an Early Invasive Breast Cancer Population-Based Cohort in China: Algorithm Validation Study

被引:10
|
作者
Zhong, Xiaorong [1 ]
Luo, Ting [1 ]
Deng, Ling [2 ]
Liu, Pei [3 ]
Hu, Kejia [4 ]
Lu, Donghao [4 ]
Zheng, Dan [2 ]
Luo, Chuanxu [2 ]
Xie, Yuxin [1 ]
Li, Jiayuan [5 ]
He, Ping [1 ]
Pu, Tianjie [6 ]
Ye, Feng [6 ]
Bu, Hong [6 ]
Fu, Bo [3 ]
Zheng, Hong [2 ]
机构
[1] Sichuan Univ, West China Hosp, Canc Ctr, Dept Head Neck & Mammary Gland Oncol, Chengdu, Peoples R China
[2] Sichuan Univ, West China Hosp, Clin Res Ctr Breast, Lab Mol Diag Canc, 37 Guoxuexiang, Chengdu, Peoples R China
[3] Univ Elect Sci & Technol China, Big Data Res Ctr, Sch Comp Sci & Engn, Chengdu, Peoples R China
[4] Karolinska Inst, Dept Med Epidemiol & Biostat, Stockholm, Sweden
[5] Sichuan Univ, West China Sch Publ Hlth, Dept Epidemiol & Biostat, Chengdu, Peoples R China
[6] Sichuan Univ, West China Hosp, Lab Pathol, Chengdu, Peoples R China
关键词
breast cancer; prognosis; machine learning; prediction model; REAL-WORLD DATA; ADJUVANT ONLINE; STAGE; SURVIVAL; WOMEN; RECURRENCE; RISK; DECISIONS; THERAPY; PROGRAM;
D O I
10.2196/19069
中图分类号
R-058 [];
学科分类号
摘要
Background: Current online prognostic prediction models for breast cancer, such as Adjuvant! Online and PREDICT, are based on specific populations. They have been well validated and widely used in the United States and Western Europe; however, several validation attempts in non-European countries have revealed suboptimal predictions. Objective: We aimed to develop an advanced breast cancer prognosis model for disease progression, cancer-specific mortality, and all-cause mortality by integrating tumor, demographic, and treatment characteristics from a large breast cancer cohort in China. Methods: This study was approved by the Clinical Test and Biomedical Ethics Committee of West China Hospital, Sichuan University on May 17, 2012. Data collection for this project was started in May 2017 and ended in March 2019. Data on 5293 women diagnosed with stage I to III invasive breast cancer between 2000 and 2013 were collected. Disease progression, cancer-specific mortality, all-cause mortality, and the likelihood of disease progression or death within a 5-year period were predicted. Extreme gradient boosting was used to develop the prediction model. Model performance was assessed by calculating the area under the receiver operating characteristic curve (AUROC), and the model was calibrated and compared with PREDICT. Results: The training, test, and validation sets comprised 3276 (499 progressions, 202 breast cancer-specific deaths, and 261 all-cause deaths within 5-year follow-up), 1405 (211 progressions, 94 breast cancer-specific deaths, and 129 all-cause deaths), and 612 (109 progressions, 33 breast cancer-specific deaths, and 37 all-cause deaths) women, respectively. The AUROC values for disease progression, cancer-specific mortality, and all-cause mortality were 0.76, 0.88, and 0.82 for training set; 0.79, 0.80, and 0.83 for the test set; and 0.79, 0.84, and 0.88 for the validation set, respectively. Calibration analysis demonstrated good agreement between predicted and observed events within 5 years. Comparable AUROC and calibration results were confirmed in different age, residence status, and receptor status subgroups. Compared with PREDICT, our model showed similar AUROC and improved calibration values. Conclusions: Our prognostic model exhibits high discrimination and good calibration. It may facilitate prognosis prediction and clinical decision making for patients with breast cancer in China.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Breast cancer survivorship and sexual dysfunction: a population-based cohort study
    Chun-Pin Chang
    Tiffany F. Ho
    John Snyder
    Mark Dodson
    Vikrant Deshmukh
    Michael Newman
    Ankita Date
    N. Lynn Henry
    Mia Hashibe
    Breast Cancer Research and Treatment, 2023, 200 : 103 - 113
  • [42] Genetic implications of bilateral breast cancer: A population-based cohort study
    Hartman, M
    Czene, K
    Reilly, M
    Bergh, J
    Lagiou, P
    Trichopoulos, D
    Adami, HO
    Hall, P
    LANCET ONCOLOGY, 2005, 6 (06): : 377 - 382
  • [43] Breast cancer survivorship and sexual dysfunction: a population-based cohort study
    Chang, Chun-Pin
    Ho, Tiffany F. F.
    Snyder, John
    Dodson, Mark
    Deshmukh, Vikrant
    Newman, Michael
    Date, Ankita
    Henry, N. Lynn
    Hashibe, Mia
    BREAST CANCER RESEARCH AND TREATMENT, 2023, 200 (01) : 103 - 113
  • [44] Diabetes mellitus and breast cancer: a retrospective population-based cohort study
    Lorraine L. Lipscombe
    Pamela J. Goodwin
    Bernard Zinman
    John R. McLaughlin
    Janet E. Hux
    Breast Cancer Research and Treatment, 2006, 98
  • [45] Young age is an independent adverse prognostic factor in early stage breast cancer: a population-based study
    Zhang, Xiao
    Yang, Jian
    Cai, Haoyang
    Ye, Yifeng
    CANCER MANAGEMENT AND RESEARCH, 2018, 10 : 4005 - 4018
  • [46] Machine-learning prediction model for gestational diabetes in twin pregnancies: Population-based cohort study
    Mustafa, Hiba
    Kalafat, Erkan
    Heydari, Mohammad-Hossein
    Nunge, Rebecca
    Khalil, Asma
    BJOG-AN INTERNATIONAL JOURNAL OF OBSTETRICS AND GYNAECOLOGY, 2023, 130 : 38 - 38
  • [47] Machine-learning prediction model for gestational diabetes in twin pregnancies: Population-based cohort study
    Mustafa, Hiba
    Kalafat, Erkan
    Heydari, Mohammad-Hossein
    Nunge, Rebecca
    Khalil, Asma
    BJOG-AN INTERNATIONAL JOURNAL OF OBSTETRICS AND GYNAECOLOGY, 2023, 130 : 38 - 38
  • [48] Development and Validation of an Insulin Resistance Predicting Model Using a Machine-Learning Approach in a Population-Based Cohort in Korea
    Park, Sunmin
    Kim, Chaeyeon
    Wu, Xuangao
    DIAGNOSTICS, 2022, 12 (01)
  • [49] Development and validation of machine learning-augmented algorithm for insulin sensitivity assessment in the community and primary care settings: a population-based study in China
    Zhang, Hao
    Zeng, Tianshu
    Zhang, Jiaoyue
    Zheng, Juan
    Min, Jie
    Peng, Miaomiao
    Liu, Geng
    Zhong, Xueyu
    Wang, Ying
    Qiu, Kangli
    Tian, Shenghua
    Liu, Xiaohuan
    Huang, Hantao
    Surmach, Marina
    Wang, Ping
    Hu, Xiang
    Chen, Lulu
    FRONTIERS IN ENDOCRINOLOGY, 2024, 15
  • [50] Risk-Adapted Starting Age of Personalized Lung Cancer Screening A Population-Based, Prospective Cohort Study in China
    Wang, Chenran
    Dong, Xuesi
    Tan, Fengwei
    Wu, Zheng
    Huang, Yufei
    Zheng, Yadi
    Luo, Zilin
    Xu, Yongjie
    Zhao, Liang
    Li, Jibin
    Zou, Kaiyong
    Cao, Wei
    Wang, Fei
    Ren, Jiansong
    Shi, Jufang
    Chen, Wanqing
    He, Jie
    Li, Ni
    CHEST, 2024, 165 (06) : 1538 - 1554