An Assessment of the Predictive Performance of Current Machine Learning-Based Breast Cancer Risk Prediction Models: Systematic Review

被引:5
|
作者
Gao, Ying [1 ]
Li, Shu [2 ]
Jin, Yujing [1 ]
Zhou, Lengxiao [1 ]
Sun, Shaomei [1 ]
Xu, Xiaoqian [1 ]
Li, Shuqian [1 ]
Yang, Hongxi [3 ]
Zhang, Qing [1 ]
Wang, Yaogang [4 ,5 ]
机构
[1] Tianjin Med Univ Gen Hosp, Hlth Management Ctr, Tianjin, Peoples R China
[2] Tianjin Univ Tradit Chinese Med, Sch Management, Tianjin, Peoples R China
[3] Tianjin Med Univ, Sch Basic Med Sci, Dept Bioinformat, Tianjin, Peoples R China
[4] Tianjin Med Univ, Sch Publ Hlth, Tianjin, Peoples R China
[5] Tianjin Med Univ, Sch Publ Hlth, Qixiangtai Rd 22, Tianjin 300070, Peoples R China
来源
JMIR PUBLIC HEALTH AND SURVEILLANCE | 2022年 / 8卷 / 12期
基金
中国国家自然科学基金;
关键词
breast cancer; machine learning; risk prediction; cancer; oncology; systemic review; review; meta-analysis; cancer research; risk model; BIAS; METAANALYSIS; DENSITY; APPLICABILITY; PROBAST; TOOL;
D O I
10.2196/35750
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Background: Several studies have explored the predictive performance of machine learning-based breast cancer risk prediction models and have shown controversial conclusions. Thus, the performance of the current machine learning-based breast cancer risk prediction models and their benefits and weakness need to be evaluated for the future development of feasible and efficient risk prediction models.Objective: The aim of this review was to assess the performance and the clinical feasibility of the currently available machine learning-based breast cancer risk prediction models.Methods: We searched for papers published until June 9, 2021, on machine learning-based breast cancer risk prediction models in PubMed, Embase, and Web of Science. Studies describing the development or validation models for predicting future breast cancer risk were included. The Prediction Model Risk of Bias Assessment Tool (PROBAST) was used to assess the risk of bias and the clinical applicability of the included studies. The pooled area under the curve (AUC) was calculated using the DerSimonian and Laird random-effects model.Results: A total of 8 studies with 10 data sets were included. Neural network was the most common machine learning method for the development of breast cancer risk prediction models. The pooled AUC of the machine learning-based optimal risk prediction model reported in each study was 0.73 (95% CI 0.66-0.80; approximate 95% prediction interval 0.56-0.96), with a high level of heterogeneity between studies (Q=576.07, I2=98.44%; P<.001). The results of head-to-head comparison of the performance difference between the 2 types of models trained by the same data set showed that machine learning models had a slightly higher advantage than traditional risk factor-based models in predicting future breast cancer risk. The pooled AUC of the neural network-based risk prediction model was higher than that of the nonneural network-based optimal risk prediction model (0.71 vs 0.68, respectively). Subgroup analysis showed that the incorporation of imaging features in risk models resulted in a higher pooled AUC than the nonincorporation of imaging features in risk models (0.73 vs 0.61; Pheterogeneity=.001, respectively). The PROBAST analysis indicated that many machine learning models had high risk of bias and poorly reported calibration analysis.Conclusions: Our review shows that the current machine learning-based breast cancer risk prediction models have some technical pitfalls and that their clinical feasibility and reliability are unsatisfactory.(JMIR Public Health Surveill 2022;8(12):e35750) doi: 10.2196/35750
引用
收藏
页数:15
相关论文
共 50 条
  • [41] A Systematic Review on Machine Learning and Deep Learning Based Predictive Models for Health Informatics
    Aloyuni, Saleh Abdullah
    [J]. JOURNAL OF PHARMACEUTICAL RESEARCH INTERNATIONAL, 2021, 33 (47B) : 183 - 194
  • [42] Pre-existing and machine learning-based models for cardiovascular risk prediction
    Sang-Yeong Cho
    Sun-Hwa Kim
    Si-Hyuck Kang
    Kyong Joon Lee
    Dongjun Choi
    Seungjin Kang
    Sang Jun Park
    Tackeun Kim
    Chang-Hwan Yoon
    Tae-Jin Youn
    In-Ho Chae
    [J]. Scientific Reports, 11
  • [43] Pre-existing and machine learning-based models for cardiovascular risk prediction
    Cho, Sang-Yeong
    Kim, Sun-Hwa
    Kang, Si-Hyuck
    Lee, Kyong Joon
    Choi, Dongjun
    Kang, Seungjin
    Park, Sang Jun
    Kim, Tackeun
    Yoon, Chang-Hwan
    Youn, Tae-Jin
    Chae, In-Ho
    [J]. SCIENTIFIC REPORTS, 2021, 11 (01) : 8886
  • [44] Risk prediction models of breast cancer: a systematic review of model performances
    Anothaisintawee, Thunyarat
    Teerawattananon, Yot
    Wiratkapun, Chollathip
    Kasamesup, Vijj
    Thakkinstian, Ammarin
    [J]. BREAST CANCER RESEARCH AND TREATMENT, 2012, 133 (01) : 1 - 10
  • [45] Risk prediction models of breast cancer: a systematic review of model performances
    Thunyarat Anothaisintawee
    Yot Teerawattananon
    Chollathip Wiratkapun
    Vijj Kasamesup
    Ammarin Thakkinstian
    [J]. Breast Cancer Research and Treatment, 2012, 133 : 1 - 10
  • [46] Performance tuning for machine learning-based software development effort prediction models
    Ertugrul, Egemen
    Baytar, Zakir
    Catal, Cagatay
    Muratli, Can
    [J]. TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2019, 27 (02) : 1308 - 1324
  • [47] Predictive Performance of Machine Learning-Based Methods for the Prediction of Preeclampsia-A Prospective Study
    Melinte-Popescu, Alina-Sinziana
    Vasilache, Ingrid-Andrada
    Socolov, Demetra
    Melinte-Popescu, Marian
    [J]. JOURNAL OF CLINICAL MEDICINE, 2023, 12 (02)
  • [48] Systematic review finds "spin"practices and poor reporting standards in studies on machine learning-based prediction models
    Navarro, Constanza L. Andaur
    Damen, Johanna A. A.
    Takada, Toshihiko
    Nijman, Steven W. J.
    Dhiman, Paula
    Ma, Jie
    Collins, Gary S.
    Bajpai, Ram
    Riley, Richard D.
    Moons, Karel G. M.
    Hooft, Lotty
    [J]. JOURNAL OF CLINICAL EPIDEMIOLOGY, 2023, 158 : 99 - 110
  • [49] Machine learning-based 30-day readmission prediction models for patients with heart failure: a systematic review
    Yu, Min-Young
    Son, Youn-Jung
    [J]. EUROPEAN JOURNAL OF CARDIOVASCULAR NURSING, 2024,
  • [50] Performance of Statistical and Machine Learning Risk Prediction Models for Surveillance Benefits and Failures in Breast Cancer Survivors
    Su, Yu-Ru
    Buist, Diana S. M.
    Lee, Janie M.
    Ichikawa, Laura
    Miglioretti, Diana L.
    Bowles, Erin J. Aiello
    Wernli, Karen J.
    Kerlikowske, Karla
    Tosteson, Anna
    Lowry, Kathryn P.
    Henderson, Louise M.
    Sprague, Brian L.
    Hubbard, Rebecca A.
    [J]. CANCER EPIDEMIOLOGY BIOMARKERS & PREVENTION, 2023, 32 (04) : 561 - 571