Systematic review identifies the design and methodological conduct of studies on machine learning-based prediction models

被引:39
|
作者
Navarro, Constanza L. Andaur [1 ,2 ,6 ]
Damen, Johanna A. A. [1 ,2 ]
van Smeden, Maarten [1 ]
Takada, Toshihiko [1 ]
Nijman, Steven W. J. [1 ]
Dhiman, Paula [3 ,4 ]
Ma, Jie [3 ]
Collins, Gary S. [3 ,4 ]
Bajpai, Ram [5 ]
Riley, Richard D. [5 ]
Moons, Karel G. M. [1 ,2 ]
Hooft, Lotty [1 ,2 ]
机构
[1] Univ Utrecht, Univ Med Ctr Utrecht, Julius Ctr Hlth Sci & Primary Care, Utrecht, Netherlands
[2] Univ Utrecht, Univ Med Ctr Utrecht, Cochrane Netherlands, Utrecht, Netherlands
[3] Univ Oxford, Ctr Stat Med, Nuffield Dept Orthopaed Rheumatol & Musculoskeleta, Oxford, England
[4] Oxford Univ Hosp NHS Fdn Trust, NIHR Oxford Biomed Res Ctr, Oxford, England
[5] Keele Univ, Ctr Prognosis Res, Sch Med, Keele, England
[6] Julius Ctr Hlth Sci & Primary Care, Univ Weg 100,POB 8550, NL-3508 GA Utrecht, Netherlands
基金
澳大利亚研究理事会;
关键词
Predictive algorithm; Risk prediction; Diagnosis; Prognosis; Development; Validation; RISK; APPLICABILITY; EXPLANATION; VALIDATION; DIAGNOSIS; PROBAST; BIAS; TOOL;
D O I
10.1016/j.jclinepi.2022.11.015
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background and Objectives: We sought to summarize the study design, modelling strategies, and performance measures reported in studies on clinical prediction models developed using machine learning techniques.Methods: We search PubMed for articles published between 01/01/2018 and 31/12/2019, describing the development or the development with external validation of a multivariable prediction model using any supervised machine learning technique. No restrictions were made based on study design, data source, or predicted patient-related health outcomes.Results: We included 152 studies, 58 (38.2% [95% CI 30.8-46.1]) were diagnostic and 94 (61.8% [95% CI 53.9-69.2]) prognostic studies. Most studies reported only the development of prediction models (n = 133, 87.5% [95% CI 81.3-91.8]), focused on binary outcomes (n = 131, 86.2% [95% CI 79.8-90.8), and did not report a sample size calculation (n = 125, 82.2% [95% CI 75.4-87.5]). The most common algorithms used were support vector machine (n = 86/522, 16.5% [95% CI 13.5-19.9]) and random forest (n = 73/522, 14% [95% CI 11.3-17.2]). Values for area under the Receiver Operating Characteristic curve ranged from 0.45 to 1.00. Calibration metrics were often missed (n 5 494/522, 94.6% [95% CI 92.4-96.3]).Conclusion: Our review revealed that focus is required on handling of missing values, methods for internal validation, and reporting of calibration to improve the methodological conduct of studies on machine learning-based prediction models.Systematic review registration: PROSPERO, CRD42019161764. (c) 2022 The Author(s). Published by Elsevier Inc. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
引用
收藏
页码:8 / 22
页数:15
相关论文
共 50 条
  • [21] Methodological quality of machine learning-based quantitative imaging analysis studies in esophageal cancer: a systematic review of clinical outcome prediction after concurrent chemoradiotherapy
    Shi, Zhenwei
    Zhang, Zhen
    Liu, Zaiyi
    Zhao, Lujun
    Ye, Zhaoxiang
    Dekker, Andre
    Wee, Leonard
    EUROPEAN JOURNAL OF NUCLEAR MEDICINE AND MOLECULAR IMAGING, 2022, 49 (08) : 2462 - 2481
  • [22] Methodological quality of machine learning-based quantitative imaging analysis studies in esophageal cancer: a systematic review of clinical outcome prediction after concurrent chemoradiotherapy
    Zhenwei Shi
    Zhen Zhang
    Zaiyi Liu
    Lujun Zhao
    Zhaoxiang Ye
    Andre Dekker
    Leonard Wee
    European Journal of Nuclear Medicine and Molecular Imaging, 2022, 49 : 2462 - 2481
  • [23] A systematic review of machine learning-based microscopic traffic flow models and simulations
    Rowan, Davies
    He, Haitao
    Hui, Fang
    Yasir, Ali
    Mohammed, Quddus
    COMMUNICATIONS IN TRANSPORTATION RESEARCH, 2025, 5
  • [24] Diagnostic Performance of Machine Learning-based Models in Neonatal Sepsis: A Systematic Review
    Kainth, Deepika
    Prakash, Satya
    Sankar, M. Jeeva
    PEDIATRIC INFECTIOUS DISEASE JOURNAL, 2024, 43 (09) : 889 - 901
  • [25] Machine Learning-Based Predictive Models for Patients with Venous Thromboembolism: A Systematic Review
    Danilatou, Vasiliki
    Dimopoulos, Dimitrios
    Kostoulas, Theodoros
    Douketis, James
    THROMBOSIS AND HAEMOSTASIS, 2024, 124 (11) : 1040 - 1052
  • [26] Machine learning-based 30-day readmission prediction models for patients with heart failure: a systematic review
    Yu, Min-Young
    Son, Youn-Jung
    EUROPEAN JOURNAL OF CARDIOVASCULAR NURSING, 2024, 23 (07) : 711 - 719
  • [27] An Assessment of the Predictive Performance of Current Machine Learning-Based Breast Cancer Risk Prediction Models: Systematic Review
    Gao, Ying
    Li, Shu
    Jin, Yujing
    Zhou, Lengxiao
    Sun, Shaomei
    Xu, Xiaoqian
    Li, Shuqian
    Yang, Hongxi
    Zhang, Qing
    Wang, Yaogang
    JMIR PUBLIC HEALTH AND SURVEILLANCE, 2022, 8 (12):
  • [28] Machine learning-based prediction models for postpartum hemorrhage
    Venkatesh, Kartik K.
    Strauss, Robert
    Grotegut, Chad
    Heine, Phillips
    Stamilio, David M.
    Menard, Kathryn
    Jelovsek, Eric
    AMERICAN JOURNAL OF OBSTETRICS AND GYNECOLOGY, 2020, 222 (01) : S175 - S176
  • [29] Interpretability of machine learning-based prediction models in healthcare
    Stiglic, Gregor
    Kocbek, Primoz
    Fijacko, Nino
    Zitnik, Marinka
    Verbert, Katrien
    Cilar, Leona
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2020, 10 (05)
  • [30] Machine Learning-Based Software Defect Prediction for Mobile Applications: A Systematic Literature Review
    Jorayeva, Manzura
    Akbulut, Akhan
    Catal, Cagatay
    Mishra, Alok
    SENSORS, 2022, 22 (07)