Performance Metrics for the Comparative Analysis of Clinical Risk Prediction Models Employing Machine Learning

被引:23
|
作者
Huang, Chenxi [1 ]
Li, Shu-Xia [1 ]
Caraballo, Cesar [1 ]
Masoudi, Frederick A. [2 ,3 ]
Rumsfeld, John S. [2 ]
Spertus, John A. [4 ,5 ]
Normand, Sharon-Lise T. [6 ,7 ]
Mortazavi, Bobak J. [8 ]
Krumholz, Harlan M. [1 ,9 ,10 ]
机构
[1] Yale New Haven Hosp, Ctr Outcomes Res & Evaluat, 20 York St, New Haven, CT 06504 USA
[2] Univ Colorado, Div Cardiol, Anschutz Med Campus, Aurora, CO USA
[3] Ascens Hlth, St Louis, MO USA
[4] Univ Missouri, Dept Internal Med, Kansas City, MO 64110 USA
[5] St Lukes Mid Amer Heart Inst, Dept Cardiovasc Med, Kansas City, MO USA
[6] Harvard Univ, TH Chan Sch Publ Hlth, Dept Biostat, Boston, MA 02115 USA
[7] Harvard Med Sch, Dept Hlth Care Policy, Boston, MA 02115 USA
[8] Texas A&M Univ, Dept Comp Sci & Engn, College Stn, TX USA
[9] Yale Sch Publ Hlth, Dept Hlth Policy & Management, New Haven, CT USA
[10] Yale Sch Med, Dept Internal Med, Sect Cardiovasc Med, New Haven, CT 06510 USA
来源
关键词
acute kidney injury; machine learning; metrics; percutaneous coronary intervention; precision medicine; statistical model; EXTERNAL VALIDATION; MISLEADING MEASURE; INCREMENTAL VALUE; BRIER SCORE; CURVE; MARKERS; NRI;
D O I
10.1161/CIRCOUTCOMES.120.007526
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background: New methods such as machine learning techniques have been increasingly used to enhance the performance of risk predictions for clinical decision-making. However, commonly reported performance metrics may not be sufficient to capture the advantages of these newly proposed models for their adoption by health care professionals to improve care. Machine learning models often improve risk estimation for certain subpopulations that may be missed by these metrics. Methods and Results: This article addresses the limitations of commonly reported metrics for performance comparison and proposes additional metrics. Our discussions cover metrics related to overall performance, discrimination, calibration, resolution, reclassification, and model implementation. Models for predicting acute kidney injury after percutaneous coronary intervention are used to illustrate the use of these metrics. Conclusions: We demonstrate that commonly reported metrics may not have sufficient sensitivity to identify improvement of machine learning models and propose the use of a comprehensive list of performance metrics for reporting and comparing clinical risk prediction models.
引用
收藏
页码:1076 / 1086
页数:11
相关论文
共 50 条
  • [1] Comparative Analysis of Machine Learning Models for Performance Prediction of the SPEC Benchmarks
    Tousi, Ashkan
    Lujan, Mikel
    [J]. IEEE ACCESS, 2022, 10 : 11994 - 12011
  • [2] Comparative analysis of machine learning models for rainfall prediction
    Das, Pritee Krishna
    Sahu, Rajiv Lochan
    Swain, Prakash Chandra
    [J]. JOURNAL OF ATMOSPHERIC AND SOLAR-TERRESTRIAL PHYSICS, 2024, 264
  • [3] Analysis of Machine Learning Models for Academic Performance Prediction
    Benitez Amaya, Andres
    Castro Barrera, Harold
    Manrique, Ruben
    [J]. GENERATIVE INTELLIGENCE AND INTELLIGENT TUTORING SYSTEMS, PT II, ITS 2024, 2024, 14799 : 150 - 161
  • [4] Comparative analysis of machine learning models for solar flare prediction
    Zheng, Yanfang
    Qin, Weishu
    Li, Xuebao
    Ling, Yi
    Huang, Xusheng
    Li, Xuefeng
    Yan, Pengchao
    Yan, Shuainan
    Lou, Hengrui
    [J]. ASTROPHYSICS AND SPACE SCIENCE, 2023, 368 (07)
  • [5] Comparative analysis of machine learning models for solar flare prediction
    Yanfang Zheng
    Weishu Qin
    Xuebao Li
    Yi Ling
    Xusheng Huang
    Xuefeng Li
    Pengchao Yan
    Shuainan Yan
    Hengrui Lou
    [J]. Astrophysics and Space Science, 2023, 368
  • [6] Design analysis and performance prediction of packed bed latent heat storage system employing machine learning models
    Anand, Pratyush
    Tejes, P. K. S.
    Naik, B. Kiran
    Niyas, Hakeem
    [J]. JOURNAL OF ENERGY STORAGE, 2023, 72
  • [7] A Comparative Analysis of Machine Learning Models in Prediction of Mortar Compressive Strength
    Gayathri, Rajakumaran
    Rani, Shola Usha
    Cepova, Lenka
    Rajesh, Murugesan
    Kalita, Kanak
    [J]. PROCESSES, 2022, 10 (07)
  • [8] Comparative analysis of explainable machine learning prediction models for hospital mortality
    Eline Stenwig
    Giampiero Salvi
    Pierluigi Salvo Rossi
    Nils Kristian Skjærvold
    [J]. BMC Medical Research Methodology, 22
  • [9] A Comparative Analysis of Machine Learning Models for the Prediction of Insurance Uptake in Kenya
    Yego, Nelson Kemboi
    Kasozi, Juma
    Nkurunziza, Joseph
    [J]. DATA, 2021, 6 (11)
  • [10] Comparative analysis of explainable machine learning prediction models for hospital mortality
    Stenwig, Eline
    Salvi, Giampiero
    Rossi, Pierluigi Salvo
    Skjaervold, Nils Kristian
    [J]. BMC MEDICAL RESEARCH METHODOLOGY, 2022, 22 (01)