On evaluation metrics for medical applications of artificial intelligence

被引:197
|
作者
Hicks, Steven A. [1 ,2 ]
Struemke, Inga [1 ]
Thambawita, Vajira [1 ,2 ]
Hammou, Malek [1 ]
Riegler, Michael A. [1 ]
Halvorsen, Pal [1 ,2 ]
Parasa, Sravanthi [3 ]
机构
[1] SimulaMet, Oslo, Norway
[2] Oslo Metropolitan Univ, Oslo, Norway
[3] Swedish Med Ctr, Seattle, WA USA
关键词
VALIDATION;
D O I
10.1038/s41598-022-09954-8
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Clinicians and software developers need to understand how proposed machine learning (ML) models could improve patient care. No single metric captures all the desirable properties of a model, which is why several metrics are typically reported to summarize a model's performance. Unfortunately, these measures are not easily understandable by many clinicians. Moreover, comparison of models across studies in an objective manner is challenging, and no tool exists to compare models using the same performance metrics. This paper looks at previous ML studies done in gastroenterology, provides an explanation of what different metrics mean in the context of binary classification in the presented studies, and gives a thorough explanation of how different metrics should be interpreted. We also release an open source web-based tool that may be used to aid in calculating the most relevant metrics presented in this paper so that other researchers and clinicians may easily incorporate them into their research.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] On evaluation metrics for medical applications of artificial intelligence
    Steven A. Hicks
    Inga Strümke
    Vajira Thambawita
    Malek Hammou
    Michael A. Riegler
    Pål Halvorsen
    Sravanthi Parasa
    [J]. Scientific Reports, 12
  • [2] Artificial Intelligence in Medical Applications
    Chan, Yung-Kuan
    Chen, Yung-Fu
    Pham, Tuan
    Chang, Weide
    Hsieh, Ming-Yuan
    [J]. JOURNAL OF HEALTHCARE ENGINEERING, 2018, 2018
  • [3] Evaluation Metrics in Explainable Artificial Intelligence (XAI)
    Coroama, Loredana
    Groza, Adrian
    [J]. ADVANCED RESEARCH IN TECHNOLOGIES, INFORMATION, INNOVATION AND SUSTAINABILITY, ARTIIS 2022, PT I, 2022, 1675 : 401 - 413
  • [4] Advanced Artificial Intelligence Methods for Medical Applications
    Siriborvornratanakul, Thitirat
    [J]. DIGITAL HUMAN MODELING AND APPLICATIONS IN HEALTH, SAFETY, ERGONOMICS AND RISK MANAGEMENT, DHM 2023, PT II, 2023, 14029 : 329 - 340
  • [5] Artificial intelligence and Medical Parasitology: Applications and perspectives
    Diab, Radwa G.
    [J]. PARASITOLOGISTS UNITED JOURNAL, 2023, 16 (02) : 91 - 93
  • [6] International Workshop on Artificial Intelligence in Medical Applications
    [J]. 2013 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (FEDCSIS), 2013, : 191 - 191
  • [7] International workshop on Artificial Intelligence in Medical Applications
    [J]. 1600, IEEE Computer Society, 2001 L Street N.W., Suite 700, Washington, DC 20036-4928, United States
  • [8] Clinical and Research Medical Applications of Artificial Intelligence
    Ramkumar, Prem N.
    Kunze, Kyle N.
    Haeberle, Heather S.
    Karnuta, Jaret M.
    Luu, Bryan C.
    Nwachukwu, Benedict U.
    Williams, Riley J.
    [J]. ARTHROSCOPY-THE JOURNAL OF ARTHROSCOPIC AND RELATED SURGERY, 2021, 37 (05): : 1694 - 1697
  • [9] Trustworthy Artificial Intelligence in Medical Applications: A Mini Survey
    Onari, Mohsen Abbaspour
    Grau, Isel
    Nobile, Marco S.
    Zhang, Yingqian
    [J]. 2023 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, CIBCB, 2023, : 71 - 78
  • [10] Artificial Intelligence in Medical Imaging, Opportunities, Applications and Risks
    Nabavi, Shahabedin
    Mohammadi, Mohammad
    [J]. PHYSICAL AND ENGINEERING SCIENCES IN MEDICINE, 2021, 44 (02) : 591 - 593