On evaluation metrics for medical applications of artificial intelligence

被引：197

作者：

Hicks, Steven A. ^{[1
,2
]}

Struemke, Inga ^{[1
]}

Thambawita, Vajira ^{[1
,2
]}

Hammou, Malek ^{[1
]}

Riegler, Michael A. ^{[1
]}

Halvorsen, Pal ^{[1
,2
]}

Parasa, Sravanthi ^{[3
]}

机构：

[1] SimulaMet, Oslo, Norway

[2] Oslo Metropolitan Univ, Oslo, Norway

[3] Swedish Med Ctr, Seattle, WA USA

来源：

SCIENTIFIC REPORTS | 2022年 / 12卷 / 01期

关键词：

VALIDATION;

D O I：

10.1038/s41598-022-09954-8

中图分类号：

O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Clinicians and software developers need to understand how proposed machine learning (ML) models could improve patient care. No single metric captures all the desirable properties of a model, which is why several metrics are typically reported to summarize a model's performance. Unfortunately, these measures are not easily understandable by many clinicians. Moreover, comparison of models across studies in an objective manner is challenging, and no tool exists to compare models using the same performance metrics. This paper looks at previous ML studies done in gastroenterology, provides an explanation of what different metrics mean in the context of binary classification in the presented studies, and gives a thorough explanation of how different metrics should be interpreted. We also release an open source web-based tool that may be used to aid in calculating the most relevant metrics presented in this paper so that other researchers and clinicians may easily incorporate them into their research.

引用

页数：9

共 50 条

[1] On evaluation metrics for medical applications of artificial intelligence
Steven A. Hicks
Inga Strümke
Vajira Thambawita
Malek Hammou
Michael A. Riegler
Pål Halvorsen
Sravanthi Parasa
[J]. Scientific Reports, 12
[2] Artificial Intelligence in Medical Applications
Chan, Yung-Kuan
Chen, Yung-Fu
Pham, Tuan
Chang, Weide
Hsieh, Ming-Yuan
[J]. JOURNAL OF HEALTHCARE ENGINEERING, 2018, 2018
[3] Evaluation Metrics in Explainable Artificial Intelligence (XAI)
Coroama, Loredana
Groza, Adrian
[J]. ADVANCED RESEARCH IN TECHNOLOGIES, INFORMATION, INNOVATION AND SUSTAINABILITY, ARTIIS 2022, PT I, 2022, 1675 : 401 - 413
[4] Advanced Artificial Intelligence Methods for Medical Applications
Siriborvornratanakul, Thitirat
[J]. DIGITAL HUMAN MODELING AND APPLICATIONS IN HEALTH, SAFETY, ERGONOMICS AND RISK MANAGEMENT, DHM 2023, PT II, 2023, 14029 : 329 - 340
[5] Artificial intelligence and Medical Parasitology: Applications and perspectives
Diab, Radwa G.
[J]. PARASITOLOGISTS UNITED JOURNAL, 2023, 16 (02) : 91 - 93
[6] International Workshop on Artificial Intelligence in Medical Applications
[J]. 2013 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (FEDCSIS), 2013, : 191 - 191
[7] International workshop on Artificial Intelligence in Medical Applications
[J]. 1600, IEEE Computer Society, 2001 L Street N.W., Suite 700, Washington, DC 20036-4928, United States
[8] Clinical and Research Medical Applications of Artificial Intelligence
Ramkumar, Prem N.
Kunze, Kyle N.
Haeberle, Heather S.
Karnuta, Jaret M.
Luu, Bryan C.
Nwachukwu, Benedict U.
Williams, Riley J.
[J]. ARTHROSCOPY-THE JOURNAL OF ARTHROSCOPIC AND RELATED SURGERY, 2021, 37 (05): : 1694 - 1697
[9] Trustworthy Artificial Intelligence in Medical Applications: A Mini Survey
Onari, Mohsen Abbaspour
Grau, Isel
Nobile, Marco S.
Zhang, Yingqian
[J]. 2023 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, CIBCB, 2023, : 71 - 78
[10] Artificial Intelligence in Medical Imaging, Opportunities, Applications and Risks
Nabavi, Shahabedin
Mohammadi, Mohammad
[J]. PHYSICAL AND ENGINEERING SCIENCES IN MEDICINE, 2021, 44 (02) : 591 - 593

← 1 2 3 4 5 →