Data set terminology of deep learning in medicine: a historical review and recommendation

Cited by: 6

Authors:

Walston, Shannon L. [1]

Seki, Hiroshi [1]

Takita, Hirotaka [1]

Mitsuyama, Yasuhito [1]

Sato, Shingo [2]

Hagiwara, Akifumi [3]

Ito, Rintaro [4]

Hanaoka, Shouhei [5]

Miki, Yukio [1]

Ueda, Daiju [1,6,7]

Affiliations:

[1] Osaka Metropolitan Univ, Grad Sch Med, Dept Diagnost & Intervent Radiol, Osaka, Japan

[2] Thomas Jefferson Univ, Sidney Kimmel Canc Ctr, Philadelphia, PA USA

[3] Juntendo Univ, Sch Med, Dept Radiol, Tokyo, Japan

[4] Nagoya Univ, Dept Radiol, Nagoya, Japan

[5] Univ Tokyo Hosp, Dept Radiol, Tokyo, Japan

[6] Osaka Metropolitan Univ, Grad Sch Med, Dept Artificial Intelligence, Osaka, Japan

[7] Osaka Metropolitan Univ, Ctr Hlth Sci Innovat, Osaka, Japan

Source:

JAPANESE JOURNAL OF RADIOLOGY | 2024 / Volume 42 / Issue 10

Keywords:

Terminology; Artificial intelligence; Deep learning; Data partition; Data splitting; ARTIFICIAL-INTELLIGENCE; VALIDATION; MODEL; PROGNOSIS; TOOL;

DOI:

10.1007/s11604-024-01608-1

Chinese Library Classification (CLC) number:

R8 [Special Medicine]; R445 [Diagnostic Imaging]

Subject classification codes:

1002; 100207; 1009

Abstract:

Medicine and deep learning-based artificial intelligence (AI) engineering represent two distinct fields each with decades of published history. The current rapid convergence of deep learning and medicine has led to significant advancements, yet it has also introduced ambiguity regarding data set terms common to both fields, potentially leading to miscommunication and methodological discrepancies. This narrative review aims to give historical context for these terms, accentuate the importance of clarity when these terms are used in medical deep learning contexts, and offer solutions to mitigate misunderstandings by readers from either field. Through an examination of historical documents, including articles, writing guidelines, and textbooks, this review traces the divergent evolution of terms for data sets and their impact. Initially, the discordant interpretations of the word 'validation' in medical and AI contexts are explored. We then show that in the medical field as well, terms traditionally used in the deep learning domain are becoming more common, with the data for creating models referred to as the 'training set', the data for tuning of parameters referred to as the 'validation (or tuning) set', and the data for the evaluation of models as the 'test set'. Additionally, the test sets used for model evaluation are classified into internal (random splitting, cross-validation, and leave-one-out) sets and external (temporal and geographic) sets. This review then identifies often misunderstood terms and proposes pragmatic solutions to mitigate terminological confusion in the field of deep learning in medicine. We support the accurate and standardized description of these data sets and the explicit definition of data set splitting terminologies in each publication. These are crucial methods for demonstrating the robustness and generalizability of deep learning applications in medicine. This review aspires to enhance the precision of communication, thereby fostering more effective and transparent research methodologies in this interdisciplinary field.
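
As a rough illustration of the terminology in the abstract, the sketch below randomly partitions a set of patient IDs into a training set, a validation (tuning) set, and an internal test set. It is not taken from the paper: the function name `split_patients`, the 70/15/15 proportions, the seed, and the integer patient IDs are all assumptions chosen for the example.

```python
import random


def split_patients(patient_ids, val_frac=0.15, test_frac=0.15, seed=0):
    """Randomly partition unique patient IDs into three disjoint sets.

    Hypothetical helper for illustration only; the fractions and seed
    are assumed values, not recommendations from the review.
    """
    # De-duplicate so that each patient lands in exactly one set.
    ids = sorted(set(patient_ids))
    rng = random.Random(seed)
    rng.shuffle(ids)

    n_test = round(len(ids) * test_frac)
    n_val = round(len(ids) * val_frac)

    test = ids[:n_test]                 # internal test set: final evaluation
    val = ids[n_test:n_test + n_val]    # validation (tuning) set: parameter tuning
    train = ids[n_test + n_val:]        # training set: model creation
    return train, val, test


train_ids, val_ids, test_ids = split_patients(range(100))
print(len(train_ids), len(val_ids), len(test_ids))
```

A split like this yields only an internal test set in the abstract's sense. An external test set would instead be drawn from a different time period (temporal) or a different institution (geographic) than the data used here.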

Pages: 1100-1109

Page count: 10
