Addressing the data bottleneck in medical deep learning models using a human-in-the-loop machine learning approach

被引:2
|
作者
Mosqueira-Rey, Eduardo [1 ]
Hernandez-Pereira, Elena [1 ]
Bobes-Bascaran, Jose [1 ]
Alonso-Rios, David [1 ]
Perez-Sanchez, Alberto [1 ]
Fernandez-Leal, Angel [1 ]
Moret-Bonillo, Vicente [1 ]
Vidal-Insua, Yolanda [2 ]
Vazquez-Rivera, Francisca [2 ]
机构
[1] Univ Coruna CITIC, Dept Comp Sci & Informat Technol, Campus Elvina, La Coruna 15071, Spain
[2] Complejo Hosp CHUS, Serv Oncol Med, Rua Choupana S-N, Santiago De Compostela 15706, Spain
来源
NEURAL COMPUTING & APPLICATIONS | 2024年 / 36卷 / 05期
关键词
Human-in-the-loop machine learning; Active learning; Interactive machine learning; Pancreatic cancer; Generative adversarial network; USABILITY EVALUATION;
D O I
10.1007/s00521-023-09197-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Any machine learning (ML) model is highly dependent on the data it uses for learning, and this is even more important in the case of deep learning models. The problem is a data bottleneck, i.e. the difficulty in obtaining an adequate number of cases and quality data. Another issue is improving the learning process, which can be done by actively introducing experts into the learning loop, in what is known as human-in-the-loop (HITL) ML. We describe an ML model based on a neural network in which HITL techniques were used to resolve the data bottleneck problem for the treatment of pancreatic cancer. We first augmented the dataset using synthetic cases created by a generative adversarial network. We then launched an active learning (AL) process involving human experts as oracles to label both new cases and cases by the network found to be suspect. This AL process was carried out simultaneously with an interactive ML process in which feedback was obtained from humans in order to develop better synthetic cases for each iteration of training. We discuss the challenges involved in including humans in the learning process, especially in relation to human-computer interaction, which is acquiring great importance in building ML models and can condition the success of a HITL approach. This paper also discusses the methodological approach adopted to address these challenges.
引用
收藏
页码:2597 / 2616
页数:20
相关论文
共 50 条
  • [1] Addressing the data bottleneck in medical deep learning models using a human-in-the-loop machine learning approach
    Eduardo Mosqueira-Rey
    Elena Hernández-Pereira
    José Bobes-Bascarán
    David Alonso-Ríos
    Alberto Pérez-Sánchez
    Ángel Fernández-Leal
    Vicente Moret-Bonillo
    Yolanda Vidal-Ínsua
    Francisca Vázquez-Rivera
    [J]. Neural Computing and Applications, 2024, 36 : 2597 - 2616
  • [2] A survey on active learning and human-in-the-loop deep learning for medical image analysis
    Budd, Samuel
    Robinson, Emma C.
    Kainz, Bernhard
    [J]. MEDICAL IMAGE ANALYSIS, 2021, 71
  • [3] A survey of human-in-the-loop for machine learning
    Wu, Xingjiao
    Xiao, Luwei
    Sun, Yixuan
    Zhang, Junhang
    Ma, Tianlong
    He, Liang
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2022, 135 : 364 - 381
  • [4] Human-in-the-loop Applied Machine Learning
    Brodley, Carla E.
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 1 - 1
  • [5] Human-in-the-loop Extraction of Interpretable Concepts in Deep Learning Models
    Zhao, Zhenge
    Xu, Panpan
    Scheidegger, Carlos
    Ren, Liu
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2022, 28 (01) : 780 - 790
  • [6] Enabling Autonomous Medical Image Data Annotation: A human-in-the-loop Reinforcement Learning Approach
    da Cruz, Leonardo C.
    Sierra-Franco, Cesar A.
    Silva-Calpa, Greis Francy M.
    Raposo, Alberto Barbosa
    [J]. PROCEEDINGS OF THE 2021 16TH CONFERENCE ON COMPUTER SCIENCE AND INTELLIGENCE SYSTEMS (FEDCSIS), 2021, : 271 - 279
  • [7] Human-in-the-loop machine learning: a state of the art
    Mosqueira-Rey, Eduardo
    Hernandez-Pereira, Elena
    Alonso-Rios, David
    Bobes-Bascaran, Jose
    Fernandez-Leal, Angel
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (04) : 3005 - 3054
  • [8] HELIX: Accelerating Human-in-the-loop Machine Learning
    Xin, Doris
    Ma, Litian
    Liu, Jialin
    Macke, Stephen
    Song, Shuchen
    Parameswaran, Aditya
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2018, 11 (12): : 1958 - 1961
  • [9] Information Filtering Method for Twitter Streaming Data Using Human-in-the-Loop Machine Learning
    Suzuki, Yu
    Nakamura, Satoshi
    [J]. DATABASE AND EXPERT SYSTEMS APPLICATIONS (DEXA 2018), PT II, 2018, 11030 : 167 - 175
  • [10] Human-in-the-loop machine learning: a state of the art
    Eduardo Mosqueira-Rey
    Elena Hernández-Pereira
    David Alonso-Ríos
    José Bobes-Bascarán
    Ángel Fernández-Leal
    [J]. Artificial Intelligence Review, 2023, 56 : 3005 - 3054