A survey of human-in-the-loop for machine learning

被引:179
|
作者
Wu, Xingjiao [1 ,2 ]
Xiao, Luwei [2 ]
Sun, Yixuan [3 ]
Zhang, Junhang [2 ]
Ma, Tianlong [1 ,2 ]
He, Liang [1 ,2 ]
机构
[1] East China Normal Univ, Shanghai Key Lab Multidimens Informat Proc, Shanghai, Peoples R China
[2] East China Normal Univ, Sch Comp Sci & Technol, Shanghai, Peoples R China
[3] Fudan Univ, Shanghai, Peoples R China
关键词
Human-in-the-loop; Machine learning; Deep learning; Data processing; Computer vision; Natural language processing; CLASSIFICATION; ANNOTATION; GENERATION; REMOVAL; FUSION; IF;
D O I
10.1016/j.future.2022.05.014
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Machine learning has become the state-of-the-art technique for many tasks including computer vision, natural language processing, speech processing tasks, etc. However, the unique challenges posed by machine learning suggest that incorporating user knowledge into the system can be beneficial. The purpose of integrating human domain knowledge is also to promote the automation of machine learning. Human-in-the-loop is an area that we see as increasingly important in future research due to the knowledge learned by machine learning cannot win human domain knowledge. Human-in-the-loop aims to train an accurate prediction model with minimum cost by integrating human knowledge and experience. Humans can provide training data for machine learning applications and directly accomplish tasks that are hard for computers in the pipeline with the help of machine-based approaches. In this paper, we survey existing works on human-in-the-loop from a data perspective and classify them into three categories with a progressive relationship: (1) the work of improving model performance from data processing, (2) the work of improving model performance through interventional model training, and (3) the design of the system independent human-in-the-loop. Using the above categorization, we summarize the major approaches in the field; along with their technical strengths/weaknesses, we have a simple classification and discussion in natural language processing, computer vision, and others. Besides, we provide some open challenges and opportunities. This survey intends to provide a high-level summarization for human-in-the-loop and to motivate interested readers to consider approaches for designing effective human-in-the-loop solutions. Keywords: Human-in-the-loop Machine learning Deep learning Data processing Computer vision Natural language processing (C) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:364 / 381
页数:18
相关论文
共 50 条
  • [31] Human-in-the-Loop Machine Learning to Increase Video Accessibility for Visually Impaired and Blind Users
    Yuksel, Beste F.
    Fazli, Pooyan
    Mathur, Umang
    Bisht, Vaishali
    Kim, Soo Jung
    Lee, Joshua Junhee
    Jin, Seung Jung
    Siu, Yue-Ting
    Miele, Joshua A.
    Yoon, Ilmi
    [J]. PROCEEDINGS OF THE 2020 ACM DESIGNING INTERACTIVE SYSTEMS CONFERENCE (DIS 2020), 2020, : 47 - 60
  • [32] Addressing the data bottleneck in medical deep learning models using a human-in-the-loop machine learning approach
    Mosqueira-Rey, Eduardo
    Hernandez-Pereira, Elena
    Bobes-Bascaran, Jose
    Alonso-Rios, David
    Perez-Sanchez, Alberto
    Fernandez-Leal, Angel
    Moret-Bonillo, Vicente
    Vidal-Insua, Yolanda
    Vazquez-Rivera, Francisca
    [J]. NEURAL COMPUTING & APPLICATIONS, 2024, 36 (05): : 2597 - 2616
  • [33] A Survey on Human-in-the-Loop Applications Towards an Internet of All
    Nunes, David Sousa
    Zhang, Pei
    Silva, Jorge Sa
    [J]. IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2015, 17 (02): : 944 - 965
  • [34] Human-in-the-Loop Mobile Networks: A Survey of Recent Advancements
    Duan, Lingjie
    Huang, Longbo
    Langbort, Cedric
    Pozdnukhov, Alexey
    Walrand, Jean
    Zhang, Lin
    [J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2017, 35 (04) : 813 - 831
  • [35] Addressing the data bottleneck in medical deep learning models using a human-in-the-loop machine learning approach
    Eduardo Mosqueira-Rey
    Elena Hernández-Pereira
    José Bobes-Bascarán
    David Alonso-Ríos
    Alberto Pérez-Sánchez
    Ángel Fernández-Leal
    Vicente Moret-Bonillo
    Yolanda Vidal-Ínsua
    Francisca Vázquez-Rivera
    [J]. Neural Computing and Applications, 2024, 36 : 2597 - 2616
  • [36] HEIDL: Learning Linguistic Expressions with Deep Learning and Human-in-the-Loop
    Yang, Yiwei
    Kandogan, Eser
    Li, Yunyao
    Lasecki, Walter S.
    Sen, Prithviraj
    [J]. PROCEEDINGS OF THE 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: SYSTEM DEMONSTRATIONS, (ACL 2019), 2019, : 135 - 140
  • [37] Combining human intelligence and machine learning for fact-checking: Towards a hybrid human-in-the-loop framework
    La Barbera, David
    Roitero, Kevin
    Mizzaro, Stefano
    [J]. INTELLIGENZA ARTIFICIALE, 2023, 17 (02) : 163 - 172
  • [38] Digitally Diagnosing Multiple Developmental Delays Using Crowdsourcing Fused With Machine Learning: Protocol for a Human-in-the-Loop Machine Learning Study
    Jaiswal, Aditi
    Kruiper, Ruben
    Rasool, Abdur
    Nandkeolyar, Aayush
    Wall, Dennis P.
    Washington, Peter
    [J]. JMIR RESEARCH PROTOCOLS, 2024, 13
  • [39] Human-Machine Trust and Calibration Based on Human-in-the-Loop Experiment
    Wang, Yifan
    Guo, Jianbin
    Zeng, Shengkui
    Mao, Qirui
    Lu, Zhenping
    Wang, Zengkai
    [J]. 2022 4TH INTERNATIONAL CONFERENCE ON SYSTEM RELIABILITY AND SAFETY ENGINEERING, SRSE, 2022, : 476 - 481
  • [40] Precise atom-to-atom mapping for organic reactions via human-in-the-loop machine learning
    Chen S.
    An S.
    Babazade R.
    Jung Y.
    [J]. Nature Communications, 15 (1)