A survey of human-in-the-loop for machine learning

被引：179

作者：

Wu, Xingjiao ^{[1
,2
]}

Xiao, Luwei ^{[2
]}

Sun, Yixuan ^{[3
]}

Zhang, Junhang ^{[2
]}

Ma, Tianlong ^{[1
,2
]}

He, Liang ^{[1
,2
]}

机构：

[1] East China Normal Univ, Shanghai Key Lab Multidimens Informat Proc, Shanghai, Peoples R China

[2] East China Normal Univ, Sch Comp Sci & Technol, Shanghai, Peoples R China

[3] Fudan Univ, Shanghai, Peoples R China

来源：

FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE | 2022年 / 135卷

关键词：

Human-in-the-loop; Machine learning; Deep learning; Data processing; Computer vision; Natural language processing; CLASSIFICATION; ANNOTATION; GENERATION; REMOVAL; FUSION; IF;

D O I：

10.1016/j.future.2022.05.014

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Machine learning has become the state-of-the-art technique for many tasks including computer vision, natural language processing, speech processing tasks, etc. However, the unique challenges posed by machine learning suggest that incorporating user knowledge into the system can be beneficial. The purpose of integrating human domain knowledge is also to promote the automation of machine learning. Human-in-the-loop is an area that we see as increasingly important in future research due to the knowledge learned by machine learning cannot win human domain knowledge. Human-in-the-loop aims to train an accurate prediction model with minimum cost by integrating human knowledge and experience. Humans can provide training data for machine learning applications and directly accomplish tasks that are hard for computers in the pipeline with the help of machine-based approaches. In this paper, we survey existing works on human-in-the-loop from a data perspective and classify them into three categories with a progressive relationship: (1) the work of improving model performance from data processing, (2) the work of improving model performance through interventional model training, and (3) the design of the system independent human-in-the-loop. Using the above categorization, we summarize the major approaches in the field; along with their technical strengths/weaknesses, we have a simple classification and discussion in natural language processing, computer vision, and others. Besides, we provide some open challenges and opportunities. This survey intends to provide a high-level summarization for human-in-the-loop and to motivate interested readers to consider approaches for designing effective human-in-the-loop solutions. Keywords: Human-in-the-loop Machine learning Deep learning Data processing Computer vision Natural language processing (C) 2022 Elsevier B.V. All rights reserved.

引用

页码：364 / 381

页数：18

共 50 条

[31] Human-in-the-Loop Machine Learning to Increase Video Accessibility for Visually Impaired and Blind Users
Yuksel, Beste F.
Fazli, Pooyan
Mathur, Umang
Bisht, Vaishali
Kim, Soo Jung
Lee, Joshua Junhee
Jin, Seung Jung
Siu, Yue-Ting
Miele, Joshua A.
Yoon, Ilmi
[J]. PROCEEDINGS OF THE 2020 ACM DESIGNING INTERACTIVE SYSTEMS CONFERENCE (DIS 2020), 2020, : 47 - 60
[32] Addressing the data bottleneck in medical deep learning models using a human-in-the-loop machine learning approach
Mosqueira-Rey, Eduardo
Hernandez-Pereira, Elena
Bobes-Bascaran, Jose
Alonso-Rios, David
Perez-Sanchez, Alberto
Fernandez-Leal, Angel
Moret-Bonillo, Vicente
Vidal-Insua, Yolanda
Vazquez-Rivera, Francisca
[J]. NEURAL COMPUTING & APPLICATIONS, 2024, 36 (05): : 2597 - 2616
[33] A Survey on Human-in-the-Loop Applications Towards an Internet of All
Nunes, David Sousa
Zhang, Pei
Silva, Jorge Sa
[J]. IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2015, 17 (02): : 944 - 965
[34] Human-in-the-Loop Mobile Networks: A Survey of Recent Advancements
Duan, Lingjie
Huang, Longbo
Langbort, Cedric
Pozdnukhov, Alexey
Walrand, Jean
Zhang, Lin
[J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2017, 35 (04) : 813 - 831
[35] Addressing the data bottleneck in medical deep learning models using a human-in-the-loop machine learning approach
Eduardo Mosqueira-Rey
Elena Hernández-Pereira
José Bobes-Bascarán
David Alonso-Ríos
Alberto Pérez-Sánchez
Ángel Fernández-Leal
Vicente Moret-Bonillo
Yolanda Vidal-Ínsua
Francisca Vázquez-Rivera
[J]. Neural Computing and Applications, 2024, 36 : 2597 - 2616
[36] HEIDL: Learning Linguistic Expressions with Deep Learning and Human-in-the-Loop
Yang, Yiwei
Kandogan, Eser
Li, Yunyao
Lasecki, Walter S.
Sen, Prithviraj
[J]. PROCEEDINGS OF THE 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: SYSTEM DEMONSTRATIONS, (ACL 2019), 2019, : 135 - 140
[37] Combining human intelligence and machine learning for fact-checking: Towards a hybrid human-in-the-loop framework
La Barbera, David
Roitero, Kevin
Mizzaro, Stefano
[J]. INTELLIGENZA ARTIFICIALE, 2023, 17 (02) : 163 - 172
[38] Digitally Diagnosing Multiple Developmental Delays Using Crowdsourcing Fused With Machine Learning: Protocol for a Human-in-the-Loop Machine Learning Study
Jaiswal, Aditi
Kruiper, Ruben
Rasool, Abdur
Nandkeolyar, Aayush
Wall, Dennis P.
Washington, Peter
[J]. JMIR RESEARCH PROTOCOLS, 2024, 13
[39] Human-Machine Trust and Calibration Based on Human-in-the-Loop Experiment
Wang, Yifan
Guo, Jianbin
Zeng, Shengkui
Mao, Qirui
Lu, Zhenping
Wang, Zengkai
[J]. 2022 4TH INTERNATIONAL CONFERENCE ON SYSTEM RELIABILITY AND SAFETY ENGINEERING, SRSE, 2022, : 476 - 481
[40] Precise atom-to-atom mapping for organic reactions via human-in-the-loop machine learning
Chen S.
An S.
Babazade R.
Jung Y.
[J]. Nature Communications, 15 (1)

← 1 2 3 4 5 →