Reinforcement Learning Requires Human-in-the-Loop Framing and Approaches

被引：0

作者：

Taylor, Matthew E. ^{[1
,2
,3
]}

机构：

[1] Univ Alberta, Edmonton, AB, Canada

[2] Alberta Machine Intelligence Inst, Edmonton, AB, Canada

[3] AI Redefined, Montreal, PQ, Canada

来源：

HHAI 2023: AUGMENTING HUMAN INTELLECT | 2023年 / 368卷

基金：

加拿大自然科学与工程研究理事会;

关键词：

Reinforcement Learning; Human-Agent Interaction; Human in the Loop; Interactive Machine Learning;

D O I：

10.3233/FAIA230098

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Reinforcement learning (RL) is typically framed as a machine learning paradigm where agents learn to act autonomously in complex environments. This paper argues instead that RL is fundamentally human in the loop (HitL). The reward functions (and other components) of a Markov decision process are defined by humans. The decisions to tackle a certain problem, and deploy a learned solution, are taken by humans. Humans can also play a critical role in providing information to the agent throughout its life cycle to better succeed at the problem in question. We end by highlighting a set of critical HitL research questions, which, if ignored, could cause RL to fail to live up to its full potential.

引用

页码：351 / 360

页数：10

共 50 条

[1] Human-in-the-loop Reinforcement Learning
Liang, Huanghuang
Yang, Lu
Cheng, Hong
Tu, Wenzhe
Xu, Mengjie
2017 CHINESE AUTOMATION CONGRESS (CAC), 2017, : 4511 - 4518
[2] Value Driven Representation for Human-in-the-Loop Reinforcement Learning
Keramati, Ramtin
Brunskill, Emma
ACM UMAP '19: PROCEEDINGS OF THE 27TH ACM CONFERENCE ON USER MODELING, ADAPTATION AND PERSONALIZATION, 2019, : 176 - 180
[3] Where to Add Actions in Human-in-the-Loop Reinforcement Learning
Mandel, Travis
Liu, Yun-En
Brunskill, Emma
Popovic, Zoran
THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2322 - 2328
[4] ASHA: Assistive Teleoperation via Human-in-the-Loop Reinforcement Learning
Chen, Sean
Gao, Jensen
Reddy, Siddharth
Berseth, Glen
Dragan, Anca D.
Levine, Sergey
2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, : 7505 - 7512
[5] Human-in-the-Loop Reinforcement Learning in Continuous-Action Space
Luo, Biao
Wu, Zhengke
Zhou, Fei
Wang, Bing-Chuan
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 35 (11) : 1 - 10
[6] HEX: Human-in-the-loop explainability via deep reinforcement learning
Lash, Michael T.
Decision Support Systems, 2024, 187
[7] Shared Autonomy Based on Human-in-the-loop Reinforcement Learning with Policy Constraints
Li, Ming
Kang, Yu
Zhao, Yun-Bo
Zhu, Jin
You, Shiyi
2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 7349 - 7354
[8] Personalization of Hearing Aid Compression by Human-in-the-Loop Deep Reinforcement Learning
Alamdari, Nasim
Lobarinas, Edward
Kehtarnavaz, Nasser
IEEE ACCESS, 2020, 8 : 203503 - 203515
[9] Thermal comfort management leveraging deep reinforcement learning and human-in-the-loop
Cicirelli, Franco
Guerrieri, Antonio
Mastroianni, Carlo
Spezzano, Giandomenico
Vinci, Andrea
PROCEEDINGS OF THE 2020 IEEE INTERNATIONAL CONFERENCE ON HUMAN-MACHINE SYSTEMS (ICHMS), 2020, : 160 - 165
[10] Human-in-the-Loop Reinforcement Learning: A Survey and Position on Requirements, Challenges, and Opportunities
Retzlaff, Carl Orge
Das, Srijita
Wayllace, Christabel
Mousavi, Payam
Afshari, Mohammad
Yang, Tianpei
Saranti, Anna
Angerschmid, Alessa
Taylor, Matthew E.
Holzinger, Andreas
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2024, 79 : 359 - 415

← 1 2 3 4 5 →