Reinforcement Learning Requires Human-in-the-Loop Framing and Approaches

Cited by: 0
Author
Taylor, Matthew E. [1, 2, 3]
Affiliations
[1] Univ Alberta, Edmonton, AB, Canada
[2] Alberta Machine Intelligence Inst, Edmonton, AB, Canada
[3] AI Redefined, Montreal, PQ, Canada
Source
Funding
Natural Sciences and Engineering Research Council of Canada
Keywords
Reinforcement Learning; Human-Agent Interaction; Human in the Loop; Interactive Machine Learning
DOI
10.3233/FAIA230098
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Reinforcement learning (RL) is typically framed as a machine learning paradigm where agents learn to act autonomously in complex environments. This paper argues instead that RL is fundamentally human in the loop (HitL). The reward functions (and other components) of a Markov decision process are defined by humans. The decisions to tackle a certain problem, and deploy a learned solution, are taken by humans. Humans can also play a critical role in providing information to the agent throughout its life cycle to better succeed at the problem in question. We end by highlighting a set of critical HitL research questions, which, if ignored, could cause RL to fail to live up to its full potential.
Pages: 351-360
Page count: 10
Related papers
50 in total
  • [1] Human-in-the-loop Reinforcement Learning
    Liang, Huanghuang
    Yang, Lu
    Cheng, Hong
    Tu, Wenzhe
    Xu, Mengjie
    [J]. 2017 CHINESE AUTOMATION CONGRESS (CAC), 2017 : 4511 - 4518
  • [2] Value Driven Representation for Human-in-the-Loop Reinforcement Learning
    Keramati, Ramtin
    Brunskill, Emma
    [J]. ACM UMAP '19: PROCEEDINGS OF THE 27TH ACM CONFERENCE ON USER MODELING, ADAPTATION AND PERSONALIZATION, 2019 : 176 - 180
  • [3] Where to Add Actions in Human-in-the-Loop Reinforcement Learning
    Mandel, Travis
    Liu, Yun-En
    Brunskill, Emma
    Popovic, Zoran
    [J]. THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017 : 2322 - 2328
  • [4] ASHA: Assistive Teleoperation via Human-in-the-Loop Reinforcement Learning
    Chen, Sean
    Gao, Jensen
    Reddy, Siddharth
    Berseth, Glen
    Dragan, Anca D.
    Levine, Sergey
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022 : 7505 - 7512
  • [5] Human-in-the-Loop Reinforcement Learning in Continuous-Action Space
    Luo, Biao
    Wu, Zhengke
    Zhou, Fei
    Wang, Bing-Chuan
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023 : 1 - 10
  • [6] Shared Autonomy Based on Human-in-the-loop Reinforcement Learning with Policy Constraints
    Li, Ming
    Kang, Yu
    Zhao, Yun-Bo
    Zhu, Jin
    You, Shiyi
    [J]. 2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022 : 7349 - 7354
  • [7] Personalization of Hearing Aid Compression by Human-in-the-Loop Deep Reinforcement Learning
    Alamdari, Nasim
    Lobarinas, Edward
    Kehtarnavaz, Nasser
    [J]. IEEE ACCESS, 2020, 8 : 203503 - 203515
  • [8] Thermal comfort management leveraging deep reinforcement learning and human-in-the-loop
    Cicirelli, Franco
    Guerrieri, Antonio
    Mastroianni, Carlo
    Spezzano, Giandomenico
    Vinci, Andrea
    [J]. PROCEEDINGS OF THE 2020 IEEE INTERNATIONAL CONFERENCE ON HUMAN-MACHINE SYSTEMS (ICHMS), 2020 : 160 - 165
  • [9] Human-in-the-Loop Reinforcement Learning: A Survey and Position on Requirements, Challenges, and Opportunities
    Retzlaff, Carl Orge
    Das, Srijita
    Wayllace, Christabel
    Mousavi, Payam
    Afshari, Mohammad
    Yang, Tianpei
    Saranti, Anna
    Angerschmid, Alessa
    Taylor, Matthew E.
    Holzinger, Andreas
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2024, 79 : 359 - 415
  • [10] Toward Human-in-the-Loop PID Control Based on CACLA Reinforcement Learning
    Zhong, Junpei
    Li, Yanan
    [J]. INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2019, PT III, 2019, 11742 : 605 - 613