Reinforcement Learning Requires Human-in-the-Loop Framing and Approaches

Cited by: 0
Authors
Taylor, Matthew E. [1, 2, 3]
Affiliations
[1] Univ Alberta, Edmonton, AB, Canada
[2] Alberta Machine Intelligence Inst, Edmonton, AB, Canada
[3] AI Redefined, Montreal, PQ, Canada
Source
Funding
Natural Sciences and Engineering Research Council of Canada (NSERC)
Keywords
Reinforcement Learning; Human-Agent Interaction; Human in the Loop; Interactive Machine Learning;
DOI
10.3233/FAIA230098
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Reinforcement learning (RL) is typically framed as a machine learning paradigm where agents learn to act autonomously in complex environments. This paper argues instead that RL is fundamentally human in the loop (HitL). The reward functions (and other components) of a Markov decision process are defined by humans. The decisions to tackle a certain problem, and deploy a learned solution, are taken by humans. Humans can also play a critical role in providing information to the agent throughout its life cycle to better succeed at the problem in question. We end by highlighting a set of critical HitL research questions, which, if ignored, could cause RL to fail to live up to its full potential.
Pages: 351-360
Page count: 10