Reinforcement Learning Requires Human-in-the-Loop Framing and Approaches

Authors
Taylor, Matthew E. [1, 2, 3]
Affiliations
[1] Univ Alberta, Edmonton, AB, Canada
[2] Alberta Machine Intelligence Inst, Edmonton, AB, Canada
[3] AI Redefined, Montreal, PQ, Canada
Funding
Natural Sciences and Engineering Research Council of Canada
Keywords
Reinforcement Learning; Human-Agent Interaction; Human in the Loop; Interactive Machine Learning;
DOI
10.3233/FAIA230098
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Reinforcement learning (RL) is typically framed as a machine learning paradigm where agents learn to act autonomously in complex environments. This paper argues instead that RL is fundamentally human in the loop (HitL). The reward functions (and other components) of a Markov decision process are defined by humans. The decisions to tackle a certain problem, and deploy a learned solution, are taken by humans. Humans can also play a critical role in providing information to the agent throughout its life cycle to better succeed at the problem in question. We end by highlighting a set of critical HitL research questions, which, if ignored, could cause RL to fail to live up to its full potential.
Pages: 351-360
Page count: 10