Human Social Feedback for Efficient Interactive Reinforcement Agent Learning

被引:0
|
作者
Lin, Jinying [1 ]
Zhang, Qilei [1 ]
Gomez, Randy [2 ]
Nakamura, Keisuke [2 ]
He, Bo [1 ]
Li, Guangliang [1 ]
机构
[1] Ocean Univ China, Coll Informat Sci & Engn, Songling Rd 238, Qingdao 266100, Shandong, Peoples R China
[2] Honda Res Inst Japan Co Ltd, Wako, Saitama, Japan
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As a branch of reinforcement learning, interactive reinforcement learning mainly studies the interaction process between humans and agents, allowing agents to learn from the intentions of human users and adapt to their preferences. In most of the current studies, human users need to intentionally provide explicit feedback via pressing keyboard buttons or mouse clicks. However, in our paper, we proposed an interactive reinforcement learning method that facilitates an agent to learn from human social signals facial feedback via a ordinary camera and gestural feedback via a leap motion sensor. Our method provides a natural way for ordinary people to train agents how to perform a task according to their preferences. We tested our method in two reinforcement learning benchmarking domains LoopMaze and Tetris, and compared to the state of the art the TAMER framework. Our experimental results show that when learning from facial feedback the recognition of which is very low, the TAMER agent can get a similar performance to that of learning from keypress feedback with slightly more feedback. When learning from gestural feedback with a more accurate recognition, the TAMER agent can obtain a similar performance to that of learning from keypress feedback with much less feedback received. Moreover, our results indicate that the recognition error of facial feedback has a large effect on the agent performance in the beginning training process than in the later training stage. Finally, our results indicate that with enough recognition accuracy, human social signals can effectively improve the learning efficiency of agents with less human feedback.
引用
收藏
页码:706 / 712
页数:7
相关论文
共 50 条
  • [21] Boosting Feedback Efficiency of Interactive Reinforcement Learning by Adaptive Learning from Scores
    Liu, Shukai
    Wu, Chenming
    Li, Ying
    Zhang, Liangjun
    [J]. 2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 7561 - 7567
  • [22] Efficient Interactive Multiclass Learning from Binary Feedback
    Ngo, Hung
    Luciw, Matthew
    Nagi, Jawas
    Forster, Alexander
    Schmidhuber, Jurgen
    Vien, Ngo Anh
    [J]. ACM TRANSACTIONS ON INTERACTIVE INTELLIGENT SYSTEMS, 2014, 4 (03)
  • [23] PEBBLE: Feedback-Efficient Interactive Reinforcement Learning via Relabeling Experience and Unsupervised Pre-training
    Lee, Kimin
    Smith, Laura
    Abbeel, Pieter
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [24] Simple Agent, Complex Environment: Efficient Reinforcement Learning with Agent States
    Dong, Shi
    Van Roy, Benjamin
    Zhou, Zhengyuan
    [J]. Journal of Machine Learning Research, 2022, 23
  • [25] Simple Agent, Complex Environment: Efficient Reinforcement Learning with Agent States
    Dong, Shi
    Van Roy, Benjamin
    Zhou, Zhengyuan
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2022, 23
  • [26] Quantifying the effect of feedback frequency in interactive reinforcement learning for robotic tasks
    Daniel Harnack
    Julie Pivin-Bachler
    Nicolás Navarro-Guerrero
    [J]. Neural Computing and Applications, 2023, 35 : 16931 - 16943
  • [27] Quantifying the effect of feedback frequency in interactive reinforcement learning for robotic tasks
    Harnack, Daniel
    Pivin-Bachler, Julie
    Navarro-Guerrero, Nicolas
    [J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (23): : 16931 - 16943
  • [28] SCALEX: SCALability EXploration of Multi-Agent Reinforcement Learning Agents in Grid-Interactive Efficient Buildings
    Almilaify, Yara
    Nweye, Kingsley
    Nagy, Zoltan
    [J]. PROCEEDINGS OF THE 10TH ACM INTERNATIONAL CONFERENCE ON SYSTEMS FOR ENERGY-EFFICIENT BUILDINGS, CITIES, AND TRANSPORTATION, BUILDSYS 2023, 2023, : 261 - 264
  • [29] SOCIAL LEARNING THEORY AND HUMAN REINFORCEMENT
    Brauer, Jonathan R.
    Tittle, Charles R.
    [J]. SOCIOLOGICAL SPECTRUM, 2012, 32 (02) : 157 - 177
  • [30] Interactive Explanations: Diagnosis and Repair of Reinforcement Learning Based Agent Behaviors
    Cruz, Christian Arzate
    Igarashi, Takeo
    [J]. 2021 IEEE CONFERENCE ON GAMES (COG), 2021, : 159 - 166