Human Social Feedback for Efficient Interactive Reinforcement Agent Learning

被引:0
|
作者
Lin, Jinying [1 ]
Zhang, Qilei [1 ]
Gomez, Randy [2 ]
Nakamura, Keisuke [2 ]
He, Bo [1 ]
Li, Guangliang [1 ]
机构
[1] Ocean Univ China, Coll Informat Sci & Engn, Songling Rd 238, Qingdao 266100, Shandong, Peoples R China
[2] Honda Res Inst Japan Co Ltd, Wako, Saitama, Japan
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As a branch of reinforcement learning, interactive reinforcement learning mainly studies the interaction process between humans and agents, allowing agents to learn from the intentions of human users and adapt to their preferences. In most of the current studies, human users need to intentionally provide explicit feedback via pressing keyboard buttons or mouse clicks. However, in our paper, we proposed an interactive reinforcement learning method that facilitates an agent to learn from human social signals facial feedback via a ordinary camera and gestural feedback via a leap motion sensor. Our method provides a natural way for ordinary people to train agents how to perform a task according to their preferences. We tested our method in two reinforcement learning benchmarking domains LoopMaze and Tetris, and compared to the state of the art the TAMER framework. Our experimental results show that when learning from facial feedback the recognition of which is very low, the TAMER agent can get a similar performance to that of learning from keypress feedback with slightly more feedback. When learning from gestural feedback with a more accurate recognition, the TAMER agent can obtain a similar performance to that of learning from keypress feedback with much less feedback received. Moreover, our results indicate that the recognition error of facial feedback has a large effect on the agent performance in the beginning training process than in the later training stage. Finally, our results indicate that with enough recognition accuracy, human social signals can effectively improve the learning efficiency of agents with less human feedback.
引用
收藏
页码:706 / 712
页数:7
相关论文
共 50 条
  • [1] A Review on Interactive Reinforcement Learning From Human Social Feedback
    Lin, Jinying
    Ma, Zhen
    Gomez, Randy
    Nakamura, Keisuke
    He, Bo
    Li, Guangliang
    [J]. IEEE ACCESS, 2020, 8 : 120757 - 120765
  • [2] Human Feedback as Action Assignment in Interactive Reinforcement Learning
    Raza, Syed Ali
    Williams, Mary-Anne
    [J]. ACM TRANSACTIONS ON AUTONOMOUS AND ADAPTIVE SYSTEMS, 2020, 14 (04)
  • [3] Interactive Reinforcement Learning from Demonstration and Human Evaluative Feedback
    Li, Guangliang
    Gomez, Randy
    Nakamura, Keisuke
    He, Bo
    [J]. 2018 27TH IEEE INTERNATIONAL SYMPOSIUM ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION (IEEE RO-MAN 2018), 2018, : 1156 - 1162
  • [4] Interactive Reinforcement Learning with Inaccurate Feedback
    Faulkner, Thylor A. Kessler
    Short, Elaine Schaertl
    Thomaz, Andrea L.
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 7498 - 7504
  • [5] Deep Reinforcement Learning with Interactive Feedback in a Human-Robot Environment
    Moreira, Ithan
    Rivas, Javier
    Cruz, Francisco
    Dazeley, Richard
    Ayala, Angel
    Fernandes, Bruno
    [J]. APPLIED SCIENCES-BASEL, 2020, 10 (16):
  • [6] Towards interactive reinforcement learning with intrinsic feedback
    Poole, Benjamin
    Lee, Minwoo
    [J]. NEUROCOMPUTING, 2024, 587
  • [7] Improving Reinforcement Learning with Interactive Feedback and Affordances
    Cruz, Francisco
    Magg, Sven
    Weber, Cornelius
    Wermter, Stefan
    [J]. FOUTH JOINT IEEE INTERNATIONAL CONFERENCES ON DEVELOPMENT AND LEARNING AND EPIGENETIC ROBOTICS (IEEE ICDL-EPIROB 2014), 2014, : 165 - 170
  • [8] Interactive Lungs Auscultation with Reinforcement Learning Agent
    Grzywalski, Tomasz
    Belluzzo, Riccardo
    Drgas, Szymon
    Cwalinska, Agnieszka
    Hafke-Dys, Honorata
    [J]. PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE (ICAART), VOL 2, 2019, : 824 - 832
  • [9] Human-Interactive Subgoal Supervision for Efficient Inverse Reinforcement Learning
    Pan, Xinlei
    Shen, Yilin
    [J]. PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS (AAMAS' 18), 2018, : 1380 - 1387
  • [10] Sample and Feedback Efficient Hierarchical Reinforcement Learning from Human Preferences
    Pinsler, Robert
    Akrour, Riad
    Osa, Takayuki
    Peters, Jan
    Neumann, Gerhard
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2018, : 596 - 601