Human Social Feedback for Efficient Interactive Reinforcement Agent Learning

被引：0

作者：

Lin, Jinying ^{[1
]}

Zhang, Qilei ^{[1
]}

Gomez, Randy ^{[2
]}

Nakamura, Keisuke ^{[2
]}

He, Bo ^{[1
]}

Li, Guangliang ^{[1
]}

机构：

[1] Ocean Univ China, Coll Informat Sci & Engn, Songling Rd 238, Qingdao 266100, Shandong, Peoples R China

[2] Honda Res Inst Japan Co Ltd, Wako, Saitama, Japan

来源：

2020 29TH IEEE INTERNATIONAL CONFERENCE ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION (RO-MAN) | 2020年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

As a branch of reinforcement learning, interactive reinforcement learning mainly studies the interaction process between humans and agents, allowing agents to learn from the intentions of human users and adapt to their preferences. In most of the current studies, human users need to intentionally provide explicit feedback via pressing keyboard buttons or mouse clicks. However, in our paper, we proposed an interactive reinforcement learning method that facilitates an agent to learn from human social signals facial feedback via a ordinary camera and gestural feedback via a leap motion sensor. Our method provides a natural way for ordinary people to train agents how to perform a task according to their preferences. We tested our method in two reinforcement learning benchmarking domains LoopMaze and Tetris, and compared to the state of the art the TAMER framework. Our experimental results show that when learning from facial feedback the recognition of which is very low, the TAMER agent can get a similar performance to that of learning from keypress feedback with slightly more feedback. When learning from gestural feedback with a more accurate recognition, the TAMER agent can obtain a similar performance to that of learning from keypress feedback with much less feedback received. Moreover, our results indicate that the recognition error of facial feedback has a large effect on the agent performance in the beginning training process than in the later training stage. Finally, our results indicate that with enough recognition accuracy, human social signals can effectively improve the learning efficiency of agents with less human feedback.

引用

页码：706 / 712

页数：7

共 50 条

[1] A Review on Interactive Reinforcement Learning From Human Social Feedback
Lin, Jinying
Ma, Zhen
Gomez, Randy
Nakamura, Keisuke
He, Bo
Li, Guangliang
[J]. IEEE ACCESS, 2020, 8 : 120757 - 120765
[2] Human Feedback as Action Assignment in Interactive Reinforcement Learning
Raza, Syed Ali
Williams, Mary-Anne
[J]. ACM TRANSACTIONS ON AUTONOMOUS AND ADAPTIVE SYSTEMS, 2020, 14 (04)
[3] Interactive Reinforcement Learning from Demonstration and Human Evaluative Feedback
Li, Guangliang
Gomez, Randy
Nakamura, Keisuke
He, Bo
[J]. 2018 27TH IEEE INTERNATIONAL SYMPOSIUM ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION (IEEE RO-MAN 2018), 2018, : 1156 - 1162
[4] Interactive Reinforcement Learning with Inaccurate Feedback
Faulkner, Thylor A. Kessler
Short, Elaine Schaertl
Thomaz, Andrea L.
[J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 7498 - 7504
[5] Deep Reinforcement Learning with Interactive Feedback in a Human-Robot Environment
Moreira, Ithan
Rivas, Javier
Cruz, Francisco
Dazeley, Richard
Ayala, Angel
Fernandes, Bruno
[J]. APPLIED SCIENCES-BASEL, 2020, 10 (16):
[6] Towards interactive reinforcement learning with intrinsic feedback
Poole, Benjamin
Lee, Minwoo
[J]. NEUROCOMPUTING, 2024, 587
[7] Improving Reinforcement Learning with Interactive Feedback and Affordances
Cruz, Francisco
Magg, Sven
Weber, Cornelius
Wermter, Stefan
[J]. FOUTH JOINT IEEE INTERNATIONAL CONFERENCES ON DEVELOPMENT AND LEARNING AND EPIGENETIC ROBOTICS (IEEE ICDL-EPIROB 2014), 2014, : 165 - 170
[8] Interactive Lungs Auscultation with Reinforcement Learning Agent
Grzywalski, Tomasz
Belluzzo, Riccardo
Drgas, Szymon
Cwalinska, Agnieszka
Hafke-Dys, Honorata
[J]. PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE (ICAART), VOL 2, 2019, : 824 - 832
[9] Human-Interactive Subgoal Supervision for Efficient Inverse Reinforcement Learning
Pan, Xinlei
Shen, Yilin
[J]. PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS (AAMAS' 18), 2018, : 1380 - 1387
[10] Sample and Feedback Efficient Hierarchical Reinforcement Learning from Human Preferences
Pinsler, Robert
Akrour, Riad
Osa, Takayuki
Peters, Jan
Neumann, Gerhard
[J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2018, : 596 - 601

← 1 2 3 4 5 →