Leveraging Expert Demonstrations in Robot Cooperation with Multi-Agent Reinforcement Learning

被引：1

作者：

Zhang, Zhaolong ^{[1
]}

Li, Yihui ^{[1
]}

Rojas, Juan ^{[2
]}

Guan, Yisheng ^{[1
]}

机构：

[1] Guangdong Univ Technol, Biomimet & Intelligent Robot Lab BIRL, Guangzhou 510006, Peoples R China

[2] Chinese Univ Hong Kong, Dept Mech & Automat Engn, Hong Kong, Peoples R China

来源：

INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2021, PT II | 2021年 / 13014卷

关键词：

Reinforcement learning; Imitation learning; Robot manipulation; Robot learning;

D O I：

10.1007/978-3-030-89098-8_20

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

While deep reinforcement learning (DRL) enhances the flexibility and intelligence of a single robot, it has proven challenging to solve the cooperatively of even basic tasks. And robotic manipulation is cumbersome and can easily yield getting trapped in local optima with reward shaping. As such sparse rewards are an attractive alternative. In this paper, we demonstrate how teams of robots are able to solve cooperative tasks. Additionally, we provide insights on how to facilitate exploration and faster learning in collaborative systems. First, we increased the amount of effective data samples in the replay buffer by leveraging virtual targets. Secondly, we introduce a small number of expert demonstrations to guide the robot during training via an additional loss that forces the policy network to learn the expert data faster. Finally, to improve the quality of behavior cloning, we propose a Judge mechanism that updates the strategy by selecting optimal action while training. Furthermore, our algorithms were tested in simulation using both dual arms and teams of two robots with single arms.

引用

页码：211 / 222

页数：12

共 50 条

[1] Expert demonstrations guide reward decomposition for multi-agent cooperation
Liu Weiwei
Jing Wei
Liu Shanqi
Ruan Yudi
Zhang Kexin
Yang Jiang
Liu Yong
[J]. Neural Computing and Applications, 2023, 35 : 19847 - 19863
[2] Expert demonstrations guide reward decomposition for multi-agent cooperation
Liu, Weiwei
Jing, Wei
Liu, Shanqi
Ruan, Yudi
Zhang, Kexin
Yang, Jiang
Liu, Yong
[J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (27): : 19847 - 19863
[3] Heterogeneous Multi-Robot Cooperation With Asynchronous Multi-Agent Reinforcement Learning
Zhang, Han
Zhang, Xiaohui
Feng, Zhao
Xiao, Xiaohui
[J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (01): : 159 - 166
[4] Multi-Agent Cognition Difference Reinforcement Learning for Multi-Agent Cooperation
Wang, Huimu
Qiu, Tenghai
Liu, Zhen
Pu, Zhiqiang
Yi, Jianqiang
Yuan, Wanmai
[J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
[5] Leveraging Partial Symmetry for Multi-Agent Reinforcement Learning
Yu, Xin
Shi, Rongye
Feng, Pu
Tian, Yongkai
Li, Simin
Liao, Shuhao
Wu, Wenjun
[J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 16, 2024, : 17583 - 17590
[6] A cooperation model using reinforcement learning for multi-agent
Lee, M
Lee, J
Jeong, HJ
Lee, Y
Choi, S
Gatton, TM
[J]. COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2006, PT 5, 2006, 3984 : 675 - 681
[7] Hybrid Learning for Multi-agent Cooperation with Sub-optimal Demonstrations
Peng, Peixi
Xing, Junliang
Cao, Lili
[J]. PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 3037 - 3043
[8] A multi-agent reinforcement learning approach to robot soccer
Yong Duan
Bao Xia Cui
Xin He Xu
[J]. Artificial Intelligence Review, 2012, 38 : 193 - 211
[9] A multi-agent reinforcement learning approach to robot soccer
Duan, Yong
Cui, Bao Xia
Xu, Xin He
[J]. ARTIFICIAL INTELLIGENCE REVIEW, 2012, 38 (03) : 193 - 211
[10] Quantum Multi-Agent Reinforcement Learning for Autonomous Mobility Cooperation
Park, Soohyun
Kim, Jae Pyoung
Park, Chanyoung
Jung, Soyi
Kim, Joongheon
[J]. IEEE COMMUNICATIONS MAGAZINE, 2024, 62 (06) : 106 - 112

← 1 2 3 4 5 →