Leveraging Expert Demonstrations in Robot Cooperation with Multi-Agent Reinforcement Learning

被引:1
|
作者
Zhang, Zhaolong [1 ]
Li, Yihui [1 ]
Rojas, Juan [2 ]
Guan, Yisheng [1 ]
机构
[1] Guangdong Univ Technol, Biomimet & Intelligent Robot Lab BIRL, Guangzhou 510006, Peoples R China
[2] Chinese Univ Hong Kong, Dept Mech & Automat Engn, Hong Kong, Peoples R China
关键词
Reinforcement learning; Imitation learning; Robot manipulation; Robot learning;
D O I
10.1007/978-3-030-89098-8_20
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
While deep reinforcement learning (DRL) enhances the flexibility and intelligence of a single robot, it has proven challenging to solve the cooperatively of even basic tasks. And robotic manipulation is cumbersome and can easily yield getting trapped in local optima with reward shaping. As such sparse rewards are an attractive alternative. In this paper, we demonstrate how teams of robots are able to solve cooperative tasks. Additionally, we provide insights on how to facilitate exploration and faster learning in collaborative systems. First, we increased the amount of effective data samples in the replay buffer by leveraging virtual targets. Secondly, we introduce a small number of expert demonstrations to guide the robot during training via an additional loss that forces the policy network to learn the expert data faster. Finally, to improve the quality of behavior cloning, we propose a Judge mechanism that updates the strategy by selecting optimal action while training. Furthermore, our algorithms were tested in simulation using both dual arms and teams of two robots with single arms.
引用
收藏
页码:211 / 222
页数:12
相关论文
共 50 条
  • [1] Expert demonstrations guide reward decomposition for multi-agent cooperation
    Liu Weiwei
    Jing Wei
    Liu Shanqi
    Ruan Yudi
    Zhang Kexin
    Yang Jiang
    Liu Yong
    [J]. Neural Computing and Applications, 2023, 35 : 19847 - 19863
  • [2] Expert demonstrations guide reward decomposition for multi-agent cooperation
    Liu, Weiwei
    Jing, Wei
    Liu, Shanqi
    Ruan, Yudi
    Zhang, Kexin
    Yang, Jiang
    Liu, Yong
    [J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (27): : 19847 - 19863
  • [3] Heterogeneous Multi-Robot Cooperation With Asynchronous Multi-Agent Reinforcement Learning
    Zhang, Han
    Zhang, Xiaohui
    Feng, Zhao
    Xiao, Xiaohui
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (01): : 159 - 166
  • [4] Multi-Agent Cognition Difference Reinforcement Learning for Multi-Agent Cooperation
    Wang, Huimu
    Qiu, Tenghai
    Liu, Zhen
    Pu, Zhiqiang
    Yi, Jianqiang
    Yuan, Wanmai
    [J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [5] Leveraging Partial Symmetry for Multi-Agent Reinforcement Learning
    Yu, Xin
    Shi, Rongye
    Feng, Pu
    Tian, Yongkai
    Li, Simin
    Liao, Shuhao
    Wu, Wenjun
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 16, 2024, : 17583 - 17590
  • [6] A cooperation model using reinforcement learning for multi-agent
    Lee, M
    Lee, J
    Jeong, HJ
    Lee, Y
    Choi, S
    Gatton, TM
    [J]. COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2006, PT 5, 2006, 3984 : 675 - 681
  • [7] Hybrid Learning for Multi-agent Cooperation with Sub-optimal Demonstrations
    Peng, Peixi
    Xing, Junliang
    Cao, Lili
    [J]. PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 3037 - 3043
  • [8] A multi-agent reinforcement learning approach to robot soccer
    Yong Duan
    Bao Xia Cui
    Xin He Xu
    [J]. Artificial Intelligence Review, 2012, 38 : 193 - 211
  • [9] A multi-agent reinforcement learning approach to robot soccer
    Duan, Yong
    Cui, Bao Xia
    Xu, Xin He
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2012, 38 (03) : 193 - 211
  • [10] Quantum Multi-Agent Reinforcement Learning for Autonomous Mobility Cooperation
    Park, Soohyun
    Kim, Jae Pyoung
    Park, Chanyoung
    Jung, Soyi
    Kim, Joongheon
    [J]. IEEE COMMUNICATIONS MAGAZINE, 2024, 62 (06) : 106 - 112