Cooperative Multi-Robot Hierarchical Reinforcement Learning

被引:0
|
作者
Setyawan, Gembong Edhi [1 ]
Hartono, Pitoyo [2 ]
Sawada, Hideyuki [1 ]
机构
[1] Waseda Univ, Sch Adv Sci & Engn, Dept Appl Phys, 3-4-1 Okubo,Shinjuku ku, Tokyo 1698555, Japan
[2] Chukyo Univ, Sch Engn, 101-2 Yagoto Honmachi,Showa ku, Nagoya, Aichi 4668666, Japan
关键词
Multi-robot system; hierarchical deep reinforcement learning; path-finding; task decomposition;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Recent advances in multi-robot deep reinforcement learning have made it possible to perform efficient exploration in problem space, but it remains a significant challenge in many complex domains. To alleviate this problem, a hierarchical approach has been designed in which agents can operate at many levels to complete tasks more efficiently. This paper proposes a novel technique called Multi-Agent Hierarchical Deep Deterministic Policy Gradient that combines the benefits of multiple robot systems with the hierarchical system used in Deep Reinforcement Learning. Here, agents acquire the ability to decompose a problem into simpler subproblems with varying time scales. Furthermore, this study develops a framework to formulate tasks into multiple levels. The upper levels function to learn policies for defining lower levels' subgoals, whereas the lowest level depicts robot's learning policies for primitive actions in the real environment. The proposed method is implemented and validated in a modified Multiple Particle Environment (MPE) scenario.
引用
收藏
页码:35 / 44
页数:10
相关论文
共 50 条
  • [1] Cooperative Multi-Robot Task Allocation with Reinforcement Learning
    Park, Bumjin
    Kang, Cheongwoong
    Choi, Jaesik
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (01):
  • [2] A Reinforcement Learning Algorithm in Cooperative Multi-Robot Domains
    Fernando Fern??ndez
    Daniel Borrajo
    Lynne E. Parker
    [J]. Journal of Intelligent and Robotic Systems, 2005, 43 : 161 - 174
  • [3] Multi-robot cooperation based on hierarchical reinforcement learning
    Cheng, Xiaobei
    Shen, Jing
    Liu, Haibo
    Gu, Guochang
    [J]. COMPUTATIONAL SCIENCE - ICCS 2007, PT 3, PROCEEDINGS, 2007, 4489 : 90 - +
  • [4] A reinforcement learning algorithm in cooperative multi-robot domains
    Fernández, F
    Borrajo, D
    Parker, LE
    [J]. JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2005, 43 (2-4) : 161 - 174
  • [5] Fuzzy Policy Reinforcement Learning in Cooperative Multi-robot Systems
    Dongbing Gu
    Erfu Yang
    [J]. Journal of Intelligent and Robotic Systems, 2007, 48 : 7 - 22
  • [6] Multi-robot cooperative behavior generation based on reinforcement learning
    Li, Dong-Mei
    Chen, Wei-Dong
    Xi, Yu-Geng
    [J]. Shanghai Jiaotong Daxue Xuebao/Journal of Shanghai Jiaotong University, 2005, 39 (08): : 1331 - 1335
  • [7] Fuzzy policy reinforcement learning in cooperative multi-robot systems
    Gu, Dongbing
    Yang, Erfu
    [J]. JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2007, 48 (01) : 7 - 22
  • [8] Cooperative Multi-Robot Reinforcement Learning: A Framework in Hybrid State Space
    Sun, Xueqing
    Mao, Tao
    Kralik, Jerald D.
    Ray, Laura E.
    [J]. 2009 IEEE-RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, 2009, : 1190 - 1196
  • [9] Cooperative Multi-Robot Navigation in Dynamic Environment with Deep Reinforcement Learning
    Han, Ruihua
    Chen, Shengduo
    Hao, Qi
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 448 - 454
  • [10] A Combined Hierarchical Reinforcement Learning Based Approach For Multi-robot Cooperative Target Searching in Complex Unknown Environments
    Cai, Yifan
    Yang, Simon X.
    Xu, Xin
    [J]. PROCEEDINGS OF THE 2013 IEEE SYMPOSIUM ON ADAPTIVE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING (ADPRL), 2013, : 52 - 59