Distributed multi-agent deep reinforcement learning for cooperative multi-robot pursuit

被引:27
|
作者
Yu, Chao [1 ]
Dong, Yinzhao [2 ]
Li, Yangning [2 ]
Chen, Yatong [2 ]
机构
[1] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510006, Peoples R China
[2] Dalian Univ Technol, Sch Comp Sci & Technol, Dalian 116024, Peoples R China
来源
JOURNAL OF ENGINEERING-JOE | 2020年 / 2020卷 / 13期
基金
中国国家自然科学基金;
关键词
multi-robot systems; learning systems; game theory; learning (artificial intelligence); multi-agent systems; control engineering computing; distributed control; environmental agents; pursuit-evasion problem; distributed multiagent deep reinforcement learning; distributed artificial intelligence; multirobot systems; multirobot pursuit game; deep RL methods; decentralised-execution scheme; multiagent deep RL approach; individual leaning update process; individual action output;
D O I
10.1049/joe.2019.1200
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
As a popular research topic in the area of distributed artificial intelligence, the multi-robot pursuit problem is widely used as a testbed for evaluating coordinated and cooperative strategies in multi-robot systems. This study the problem of multi-robot pursuit game using reinforcement learning (RL) techniques is studied. Unlike most existing studies that apply fully centralised deep RL methods based on the centralised-learning and decentralised-execution scheme, the authors propose a fully decentralised multi-agent deep RL approach by modelling each agent as an individual deep RL agent that has its own individual learning system (i.e. individual action-value function, individual leaning update process, and individual action output). To realise coordination among agents, the limited information of other environmental agents is used as input of the learning process. Experimental results show that both distributed and centralised approaches can ultimately solve the pursuit-evasion problem in different dimensions, but the learning efficiency and coordination performance of the proposed distributed approach are much better than the traditional centralised approach.
引用
下载
收藏
页码:499 / 504
页数:6
相关论文
共 50 条
  • [21] Multi-Agent Uncertainty Sharing for Cooperative Multi-Agent Reinforcement Learning
    Chen, Hao
    Yang, Guangkai
    Zhang, Junge
    Yin, Qiyue
    Huang, Kaiqi
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [22] Multi-Agent Deep Reinforcement Learning for Distributed Load Restoration
    Linh Vu
    Tuyen Vu
    Thanh Long Vu
    Srivastava, Anurag
    IEEE TRANSACTIONS ON SMART GRID, 2024, 15 (02) : 1749 - 1760
  • [23] Multi-agent deep reinforcement learning strategy for distributed energy
    Xi, Lei
    Sun, Mengmeng
    Zhou, Huan
    Xu, Yanchun
    Wu, Junnan
    Li, Yanying
    MEASUREMENT, 2021, 185
  • [24] Negotiation agent based on Deep reinforcement learning for multi-agent cooperative distributed predictive control.
    Aponte-Rengifo, O.
    Francisco, M.
    Vega, P.
    IFAC PAPERSONLINE, 2023, 56 (02): : 1496 - 1501
  • [25] Multi-robot Cooperative Pursuit via Potential Field-Enhanced Reinforcement Learning
    Zhang, Zheng
    Wang, Xiaohan
    Zhang, Qingrui
    Hu, Tianjiang
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, : 8808 - 8814
  • [26] Multi-Agent Reinforcement Learning With Distributed Targeted Multi-Agent Communication
    Xu, Chi
    Zhang, Hui
    Zhang, Ya
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 2915 - 2920
  • [27] Distributed Cooperative Multi-Agent Reinforcement Learning with Directed Coordination Graph
    Jing, Gangshan
    Bai, He
    George, Jemin
    Chakrabortty, Aranya
    Sharma, Piyush K.
    2022 AMERICAN CONTROL CONFERENCE, ACC, 2022, : 3273 - 3278
  • [28] Cooperative Multi-Agent Systems Using Distributed Reinforcement Learning Techniques
    Zemzem, Wiem
    Tagina, Moncef
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KES-2018), 2018, 126 : 517 - 526
  • [29] Distributed cooperative reinforcement learning for multi-agent system with collision avoidance
    Lan, Xuejing
    Yan, Jiapei
    He, Shude
    Zhao, Zhijia
    Zou, Tao
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024, 34 (01) : 567 - 585
  • [30] Distributed Deep Multi-Agent Reinforcement Learning for Cooperative Edge Caching in Internet-of-Vehicles
    Zhou, Huan
    Jiang, Kai
    He, Shibo
    Min, Geyong
    Wu, Jie
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2023, 22 (12) : 9595 - 9609