Distributed multi-agent deep reinforcement learning for cooperative multi-robot pursuit

被引：27

作者：

Yu, Chao ^{[1
]}

Dong, Yinzhao ^{[2
]}

Li, Yangning ^{[2
]}

Chen, Yatong ^{[2
]}

机构：

[1] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510006, Peoples R China

[2] Dalian Univ Technol, Sch Comp Sci & Technol, Dalian 116024, Peoples R China

来源：

JOURNAL OF ENGINEERING-JOE | 2020年 / 2020卷 / 13期

基金：

中国国家自然科学基金;

关键词：

multi-robot systems; learning systems; game theory; learning (artificial intelligence); multi-agent systems; control engineering computing; distributed control; environmental agents; pursuit-evasion problem; distributed multiagent deep reinforcement learning; distributed artificial intelligence; multirobot systems; multirobot pursuit game; deep RL methods; decentralised-execution scheme; multiagent deep RL approach; individual leaning update process; individual action output;

D O I：

10.1049/joe.2019.1200

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

As a popular research topic in the area of distributed artificial intelligence, the multi-robot pursuit problem is widely used as a testbed for evaluating coordinated and cooperative strategies in multi-robot systems. This study the problem of multi-robot pursuit game using reinforcement learning (RL) techniques is studied. Unlike most existing studies that apply fully centralised deep RL methods based on the centralised-learning and decentralised-execution scheme, the authors propose a fully decentralised multi-agent deep RL approach by modelling each agent as an individual deep RL agent that has its own individual learning system (i.e. individual action-value function, individual leaning update process, and individual action output). To realise coordination among agents, the limited information of other environmental agents is used as input of the learning process. Experimental results show that both distributed and centralised approaches can ultimately solve the pursuit-evasion problem in different dimensions, but the learning efficiency and coordination performance of the proposed distributed approach are much better than the traditional centralised approach.

引用

下载

页码：499 / 504

页数：6

共 50 条

[21] Multi-Agent Uncertainty Sharing for Cooperative Multi-Agent Reinforcement Learning
Chen, Hao
Yang, Guangkai
Zhang, Junge
Yin, Qiyue
Huang, Kaiqi
2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
[22] Multi-Agent Deep Reinforcement Learning for Distributed Load Restoration
Linh Vu
Tuyen Vu
Thanh Long Vu
Srivastava, Anurag
IEEE TRANSACTIONS ON SMART GRID, 2024, 15 (02) : 1749 - 1760
[23] Multi-agent deep reinforcement learning strategy for distributed energy
Xi, Lei
Sun, Mengmeng
Zhou, Huan
Xu, Yanchun
Wu, Junnan
Li, Yanying
MEASUREMENT, 2021, 185
[24] Negotiation agent based on Deep reinforcement learning for multi-agent cooperative distributed predictive control.
Aponte-Rengifo, O.
Francisco, M.
Vega, P.
IFAC PAPERSONLINE, 2023, 56 (02): : 1496 - 1501
[25] Multi-robot Cooperative Pursuit via Potential Field-Enhanced Reinforcement Learning
Zhang, Zheng
Wang, Xiaohan
Zhang, Qingrui
Hu, Tianjiang
2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, : 8808 - 8814
[26] Multi-Agent Reinforcement Learning With Distributed Targeted Multi-Agent Communication
Xu, Chi
Zhang, Hui
Zhang, Ya
2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 2915 - 2920
[27] Distributed Cooperative Multi-Agent Reinforcement Learning with Directed Coordination Graph
Jing, Gangshan
Bai, He
George, Jemin
Chakrabortty, Aranya
Sharma, Piyush K.
2022 AMERICAN CONTROL CONFERENCE, ACC, 2022, : 3273 - 3278
[28] Cooperative Multi-Agent Systems Using Distributed Reinforcement Learning Techniques
Zemzem, Wiem
Tagina, Moncef
KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KES-2018), 2018, 126 : 517 - 526
[29] Distributed cooperative reinforcement learning for multi-agent system with collision avoidance
Lan, Xuejing
Yan, Jiapei
He, Shude
Zhao, Zhijia
Zou, Tao
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024, 34 (01) : 567 - 585
[30] Distributed Deep Multi-Agent Reinforcement Learning for Cooperative Edge Caching in Internet-of-Vehicles
Zhou, Huan
Jiang, Kai
He, Shibo
Min, Geyong
Wu, Jie
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2023, 22 (12) : 9595 - 9609

← 1 2 3 4 5 →