DARL1N: Distributed multi-Agent Reinforcement Learning with One-hop Neighbors

Cited by: 5

Authors:

Wang, Baoqian [1,2]

Xie, Junfei [3]

Atanasov, Nikolay [1]

Affiliations:

[1] Univ Calif San Diego, Dept Elect & Comp Engn, La Jolla, CA 92093 USA

[2] San Diego State Univ, La Jolla, CA 92182 USA

[3] San Diego State Univ, Dept Elect & Comp Engn, San Diego, CA 92182 USA

Source:

2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2022

DOI:

10.1109/IROS47612.2022.9981441

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

Multi-agent reinforcement learning (MARL) methods face a curse of dimensionality in the policy and value function representations as the number of agents increases. The development of distributed or parallel training techniques is also hindered by the global coupling among the agent dynamics, requiring simultaneous state transitions. This paper introduces Distributed multi-Agent Reinforcement Learning with One-hop Neighbors (DARL1N). DARL1N is an off-policy actor-critic MARL method that breaks the curse of dimensionality and achieves distributed training by restricting the agent interactions to one-hop neighborhoods. Each agent optimizes its value and policy functions over a one-hop neighborhood, reducing the representation complexity, yet maintaining expressiveness by training with varying numbers and states of neighbors. This structure enables the key contribution of DARL1N: a distributed training procedure in which each compute node simulates the state transitions of only a small subset of the agents, greatly accelerating the training of large-scale MARL policies. Comparisons with state-of-the-art MARL methods show that DARL1N significantly reduces training time without sacrificing policy quality as the number of agents increases.
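
The one-hop restriction described in the abstract can be illustrated with a minimal Python sketch. The abstract does not say how neighborhoods are defined, so this sketch assumes that two agents are one-hop neighbors when their Euclidean distance is at most a fixed radius. The function names `one_hop_neighbors` and `local_critic_input`, the radius value, and the toy positions are all hypothetical and are not taken from the paper.

```python
import math


def one_hop_neighbors(positions, radius):
    """Map each agent index to the other agents within `radius` (assumed rule)."""
    neighbors = {}
    for i, (xi, yi) in enumerate(positions):
        neighbors[i] = [
            j
            for j, (xj, yj) in enumerate(positions)
            if j != i and math.hypot(xi - xj, yi - yj) <= radius
        ]
    return neighbors


def local_critic_input(positions, actions, i, neighbors):
    """Gather (state, action) pairs for agent i and its one-hop neighbors only.

    The input size depends on the neighborhood, not on the total agent count.
    """
    group = [i] + neighbors[i]
    return [(positions[j], actions[j]) for j in group]


# Toy example: four agents in the plane, one of them isolated.
positions = [(0.0, 0.0), (1.0, 0.0), (5.0, 5.0), (0.0, 1.5)]
actions = [0, 1, 2, 3]
nbrs = one_hop_neighbors(positions, radius=2.0)
isolated_input = local_critic_input(positions, actions, 2, nbrs)
```

In this toy setup, the value function input for agent 2 contains only its own state and action, while agent 0 sees itself plus two neighbors. This is the sense in which the representation grows with neighborhood size rather than with the number of agents.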

Pages: 9003-9010

Page count: 8

