DARL1N: Distributed multi-Agent Reinforcement Learning with One-hop Neighbors

Cited by: 5
Authors
Wang, Baoqian [1 ,2 ]
Xie, Junfei [3 ]
Atanasov, Nikolay [1 ]
Affiliations
[1] Univ Calif San Diego, Dept Elect & Comp Engn, La Jolla, CA 92093 USA
[2] San Diego State Univ, La Jolla, CA 92182 USA
[3] San Diego State Univ, Dept Elect & Comp Engn, San Diego, CA 92182 USA
DOI
10.1109/IROS47612.2022.9981441
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Multi-agent reinforcement learning (MARL) methods face a curse of dimensionality in the policy and value function representations as the number of agents increases. The development of distributed or parallel training techniques is also hindered by the global coupling among the agent dynamics, requiring simultaneous state transitions. This paper introduces Distributed multi-Agent Reinforcement Learning with One-hop Neighbors (DARL1N). DARL1N is an off-policy actor-critic MARL method that breaks the curse of dimensionality and achieves distributed training by restricting the agent interactions to one-hop neighborhoods. Each agent optimizes its value and policy functions over a one-hop neighborhood, reducing the representation complexity, yet maintaining expressiveness by training with varying numbers and states of neighbors. This structure enables the key contribution of DARL1N: a distributed training procedure in which each compute node simulates the state transitions of only a small subset of the agents, greatly accelerating the training of large-scale MARL policies. Comparisons with state-of-the-art MARL methods show that DARL1N significantly reduces training time without sacrificing policy quality as the number of agents increases.
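The one-hop restriction described in the abstract can be illustrated with a minimal sketch. The abstract does not say how neighborhoods are defined, so this example assumes a simple distance threshold; the names `one_hop_neighbors`, `local_input`, and `radius` are hypothetical and are not taken from the paper. It only shows how each agent's input could be limited to its own state plus its neighbors' states, so that the input size depends on the neighborhood rather than on the total number of agents.

```python
import math


def one_hop_neighbors(positions, radius):
    """For each agent, list the indices of other agents within `radius`.

    Assumption: neighborhoods are defined by Euclidean distance.
    """
    neighbors = []
    for i, p in enumerate(positions):
        near = [
            j
            for j, q in enumerate(positions)
            if j != i and math.dist(p, q) <= radius
        ]
        neighbors.append(near)
    return neighbors


def local_input(states, neighbors, i):
    """Concatenate agent i's state with the states of its one-hop neighbors."""
    local = list(states[i])
    for j in neighbors[i]:
        local.extend(states[j])
    return local


# Four agents in the plane: two close pairs that are far from each other.
positions = [(0.0, 0.0), (1.0, 0.0), (5.0, 5.0), (5.5, 5.0)]
neighbors = one_hop_neighbors(positions, radius=1.5)
agent0_input = local_input(positions, neighbors, 0)
```

Here agent 0 only sees agent 1, so its local input has four entries instead of the eight a fully coupled representation would need.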
Pages: 9003-9010
Page count: 8
Related papers
50 in total
  • [31] Distributed cooperative reinforcement learning for multi-agent system with collision avoidance
    Lan, Xuejing
    Yan, Jiapei
    He, Shude
    Zhao, Zhijia
    Zou, Tao
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024, 34 (01) : 567 - 585
  • [32] Distributed Multi-Agent Reinforcement Learning by Actor-Critic Method
    Heredia, Paulo C.
    Mou, Shaoshuai
    IFAC PAPERSONLINE, 2019, 52 (20) : 363 - 368
  • [33] Multi-agent systems on sensor networks: A distributed reinforcement learning approach
    Tham, CK
    Renaud, JC
    PROCEEDINGS OF THE 2005 INTELLIGENT SENSORS, SENSOR NETWORKS & INFORMATION PROCESSING CONFERENCE, 2005, : 423 - 429
  • [34] Online Reinforcement Learning in Multi-Agent Systems for Distributed Energy Systems
    Menon, Bharat R.
    Menon, Sangeetha B.
    Srinivasan, Dipti
    Jain, Lakhmi
    2014 IEEE INNOVATIVE SMART GRID TECHNOLOGIES - ASIA (ISGT ASIA), 2014, : 791 - 796
  • [35] Relative Distributed Formation and Obstacle Avoidance with Multi-agent Reinforcement Learning
    Yan, Yuzi
    Li, Xiaoxiang
    Qiu, Xinyou
    Qiu, Jiantao
    Wang, Jian
    Wang, Yu
    Shen, Yuan
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022, : 1661 - 1667
  • [36] Distributed Multi-Agent Reinforcement Learning and Its application to Robot Soccer
    Fan, Bo
    Pu, Jiexin
    2008 INTERNATIONAL WORKSHOP ON EDUCATION TECHNOLOGY AND TRAINING AND 2008 INTERNATIONAL WORKSHOP ON GEOSCIENCE AND REMOTE SENSING, VOL 1, PROCEEDINGS, 2009, : 667 - 671
  • [37] Multi-Agent Deep Reinforcement Learning Based Distributed Resource Allocation
    Urmonov, Odilbek
    Kim, HyungWon
    2021 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2021,
  • [38] Distributed Task Offloading based on Multi-Agent Deep Reinforcement Learning
    Hu, Shucheng
    Ren, Tao
    Niu, Jianwei
    Hu, Zheyuan
    Xing, Guoliang
    2021 17TH INTERNATIONAL CONFERENCE ON MOBILITY, SENSING AND NETWORKING (MSN 2021), 2021, : 575 - 583
  • [39] Transactive Multi-Agent Reinforcement Learning for Distributed Energy Price Localization
    Spangher, Lucas
    BUILDSYS'21: PROCEEDINGS OF THE 2021 ACM INTERNATIONAL CONFERENCE ON SYSTEMS FOR ENERGY-EFFICIENT BUILT ENVIRONMENTS, 2021, : 244 - 245
  • [40] Multi-Hop UAV Relay Covert Communication: A Multi-Agent Reinforcement Learning Approach
    Bai, Hengzhi
    Wang, Haichao
    Du, Jiatao
    He, Rongrong
    Li, Guoxin
    Xu, Yuhua
    2024 INTERNATIONAL CONFERENCE ON UBIQUITOUS COMMUNICATION, UCOM 2024, 2024, : 356 - 360