Networked Multi-Agent Reinforcement Learning in Continuous Spaces

被引:0
|
作者
Zhang, Kaiqing [1 ]
Yang, Zhuoran [2 ]
Basar, Tamer [1 ]
机构
[1] Univ Illinois, Coordinated Sci Lab, Champaign, IL 61820 USA
[2] Princeton Univ, Dept Operat Res & Financial Engn, Princeton, NJ 08544 USA
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Many real-world tasks on practical control systems involve the learning and decision-making of multiple agents, under limited communications and observations. In this paper, we study the problem of networked multi-agent reinforcement learning (MARL), where multiple agents perform reinforcement learning in a common environment, and are able to exchange information via a possibly time-varying communication network. In particular, we focus on a collaborative MARL setting where each agent has individual reward functions, and the objective of all the agents is to maximize the network-wide averaged long-term return. To this end, we propose a fully decentralized actor-critic algorithm that only relies on neighbor-to-neighbor communications among agents. To promote the use of the algorithm on practical control systems, we focus on the setting with continuous state and action spaces, and adopt the newly proposed expected policy gradient to reduce the variance of the gradient estimate. We provide convergence guarantees for the algorithm when linear function approximation is employed, and corroborate our theoretical results via simulations.
引用
收藏
页码:2771 / 2776
页数:6
相关论文
共 50 条
  • [1] Multi-Agent Reinforcement Learning in Stochastic Networked Systems
    Lin, Yiheng
    Qu, Guannan
    Huang, Longbo
    Wierman, Adam
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [2] Fully Decentralized Multi-Agent Reinforcement Learning with Networked Agents
    Zhang, Kaiqing
    Yang, Zhuoran
    Liu, Han
    Zhang, Tong
    Basar, Tamer
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [3] Deep Multi-Agent Reinforcement Learning with Discrete-Continuous Hybrid Action Spaces
    Fu, Haotian
    Tang, Hongyao
    Hao, Jianye
    Lei, Zihan
    Chen, Yingfeng
    Fan, Changjie
    [J]. PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 2329 - 2335
  • [4] Decentralized multi-agent reinforcement learning with networked agents: recent advances
    Zhang, Kaiqing
    Yang, Zhuoran
    Basar, Tamer
    [J]. FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2021, 22 (06) : 802 - 814
  • [5] Scalable Multi-Agent Reinforcement Learning for Networked Systems with Average Reward
    Qu, Guannan
    Lin, Yiheng
    Wierman, Adam
    Li, Na
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [6] Value Propagation for Decentralized Networked Deep Multi-agent Reinforcement Learning
    Qu, Chao
    Mannor, Shie
    Xu, Huan
    Qi, Yuan
    Song, Le
    Xiong, Junwu
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [7] Scalable Reinforcement Learning of Localized Policies for Multi-Agent Networked Systems
    Qu, Guannan
    Wierman, Adam
    Li, Na
    [J]. LEARNING FOR DYNAMICS AND CONTROL, VOL 120, 2020, 120 : 256 - 266
  • [8] Multi-Agent Reinforcement Learning
    Stankovic, Milos
    [J]. 2016 13TH SYMPOSIUM ON NEURAL NETWORKS AND APPLICATIONS (NEUREL), 2016, : 43 - 43
  • [9] A Sample Efficient Multi-Agent Approach to Continuous Reinforcement Learning
    Corcoran, Diarmuid
    Kreuger, Per
    Boman, Magnus
    [J]. 2022 18TH INTERNATIONAL CONFERENCE ON NETWORK AND SERVICE MANAGEMENT (CNSM 2022): INTELLIGENT MANAGEMENT OF DISRUPTIVE NETWORK TECHNOLOGIES AND SERVICES, 2022, : 338 - 344
  • [10] A Universal Transcoding and Transmission Method for Livecast with Networked Multi-Agent Reinforcement Learning
    Chen, Xingyan
    Xu, Changqiao
    Wang, Mu
    Wu, Zhonghui
    Yang, Shujie
    Zhong, Lujie
    Muntean, Gabriel-Miro
    [J]. IEEE CONFERENCE ON COMPUTER COMMUNICATIONS (IEEE INFOCOM 2021), 2021,