Distributed reinforcement learning in multi-agent networks

Cited by: 0
Authors
Kar, Soummya [1]
Moura, Jose M. F. [1]
Poor, H. Vincent [2]
Affiliations
[1] Carnegie Mellon Univ, Dept ECE, Pittsburgh, PA 15213 USA
[2] Princeton Univ, Dept EE, Princeton, NJ 08544 USA
Funding
National Science Foundation (USA)
Keywords
Multi-agent stochastic control; distributed Q-learning; reinforcement learning; collaborative network processing; consensus plus innovations; distributed stochastic approximation;
DOI
Not available
Chinese Library Classification
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Distributed reinforcement learning algorithms for collaborative multi-agent Markov decision processes (MDPs) are presented and analyzed. The networked setup consists of a collection of agents (learners) which respond differently (depending on their instantaneous one-stage random costs) to a global controlled state and the control actions of a remote controller. With the objective of jointly learning the optimal stationary control policy (in the absence of global state transition and local agent cost statistics) that minimizes network-averaged infinite horizon discounted cost, the paper presents distributed variants of Q-learning of the consensus + innovations type in which each agent sequentially refines its learning parameters by locally processing its instantaneous payoff data and the information received from neighboring agents. Under broad conditions on the multi-agent decision model and mean connectivity of the inter-agent communication network, the proposed distributed algorithms are shown to achieve optimal learning asymptotically, i.e., almost surely (a.s.) each network agent is shown to learn the value function and the optimal stationary control policy of the collaborative MDP asymptotically. Further, convergence rate estimates for the proposed class of distributed learning algorithms are obtained.
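The abstract does not state the update rule itself. As a rough illustration only, a consensus + innovations Q-learning step is commonly written with two terms: a consensus term that pulls an agent's estimate toward its neighbors' estimates, and an innovation term that is the usual Q-learning temporal-difference correction computed from the agent's own instantaneous cost. The Python sketch below assumes that form. The ring network, the random transition kernel, the uniformly random exploration policy, the step-size constants, and the names `qd_update` and `simulate` are all hypothetical choices made for this example; they are not taken from the paper.

```python
import random


def qd_update(q_self, q_neighbors, cost, q_next_min, alpha, beta, gamma):
    """One consensus + innovations step for a single (state, action) entry.

    Assumed form (not quoted from the paper):
      consensus  = sum over neighbors of (own estimate - neighbor estimate)
      innovation = local cost + gamma * min_v Q(next_state, v) - own estimate
    """
    consensus = sum(q_self - q_l for q_l in q_neighbors)
    innovation = cost + gamma * q_next_min - q_self
    return q_self - beta * consensus + alpha * innovation


def simulate(num_agents=4, num_states=3, num_actions=2,
             steps=2000, gamma=0.9, seed=0):
    """Toy run on a ring of agents (needs num_agents >= 3 for distinct neighbors)."""
    rng = random.Random(seed)
    # Hypothetical transition weights; rng.choices accepts unnormalised weights.
    weights = [[[rng.random() for _ in range(num_states)]
                for _ in range(num_actions)] for _ in range(num_states)]
    # Each agent sees a different mean cost for the same global (state, action).
    mean_cost = [[[rng.random() for _ in range(num_actions)]
                  for _ in range(num_states)] for _ in range(num_agents)]
    Q = [[[0.0] * num_actions for _ in range(num_states)]
         for _ in range(num_agents)]
    visits = [[0] * num_actions for _ in range(num_states)]
    state = 0
    for _ in range(steps):
        action = rng.randrange(num_actions)  # remote controller, random here
        next_state = rng.choices(range(num_states),
                                 weights=weights[state][action])[0]
        k = visits[state][action]
        alpha = 0.5 / (k + 1)        # innovation weight, decays faster
        beta = 0.2 / (k + 1) ** 0.3  # consensus weight, decays more slowly
        # Snapshot so every agent uses its neighbors' pre-update values.
        old = [Q[n][state][action] for n in range(num_agents)]
        for n in range(num_agents):
            neighbors = [old[(n - 1) % num_agents], old[(n + 1) % num_agents]]
            cost = mean_cost[n][state][action] + rng.gauss(0.0, 0.1)
            Q[n][state][action] = qd_update(
                old[n], neighbors, cost, min(Q[n][next_state]),
                alpha, beta, gamma)
        visits[state][action] += 1
        state = next_state
    return Q
```

Calling `simulate()` returns one Q-table per agent. In this sketch the consensus weight is chosen to decay more slowly than the innovation weight, so that agreement among neighbors is not swamped by local cost noise; the specific exponents are illustrative and are not the conditions analyzed in the paper.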
Pages: 296 / +
Page count: 2
Related papers
50 items in total
  • [21] Multi-Agent Deep Reinforcement Learning for Distributed Resource Management in Wirelessly Powered Communication Networks
    Hwang, Sangwon
    Kim, Hanjin
    Lee, Hoon
    Lee, Inkyu
    IEEE Transactions on Vehicular Technology, 2020, 69 (11) : 14055 - 14060
  • [23] Distributed Traffic Engineering in Hybrid Software Defined Networks: A Multi-Agent Reinforcement Learning Framework
    Guo, Yingya
    Lin, Bin
    Tang, Qi
    Ma, Yulong
    Luo, Huan
    Tian, Han
    Chen, Kai
    IEEE Transactions on Network and Service Management, 2024, 21 (06) : 6759 - 6769
  • [24] Transform networks for cooperative multi-agent deep reinforcement learning
    Wang, Hongbin
    Xie, Xiaodong
    Zhou, Lianke
    Applied Intelligence, 2023, 53 (08) : 9261 - 9269
  • [26] Multi-Agent Reinforcement Learning for Spectrum Sharing in Vehicular Networks
    Liang, Le
    Ye, Hao
    Li, Geoffrey Ye
    2019 IEEE 20TH INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING ADVANCES IN WIRELESS COMMUNICATIONS (SPAWC 2019), 2019,
  • [27] Multi-agent reinforcement learning algorithm based on neural networks
    Tang, Lianggui
    Yang, Hu
    An, Bo
    Cheng, Daijie
    DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES B-APPLICATIONS & ALGORITHMS, 2006, 13E : 1569 - 1574
  • [28] A Survey on Multi-Agent Reinforcement Learning Methods for Vehicular Networks
    Althamary, Ibrahim
    Huang, Chih-Wei
    Lin, Phone
    2019 15TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE (IWCMC), 2019, : 1154 - 1159
  • [29] Efficient Communications for Multi-Agent Reinforcement Learning in Wireless Networks
    Lv, Zefang
    Du, Yousong
    Chen, Yifan
    Xiao, Liang
    Han, Shuai
    Ji, Xiangyang
    IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 583 - 588
  • [30] Learning Distributed Coordinated Policy in Catching Game with Multi-Agent Reinforcement Learning
    Liu, Xiangyu
    Tan, Ying
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,