Distributed reinforcement learning in multi-agent networks

Cited by: 0
Authors
Kar, Soummya [1]
Moura, Jose M. F. [1]
Poor, H. Vincent [2]
Affiliations
[1] Carnegie Mellon Univ, Dept ECE, Pittsburgh, PA 15213 USA
[2] Princeton Univ, Dept EE, Princeton, NJ 08544 USA
Funding
National Science Foundation (USA);
Keywords
Multi-agent stochastic control; distributed Q-learning; reinforcement learning; collaborative network processing; consensus plus innovations; distributed stochastic approximation;
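DOI
Not available
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract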
Distributed reinforcement learning algorithms for collaborative multi-agent Markov decision processes (MDPs) are presented and analyzed. The networked setup consists of a collection of agents (learners) that respond differently, through their instantaneous one-stage random costs, to a global controlled state and to the control actions of a remote controller. The objective is to jointly learn, in the absence of global state transition statistics and local agent cost statistics, the optimal stationary control policy that minimizes the network-averaged infinite-horizon discounted cost. To this end, the paper presents distributed variants of Q-learning of the consensus + innovations type, in which each agent sequentially refines its learning parameters by locally processing its instantaneous payoff data and the information received from neighboring agents. Under broad conditions on the multi-agent decision model and on the mean connectivity of the inter-agent communication network, the proposed distributed algorithms are shown to achieve optimal learning asymptotically, i.e., almost surely (a.s.) each network agent learns the value function and the optimal stationary control policy of the collaborative MDP asymptotically. Further, convergence rate estimates for the proposed class of distributed learning algorithms are obtained.
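The consensus + innovations idea described in the abstract can be illustrated with a minimal tabular sketch. It is not taken from the paper: the function names, the line graph, the transition probabilities, the mean costs, and the step-size exponents are all hypothetical, and the step sizes are indexed by a global time counter for simplicity. Each agent corrects its Q-value for the visited state-action pair with a consensus term (its disagreement with neighbors) and an innovation term (a local temporal-difference error built from its own observed cost).

```python
import copy
import random


def qd_learning_step(Q, neighbors, x, u, x_next, costs, alpha, beta, gamma):
    """One synchronous consensus + innovations update for the visited pair (x, u).

    Q[n][s][a] is agent n's Q-value; costs[n] is agent n's observed one-stage cost.
    """
    new_Q = copy.deepcopy(Q)
    for n in range(len(Q)):
        # Consensus term: disagreement with neighbors on the visited entry.
        consensus = sum(Q[n][x][u] - Q[l][x][u] for l in neighbors[n])
        # Innovation term: local temporal-difference error (costs are minimized).
        innovation = costs[n] + gamma * min(Q[n][x_next]) - Q[n][x][u]
        new_Q[n][x][u] = Q[n][x][u] - beta * consensus + alpha * innovation
    return new_Q


def run_demo(steps=2000, seed=0):
    """Toy run on a hypothetical 2-state, 2-action MDP with 3 agents."""
    rng = random.Random(seed)
    n_agents, n_states, n_actions = 3, 2, 2
    neighbors = {0: [1], 1: [0, 2], 2: [1]}  # assumed line graph
    # p_to_state1[x][u]: probability of moving to state 1 (assumed numbers)
    p_to_state1 = [[0.2, 0.8], [0.5, 0.9]]
    # mean_cost[n][x][u]: agent-specific mean one-stage costs (assumed numbers)
    mean_cost = [
        [[1.0, 2.0], [0.5, 1.5]],
        [[2.0, 0.0], [1.0, 1.0]],
        [[0.0, 1.0], [3.0, 0.5]],
    ]
    Q = [[[0.0] * n_actions for _ in range(n_states)] for _ in range(n_agents)]
    x = 0
    for t in range(steps):
        u = rng.randrange(n_actions)  # remote controller explores uniformly
        x_next = 1 if rng.random() < p_to_state1[x][u] else 0
        costs = [mean_cost[n][x][u] + rng.gauss(0.0, 0.1) for n in range(n_agents)]
        alpha = 1.0 / (t + 1) ** 0.8  # innovation weight (assumed schedule)
        beta = 0.3 / (t + 1) ** 0.3   # consensus weight, decaying more slowly
        Q = qd_learning_step(Q, neighbors, x, u, x_next, costs, alpha, beta, 0.9)
        x = x_next
    return Q
```

Calling `run_demo()` returns one Q-table per agent. In this sketch the consensus weight decays more slowly than the innovation weight so that neighbor agreement dominates over time; that choice is an assumption made for illustration, not a statement of the conditions proved in the paper.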
Pages: 296 / +
Number of pages: 2