TEAM POLICY LEARNING FOR MULTI-AGENT REINFORCEMENT LEARNING

被引:0
|
作者
Cassano, Lucas [1 ,2 ]
Alghunaim, Sulaiman A. [1 ,2 ]
Sayed, Ali H. [2 ]
机构
[1] Univ Calif Los Angeles, Dept Elect & Comp Engn, Los Angeles, CA 90024 USA
[2] Ecole Polytech Fed Lausanne, Sch Engn, Lausanne, Switzerland
关键词
Reinforcement learning; multi-agent learning; off-policy; optimal policy; distributed algorithm;
D O I
10.1109/icassp.2019.8683168
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This work presents a fully distributed algorithm for learning the optimal policy in a multi-agent cooperative reinforcement learning scenario. We focus on games that can only be solved through coordinated team work. We consider situations in which K players interact simultaneously with an environment and with each other to attain a common goal. In the algorithm, agents only communicate with other agents in their immediate neighborhood and choose their actions independently of one another based only on local information. Learning is done off-policy, which results in high data efficiency. The proposed algorithm is of the stochastic primal-dual kind and can be shown to converge even when used in conjunction with a wide class of function approximators.
引用
收藏
页码:3062 / 3066
页数:5
相关论文
共 50 条
  • [1] Uncertainty modified policy for multi-agent reinforcement learning
    Zhao, Xinyu
    Liu, Jianxiang
    Wu, Faguo
    Zhang, Xiao
    Wang, Guojian
    [J]. APPLIED INTELLIGENCE, 2024, 54 (22) : 12020 - 12034
  • [2] Multi-Agent Reinforcement Learning
    Stankovic, Milos
    [J]. 2016 13TH SYMPOSIUM ON NEURAL NETWORKS AND APPLICATIONS (NEUREL), 2016, : 43 - 43
  • [3] Learning to Share in Multi-Agent Reinforcement Learning
    Yi, Yuxuan
    Li, Ge
    Wang, Yaowei
    Lu, Zongqing
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [4] Learning Distributed Coordinated Policy in Catching Game with Multi-Agent Reinforcement Learning
    Liu, Xiangyu
    Tan, Ying
    [J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [5] Certified Policy Smoothing for Cooperative Multi-Agent Reinforcement Learning
    Mu, Ronghui
    Ruan, Wenjie
    Marcolino, Leandro Soriano
    Jin, Gaojie
    Ni, Qiang
    [J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 12, 2023, : 15046 - 15054
  • [6] Multi-Agent Reinforcement Learning for Problems with Combined Individual and Team Reward
    Sheikh, Hassam Ullah
    Boloni, Ladislau
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [7] Team-wise effective communication in multi-agent reinforcement learning
    Yang, Ming
    Zhao, Kaiyan
    Wang, Yiming
    Dong, Renzhi
    Du, Yali
    Liu, Furui
    Zhou, Mingliang
    Hou, U. Leong
    [J]. AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2024, 38 (02)
  • [8] Non-Stationary Policy Learning for Multi-Timescale Multi-Agent Reinforcement Learning
    Emami, Patrick
    Zhang, Xiangyu
    Biagioni, David
    Zamzam, Ahmed S.
    [J]. 2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 2372 - 2378
  • [9] Hierarchical multi-agent reinforcement learning
    Mohammad Ghavamzadeh
    Sridhar Mahadevan
    Rajbala Makar
    [J]. Autonomous Agents and Multi-Agent Systems, 2006, 13 : 197 - 229
  • [10] Multi-agent reinforcement learning: A survey
    Busoniu, Lucian
    Babuska, Robert
    De Schutter, Bart
    [J]. 2006 9TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION, VOLS 1- 5, 2006, : 1133 - +