Actor-Critic Algorithms for Constrained Multi-agent Reinforcement Learning

被引：0

作者：

Diddigi, Raghuram Bharadwaj ^{[1
]}

Reddy, D. Sai Koti ^{[2
]}

Prabuchandran, K. J. ^{[3
]}

Bhatnagar, Shalabh ^{[1
]}

机构：

[1] Indian Inst Sci, Bangalore, Karnataka, India

[2] IBM Res, Bangalore, Karnataka, India

[3] Amazon IISc, Bangalore, Karnataka, India

来源：

AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS | 2019年

关键词：

Constrained Reinforcement Learning; Multi-agent Learning; Actor-Critic Algorithms; Cooperative Stochastic Game; FUNCTION APPROXIMATION;

D O I：

暂无

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Multi-agent reinforcement learning has gained lot of popularity primarily owing to the success of deep function approximation architectures. However, many real-life multi-agent applications often impose constraints on the joint action sequence that can be taken by the agents. In this work, we formulate such problems in the framework of constrained cooperative stochastic games. Under this setting, the goal of the agents is to obtain joint action sequence that minimizes a total cost objective criterion subject to total cost penalty/budget functional constraints. To this end, we utilize the Lagrangian formulation and propose actor-critic algorithms. Through experiments on a constrained multi-agent grid world task, we demonstrate that our algorithms converge to near-optimal joint action sequences satisfying the given constraints.

引用

页码：1931 / 1933

页数：3

共 50 条

[1] Multi-Agent Natural Actor-Critic Reinforcement Learning Algorithms
Prashant Trivedi
Nandyala Hemachandra
[J]. Dynamic Games and Applications, 2023, 13 : 25 - 55
[2] Multi-Agent Natural Actor-Critic Reinforcement Learning Algorithms
Trivedi, Prashant
Hemachandra, Nandyala
[J]. DYNAMIC GAMES AND APPLICATIONS, 2023, 13 (01) : 25 - 55
[3] Toward Resilient Multi-Agent Actor-Critic Algorithms for Distributed Reinforcement Learning
Lin, Yixuan
Gade, Shripad
Sandhu, Romeil
Liu, Ji
[J]. 2020 AMERICAN CONTROL CONFERENCE (ACC), 2020, : 3953 - 3958
[4] Shared Experience Actor-Critic for Multi-Agent Reinforcement Learning
Christianos, Filippos
Schafer, Lukas
Albrecht, Stefano V.
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
[5] Distributed Multi-Agent Reinforcement Learning by Actor-Critic Method
Heredia, Paulo C.
Mou, Shaoshuai
[J]. IFAC PAPERSONLINE, 2019, 52 (20): : 363 - 368
[6] A multi-agent reinforcement learning using Actor-Critic methods
Li, Chun-Gui
Wang, Meng
Yuan, Qing-Neng
[J]. PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2008, : 878 - 882
[7] Actor-Critic for Multi-Agent Reinforcement Learning with Self-Attention
Zhao, Juan
Zhu, Tong
Xiao, Shuo
Gao, Zongqian
Sun, Hao
[J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2022, 36 (09)
[8] Multi-agent reinforcement learning by the actor-critic model with an attention interface
Zhang, Lixiang
Li, Jingchen
Zhu, Yi'an
Shi, Haobin
Hwang, Kao-Shing
[J]. NEUROCOMPUTING, 2022, 471 : 275 - 284
[9] Structural relational inference actor-critic for multi-agent reinforcement learning
Zhang, Xianjie
Liu, Yu
Xu, Xiujuan
Huang, Qiong
Mao, Hangyu
Carie, Anil
[J]. NEUROCOMPUTING, 2021, 459 : 383 - 394
[10] Local Advantage Actor-Critic for Robust Multi-Agent Deep Reinforcement Learning
Xiao, Yuchen
Lyu, Xueguang
Amato, Christopher
[J]. 2021 INTERNATIONAL SYMPOSIUM ON MULTI-ROBOT AND MULTI-AGENT SYSTEMS (MRS), 2021, : 155 - 163

← 1 2 3 4 5 →