A New Advantage Actor-Critic Algorithm For Multi-Agent Environments

被引:1
|
作者
Paczolay, Gabor [1 ]
Harmati, Istvan [1 ]
机构
[1] Budapest Univ Technol & Econ, Dept Control Engn, Budapest, Hungary
关键词
reinforcement learning; multiagent learning;
D O I
10.1109/ismcr51255.2020.9263738
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Reinforcement learning is one of the most researched fields of artificial intelligence right now. Newer and newer algorithms are being developed, especially for deep reinforcement learning, where the selected action is computed with the assist of a neural network. One of the subcategories of reinforcement learning is multi-agent reinforcement learning, where multiple agents are present in the world. In our paper, we modify an already existing algorithm, the Advantage Actor-Critic (A2C) to be suitable for multi-agent scenarios. Afterwards, we test the modified algorithm on our testbed, a cooperative-competitive pursuit-evasion environment.
引用
收藏
页数:6
相关论文
共 50 条
  • [41] Multi-agent Actor-Critic Reinforcement Learning Based In-network Load Balance
    Mai, Tianle
    Yao, Haipeng
    Xiong, Zehui
    Guo, Song
    Niyato, Dusit Tao
    [J]. 2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,
  • [42] Workflow scheduling based on asynchronous advantage actor-critic algorithm in multi-cloud environment
    Tang, Xuhao
    Liu, Fagui
    Wang, Bin
    Xu, Dishi
    Jiang, Jun
    Wu, Qingbo
    Chen, C. L. Philip
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 258
  • [43] A Hessian Actor-Critic Algorithm
    Wang, Jing
    Paschalidis, Ioannis Ch
    [J]. 2014 IEEE 53RD ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2014, : 1131 - 1136
  • [44] An Advantage Actor-Critic Algorithm with Confidence Exploration for Open Information Extraction
    Liu, Guiliang
    Li, Xu
    Sun, Miningming
    Li, Ping
    [J]. PROCEEDINGS OF THE 2020 SIAM INTERNATIONAL CONFERENCE ON DATA MINING (SDM), 2020, : 217 - 225
  • [45] Research on load frequency control of multi-microgrids in an isolated system based on the multi-agent soft actor-critic algorithm
    Xie, Li Long
    Li, Yonghui
    Fan, Peixiao
    Wan, Li
    Zhang, Kanjun
    [J]. IET RENEWABLE POWER GENERATION, 2024, 18 (07) : 1230 - 1246
  • [46] Advantage Actor-Critic for Autonomous Intersection Management
    Ayeelyan, John
    Lee, Guan-Hung
    Hsu, Hsiu-Chun
    Hsiung, Pao-Ann
    [J]. VEHICLES, 2022, 4 (04): : 1391 - 1412
  • [47] An Actor-Critic Algorithm With Second-Order Actor and Critic
    Wang, Jing
    Paschalidis, Ioannis Ch.
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2017, 62 (06) : 2689 - 2703
  • [48] Adaptive Advantage Estimation for Actor-Critic Algorithms
    Chen, Yurou
    Zhang, Fengyi
    Liu, Zhiyong
    [J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [49] Supervised Advantage Actor-Critic for Recommender Systems
    Xin, Xin
    Karatzoglou, Alexandros
    Arapakis, Ioannis
    Jose, Joemon M.
    [J]. WSDM'22: PROCEEDINGS OF THE FIFTEENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2022, : 1186 - 1196
  • [50] Factored Multi-Agent Soft Actor-Critic for Cooperative Multi-Target Tracking of UAV Swarms
    Yue, Longfei
    Yang, Rennong
    Zuo, Jialiang
    Yan, Mengda
    Zhao, Xiaoru
    Lv, Maolong
    [J]. DRONES, 2023, 7 (03)