Multi-Agent Reinforcement Learning Algorithm with Variable Optimistic-Pessimistic Criterion

被引：1

作者：

Akchurina, Natalia ^{[1
]}

机构：

[1] Univ Gesamthsch Paderborn, Int Grad Sch Dynam Intelligent Syst, D-4790 Paderborn, Germany

来源：

ECAI 2008, PROCEEDINGS | 2008年 / 178卷

关键词：

D O I：

10.3233/978-1-58603-891-5-433

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

A reinforcement learning algorithm for multi-agent systems based on variable Hurwicz's optimistic-pessimistic criterion is proposed. The formal proof of its convergence is given. Hurwicz's criterion allows to embed initial knowledge of how friendly the environment in which the agent is supposed to function will be. Thorough testing of the developed algorithm against well-known reinforcement learning algorithms has shown that in many cases its successful performance can be explained by its tendency to force the other agents to follow the policy which is more profitable for it. In addition the variability of Hurwicz's criterion allowed it to converge to best-response against opponents with stationary policies.

引用

页码：433 / +

页数：2

共 50 条

[41] Extended Variable Speed Limit control using Multi-agent Reinforcement Learning
Kusic, Kresimir
Dusparic, Ivana
Gueriau, Maxime
Greguric, Martin
Ivanjko, Edouard
2020 IEEE 23RD INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2020,
[42] MAGNet: Multi-agent Graph Network for Deep Multi-agent Reinforcement Learning
Malysheva, Aleksandra
Kudenko, Daniel
Shpilman, Aleksei
2019 XVI INTERNATIONAL SYMPOSIUM PROBLEMS OF REDUNDANCY IN INFORMATION AND CONTROL SYSTEMS (REDUNDANCY), 2019, : 171 - 176
[43] An evolutionary multi-agent reinforcement learning algorithm for multi-UAV air combat
Wang, Baolai
Gao, Xianzhong
Xie, Tao
KNOWLEDGE-BASED SYSTEMS, 2024, 299
[44] Multi-Agent Packet Routing (MAPR): Co-Operative Packet Routing Algorithm with Multi-Agent Reinforcement Learning
Modi, Aniket
Shah, Rishi
Jain, Krishnanshu
Verma, Rohit
Shorey, Rajeev
Saran, Huzur
2023 15TH INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS & NETWORKS, COMSNETS, 2023,
[45] Greedy Action Selection and Pessimistic Q-Value Updating in Multi-Agent Reinforcement Learning with Sparse Interaction
Kujirai T.
Yokota T.
SICE Journal of Control, Measurement, and System Integration, 2019, 12 (03) : 76 - 84
[46] TEAM POLICY LEARNING FOR MULTI-AGENT REINFORCEMENT LEARNING
Cassano, Lucas
Alghunaim, Sulaiman A.
Sayed, Ali H.
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 3062 - 3066
[47] Aggregation Transfer Learning for Multi-Agent Reinforcement learning
Xu, Dongsheng
Qiao, Peng
Dou, Yong
2021 2ND INTERNATIONAL CONFERENCE ON BIG DATA & ARTIFICIAL INTELLIGENCE & SOFTWARE ENGINEERING (ICBASE 2021), 2021, : 547 - 551
[48] Learning to Communicate with Deep Multi-Agent Reinforcement Learning
Foerster, Jakob N.
Assael, Yannis M.
de Freitas, Nando
Whiteson, Shimon
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
[49] Consensus Learning for Cooperative Multi-Agent Reinforcement Learning
Xu, Zhiwei
Zhang, Bin
Li, Dapeng
Zhang, Zeren
Zhou, Guangchong
Chen, Hao
Fan, Guoliang
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 10, 2023, : 11726 - 11734
[50] Concept Learning for Interpretable Multi-Agent Reinforcement Learning
Zabounidis, Renos
Campbell, Joseph
Stepputtis, Simon
Hughes, Dana
Sycara, Katia
CONFERENCE ON ROBOT LEARNING, VOL 205, 2022, 205 : 1828 - 1837

← 1 2 3 4 5 →