Multi-Agent Reinforcement Learning Algorithm with Variable Optimistic-Pessimistic Criterion

被引：1

作者：

Akchurina, Natalia ^{[1
]}

机构：

[1] Univ Gesamthsch Paderborn, Int Grad Sch Dynam Intelligent Syst, D-4790 Paderborn, Germany

来源：

ECAI 2008, PROCEEDINGS | 2008年 / 178卷

关键词：

D O I：

10.3233/978-1-58603-891-5-433

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

A reinforcement learning algorithm for multi-agent systems based on variable Hurwicz's optimistic-pessimistic criterion is proposed. The formal proof of its convergence is given. Hurwicz's criterion allows to embed initial knowledge of how friendly the environment in which the agent is supposed to function will be. Thorough testing of the developed algorithm against well-known reinforcement learning algorithms has shown that in many cases its successful performance can be explained by its tendency to force the other agents to follow the policy which is more profitable for it. In addition the variability of Hurwicz's criterion allowed it to converge to best-response against opponents with stationary policies.

引用

页码：433 / +

页数：2

共 50 条

[31] Multi-agent reinforcement learning algorithm to solve a partially-observable multi-agent problem in disaster response
Lee, Hyun-Rok
Lee, Taesik
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2021, 291 (01) : 296 - 308
[32] A Multi-Agent Reinforcement Learning Algorithm for Disambiguation in a Spoken Dialogue System
Wang, Fangju
INTERNATIONAL CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI 2010), 2010, : 116 - 123
[33] GAMA: Graph Attention Multi-agent reinforcement learning algorithm for cooperation
Chen, Haoqiang
Liu, Yadong
Zhou, Zongtan
Hu, Dewen
Zhang, Ming
APPLIED INTELLIGENCE, 2020, 50 (12) : 4195 - 4205
[34] Improving reinforcement learning algorithm using emotions in a multi-agent system
Daneshvar, R
Lucas, C
INTELLIGENT VIRTUAL AGENTS, 2003, 2792 : 361 - 362
[35] A two-layered multi-agent reinforcement learning model and algorithm
Wang, Ben-Nian
Gao, Yang
Chen, Zhao-Qian
Xie, Jun-Yuan
Chen, Shi-Fu
JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2007, 30 (04) : 1366 - 1376
[36] Multi-agent reinforcement learning clustering algorithm based on silhouette coefficient
Du, Peng
Li, Fenglian
Shao, Jianli
NEUROCOMPUTING, 2024, 596
[37] GAMA: Graph Attention Multi-agent reinforcement learning algorithm for cooperation
Haoqiang Chen
Yadong Liu
Zongtan Zhou
Dewen Hu
Ming Zhang
Applied Intelligence, 2020, 50 : 4195 - 4205
[38] Study on learning algorithm of transfer reinforcement for multi-agent formation control
Hu P.
Pan Q.
Guo Y.
Zhao C.
Xibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University, 2023, 41 (02): : 389 - 399
[39] Multi-agent Reinforcement Learning Based on K-Means Algorithm
Liu Changan
Liu Fei
Liu Chunyang
Wu Hua
CHINESE JOURNAL OF ELECTRONICS, 2011, 20 (03): : 414 - 418
[40] DOPE: Doubly Optimistic and Pessimistic Exploration for Safe Reinforcement Learning
Bura, Archana
HasanzadeZonuzy, Aria
Kalathil, Dileep
Shakkottai, Srinivas
Chamberland, Jean-Francois
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,

← 1 2 3 4 5 →