Multi-Agent Reinforcement Learning Algorithm with Variable Optimistic-Pessimistic Criterion

被引:1
|
作者
Akchurina, Natalia [1 ]
机构
[1] Univ Gesamthsch Paderborn, Int Grad Sch Dynam Intelligent Syst, D-4790 Paderborn, Germany
来源
ECAI 2008, PROCEEDINGS | 2008年 / 178卷
关键词
D O I
10.3233/978-1-58603-891-5-433
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A reinforcement learning algorithm for multi-agent systems based on variable Hurwicz's optimistic-pessimistic criterion is proposed. The formal proof of its convergence is given. Hurwicz's criterion allows to embed initial knowledge of how friendly the environment in which the agent is supposed to function will be. Thorough testing of the developed algorithm against well-known reinforcement learning algorithms has shown that in many cases its successful performance can be explained by its tendency to force the other agents to follow the policy which is more profitable for it. In addition the variability of Hurwicz's criterion allowed it to converge to best-response against opponents with stationary policies.
引用
收藏
页码:433 / +
页数:2
相关论文
共 50 条
  • [31] Multi-agent reinforcement learning algorithm to solve a partially-observable multi-agent problem in disaster response
    Lee, Hyun-Rok
    Lee, Taesik
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2021, 291 (01) : 296 - 308
  • [32] A Multi-Agent Reinforcement Learning Algorithm for Disambiguation in a Spoken Dialogue System
    Wang, Fangju
    INTERNATIONAL CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI 2010), 2010, : 116 - 123
  • [33] GAMA: Graph Attention Multi-agent reinforcement learning algorithm for cooperation
    Chen, Haoqiang
    Liu, Yadong
    Zhou, Zongtan
    Hu, Dewen
    Zhang, Ming
    APPLIED INTELLIGENCE, 2020, 50 (12) : 4195 - 4205
  • [34] Improving reinforcement learning algorithm using emotions in a multi-agent system
    Daneshvar, R
    Lucas, C
    INTELLIGENT VIRTUAL AGENTS, 2003, 2792 : 361 - 362
  • [35] A two-layered multi-agent reinforcement learning model and algorithm
    Wang, Ben-Nian
    Gao, Yang
    Chen, Zhao-Qian
    Xie, Jun-Yuan
    Chen, Shi-Fu
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2007, 30 (04) : 1366 - 1376
  • [36] Multi-agent reinforcement learning clustering algorithm based on silhouette coefficient
    Du, Peng
    Li, Fenglian
    Shao, Jianli
    NEUROCOMPUTING, 2024, 596
  • [37] GAMA: Graph Attention Multi-agent reinforcement learning algorithm for cooperation
    Haoqiang Chen
    Yadong Liu
    Zongtan Zhou
    Dewen Hu
    Ming Zhang
    Applied Intelligence, 2020, 50 : 4195 - 4205
  • [38] Study on learning algorithm of transfer reinforcement for multi-agent formation control
    Hu P.
    Pan Q.
    Guo Y.
    Zhao C.
    Xibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University, 2023, 41 (02): : 389 - 399
  • [39] Multi-agent Reinforcement Learning Based on K-Means Algorithm
    Liu Changan
    Liu Fei
    Liu Chunyang
    Wu Hua
    CHINESE JOURNAL OF ELECTRONICS, 2011, 20 (03): : 414 - 418
  • [40] DOPE: Doubly Optimistic and Pessimistic Exploration for Safe Reinforcement Learning
    Bura, Archana
    HasanzadeZonuzy, Aria
    Kalathil, Dileep
    Shakkottai, Srinivas
    Chamberland, Jean-Francois
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,