Periodic Communication for Distributed Multi-agent Reinforcement Learning under Partially Observable Environment

被引:1
|
作者
Kim, Seonghyun [1 ]
Lee, Donghun [1 ]
Jang, Ingook [1 ]
Kim, Hyunseok [1 ]
Son, Youngsung [1 ]
机构
[1] Elect & Telecommun Res Inst, Smart ICT Convergence Res Dept, Daejeon, South Korea
关键词
multi-agent reinforcement learning; partially observable Markov decision process; periodic communication;
D O I
10.1109/ictc46691.2019.8939754
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In area of the reinforcement learning, an environment is important because when a well-known reinforcement learning technique for an environment is applied to another environment, it does not guarantee whether the technique also works well or not. To apply reinforcement learning techniques to real world environments, a partial observation condition is one of important issues. In addition, a communication is also important for a multi-agent setting under the partial observation condition. In this paper, a periodic communication for distributed multi-agent reinforcement learning is studied. Via the periodic communication, each agent is able to share an auxiliary observation which is a compressed version of an observation. After sharing auxiliary observations, each agent makes a decision based on own observation and other's shared observation. However, due to the periodicity of a communication, an absence of sharing auxiliary observation occurs for a non-communication phase. Thus, it is necessary to consider how to compensate the absence of shared auxiliary observations within non-communication phase. To this end, several methods are proposed to predict auxiliary observations of other agents. In simulation results, it is shown that an affection of the partial observation condition and performances according to the methods of compensation of auxiliary observations.
引用
收藏
页码:940 / 942
页数:3
相关论文
共 50 条
  • [1] Multi-Agent Reinforcement Learning With Distributed Targeted Multi-Agent Communication
    Xu, Chi
    Zhang, Hui
    Zhang, Ya
    [J]. 2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 2915 - 2920
  • [2] A reinforcement learning scheme for a partially-observable multi-agent game
    Ishii, S
    Fujita, H
    Mitsutake, M
    Yamazaki, T
    Matsuda, J
    Matsuno, Y
    [J]. MACHINE LEARNING, 2005, 59 (1-2) : 31 - 54
  • [3] Reinforcement learning for cooperative actions in a partially observable multi-agent system
    Taniguchi, Yuki
    Mori, Takeshi
    Ishii, Shin
    [J]. ARTIFICIAL NEURAL NETWORKS - ICANN 2007, PT 1, PROCEEDINGS, 2007, 4668 : 229 - +
  • [4] A Reinforcement Learning Scheme for a Partially-Observable Multi-Agent Game
    Shin Ishii
    Hajime Fujita
    Masaoki Mitsutake
    Tatsuya Yamazaki
    Jun Matsuda
    Yoichiro Matsuno
    [J]. Machine Learning, 2005, 59 : 31 - 54
  • [5] Partially Observable Multi-Agent Deep Reinforcement Learning for Cognitive Resource Management
    Yang, Ning
    Zhang, Haijun
    Berry, Randall
    [J]. 2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,
  • [6] Information State Embedding in Partially Observable Cooperative Multi-Agent Reinforcement Learning
    Mao, Weichao
    Zhang, Kaiqing
    Miehling, Erik
    Basar, Tamer
    [J]. 2020 59TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2020, : 6124 - 6131
  • [7] Multi-agent reinforcement learning algorithm to solve a partially-observable multi-agent problem in disaster response
    Lee, Hyun-Rok
    Lee, Taesik
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2021, 291 (01) : 296 - 308
  • [8] CHQ: A multi-agent reinforcement learning scheme for partially observable Markov decision processes
    Osada, H
    Fujita, S
    [J]. IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON INTELLIGENT AGENT TECHNOLOGY, PROCEEDINGS, 2004, : 17 - 23
  • [9] CHQ: A multi-agent reinforcement learning scheme for partially observable Markov decision processes
    Osada, H
    Fujita, S
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2005, E88D (05): : 1004 - 1011
  • [10] Parallel and distributed multi-agent reinforcement learning
    Kaya, M
    Arslan, A
    [J]. PROCEEDINGS OF THE EIGHTH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS, 2001, : 437 - 441