A Reinforcement Learning Scheme for a Partially-Observable Multi-Agent Game

Authors
Shin Ishii
Hajime Fujita
Masaoki Mitsutake
Tatsuya Yamazaki
Jun Matsuda
Yoichiro Matsuno
Affiliations
[1] CREST, Nara Institute of Science and Technology
[2] Japan Science and Technology Agency
[3] Nara Institute of Science and Technology
[4] National Institute of Information and Communications Technology
[5] Osaka Gakuin University
[6] Ricoh Co. Ltd.
Source
Machine Learning | 2005 / Vol. 59
Keywords
reinforcement learning; POMDP; multi-agent system; card game; model-based
Abstract
We formulate an automatic strategy acquisition problem for the multi-agent card game “Hearts” as a reinforcement learning problem. The problem can approximately be dealt with in the framework of a partially observable Markov decision process (POMDP) for a single-agent system. Hearts is an example of imperfect information games, which are more difficult to deal with than perfect information games. A POMDP is a decision problem that includes a process for estimating unobservable state variables. By regarding missing information as unobservable state variables, an imperfect information game can be formulated as a POMDP. However, the game of Hearts is a realistic problem that has a huge number of possible states, even when it is approximated as a single-agent system. Therefore, further approximation is necessary to make the strategy acquisition problem tractable. This article presents an approximation method based on estimating unobservable state variables and predicting the actions of the other agents. Simulation results show that our reinforcement learning method is applicable to such a difficult multi-agent problem.
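The abstract's idea of treating missing information as unobservable state variables can be illustrated with a generic Bayesian belief update. The following is a minimal sketch of that textbook step only, not the approximation method the article presents; the function name `update_belief`, the seat labels, and the likelihood values are all hypothetical and chosen purely for illustration.

```python
from typing import Dict


def update_belief(belief: Dict[str, float],
                  likelihood: Dict[str, float]) -> Dict[str, float]:
    """Generic Bayes update over a hidden variable.

    posterior(s) is proportional to likelihood(s) * belief(s), where
    likelihood(s) is the probability of the observed play if hypothesis s
    were true. Hypotheses missing from `likelihood` are treated as
    impossible (likelihood 0).
    """
    unnormalized = {s: p * likelihood.get(s, 0.0) for s, p in belief.items()}
    total = sum(unnormalized.values())
    if total == 0.0:
        raise ValueError("observation has zero probability under the belief")
    return {s: p / total for s, p in unnormalized.items()}


# Toy question (hypothetical): which opponent holds a particular unseen card?
prior = {"west": 1 / 3, "north": 1 / 3, "east": 1 / 3}

# Assumed, made-up probabilities of the play we just observed under each
# hypothesis; a real system would obtain these from a model of the opponents.
likelihood = {"west": 0.5, "north": 0.25, "east": 0.25}

posterior = update_belief(prior, likelihood)
```

In this toy setting the belief shifts toward the hypothesis that best explains the observed play. In a full game the hidden state is the complete allocation of unseen cards, which is far too large to enumerate this way and is why the abstract calls for further approximation.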
Pages: 31-54
Page count: 23
Related papers
  • [1] A reinforcement learning scheme for a partially-observable multi-agent game
    Ishii, S
    Fujita, H
    Mitsutake, M
    Yamazaki, T
    Matsuda, J
    Matsuno, Y
    [J]. MACHINE LEARNING, 2005, 59 (1-2) : 31 - 54
  • [2] Multi-agent reinforcement learning algorithm to solve a partially-observable multi-agent problem in disaster response
    Lee, Hyun-Rok
    Lee, Taesik
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2021, 291 (01) : 296 - 308
  • [3] CHQ: A multi-agent reinforcement learning scheme for partially observable Markov decision processes
    Osada, H
    Fujita, S
    [J]. IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON INTELLIGENT AGENT TECHNOLOGY, PROCEEDINGS, 2004, : 17 - 23
  • [4] CHQ: A multi-agent reinforcement learning scheme for partially observable Markov decision processes
    Osada, H
    Fujita, S
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2005, E88D (05): : 1004 - 1011
  • [5] Reinforcement learning for cooperative actions in a partially observable multi-agent system
    Taniguchi, Yuki
    Mori, Takeshi
    Ishii, Shin
    [J]. ARTIFICIAL NEURAL NETWORKS - ICANN 2007, PT 1, PROCEEDINGS, 2007, 4668 : 229 - +
  • [6] A reinforcement learning scheme for a multi-agent card game
    Fujita, H
    Matsuno, Y
    Ishii, S
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOLS 1-5, CONFERENCE PROCEEDINGS, 2003, : 4071 - 4078
  • [7] Partially Observable Multi-Agent Deep Reinforcement Learning for Cognitive Resource Management
    Yang, Ning
    Zhang, Haijun
    Berry, Randall
    [J]. 2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,
  • [8] Information State Embedding in Partially Observable Cooperative Multi-Agent Reinforcement Learning
    Mao, Weichao
    Zhang, Kaiqing
    Miehling, Erik
    Basar, Tamer
    [J]. 2020 59TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2020, : 6124 - 6131
  • [9] Bayesian Nonparametric Methods for Partially-Observable Reinforcement Learning
    Doshi-Velez, Finale
    Pfau, David
    Wood, Frank
    Roy, Nicholas
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (02) : 394 - 407
  • [10] Periodic Communication for Distributed Multi-agent Reinforcement Learning under Partially Observable Environment
    Kim, Seonghyun
    Lee, Donghun
    Jang, Ingook
    Kim, Hyunseok
    Son, Youngsung
    [J]. 2019 10TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY CONVERGENCE (ICTC): ICT CONVERGENCE LEADING THE AUTONOMOUS FUTURE, 2019, : 940 - 942