A Reinforcement Learning Scheme for a Partially-Observable Multi-Agent Game

被引:1
|
作者
Shin Ishii
Hajime Fujita
Masaoki Mitsutake
Tatsuya Yamazaki
Jun Matsuda
Yoichiro Matsuno
机构
[1] CREST,Nara Institute of Science and Technology
[2] Japan Science and Technology Agency,undefined
[3] Nara Institute of Science and Technology,undefined
[4] National Institute of Information and Communications Technology,undefined
[5] Osaka Gakuin University,undefined
[6] Ricoh Co. Ltd.,undefined
来源
Machine Learning | 2005年 / 59卷
关键词
reinforcement learning; POMDP; multi-agent system; card game; model-based;
D O I
暂无
中图分类号
学科分类号
摘要
We formulate an automatic strategy acquisition problem for the multi-agent card game “Hearts” as a reinforcement learning problem. The problem can approximately be dealt with in the framework of a partially observable Markov decision process (POMDP) for a single-agent system. Hearts is an example of imperfect information games, which are more difficult to deal with than perfect information games. A POMDP is a decision problem that includes a process for estimating unobservable state variables. By regarding missing information as unobservable state variables, an imperfect information game can be formulated as a POMDP. However, the game of Hearts is a realistic problem that has a huge number of possible states, even when it is approximated as a single-agent system. Therefore, further approximation is necessary to make the strategy acquisition problem tractable. This article presents an approximation method based on estimating unobservable state variables and predicting the actions of the other agents. Simulation results show that our reinforcement learning method is applicable to such a difficult multi-agent problem.
引用
收藏
页码:31 / 54
页数:23
相关论文
共 50 条
  • [31] Modeling and Algorithms of Multi-agent Reinforcement Learning Using Stochastic Game
    Xie Guangqiang
    Chen Xuesong
    [J]. 2011 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTATION AND INDUSTRIAL APPLICATION (ICIA2011), VOL II, 2011, : 374 - 377
  • [32] Modeling and Algorithms of Multi-agent Reinforcement Learning Using Stochastic Game
    Xie Guangqiang
    Chen Xuesong
    [J]. 2010 THE 3RD INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND INDUSTRIAL APPLICATION (PACIIA2010), VOL VII, 2010, : 375 - 378
  • [33] Cloud game computing offload based on Multi-Agent Reinforcement Learning
    Tian, Kaicong
    Yang, Hongwen
    Liu, Yitong
    Zheng, Qingbi
    [J]. 2022 IEEE 96TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2022-FALL), 2022,
  • [34] Hierarchical reinforcement learning based on multi-agent cooperation game theory
    Tang, Hengliang
    Dong, Chengang
    [J]. International Journal of Wireless and Mobile Computing, 2019, 16 (04): : 369 - 376
  • [35] Multi-Agent Partial Observable Safe Reinforcement Learning for Counter Uncrewed Aerial Systems
    Pierre, Jean-Elie
    Sun, Xiang
    Fierro, Rafael
    [J]. IEEE ACCESS, 2023, 11 : 78192 - 78206
  • [36] Trustable Policy Collaboration Scheme for Multi-Agent Stigmergic Reinforcement Learning
    Xu, Xing
    Li, Rongpeng
    Zhao, Zhifeng
    Zhang, Honggang
    [J]. IEEE COMMUNICATIONS LETTERS, 2022, 26 (04) : 823 - 827
  • [37] Mixed Observable RRT: Multi-Agent Mission-Planning in Partially Observable Environments
    Johansson, Kasper
    Rosolia, Ugo
    Ubellacker, Wyatt
    Singletary, Andrew
    Ames, Aaron D.
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 1386 - 1392
  • [38] Partially decentralized reinforcement learning in finite, multi-agent Markov decision processes
    Tilak, Omkar
    Mukhopadhyay, Snehasis
    [J]. AI COMMUNICATIONS, 2011, 24 (04) : 293 - 309
  • [39] Multi-Agent Cognition Difference Reinforcement Learning for Multi-Agent Cooperation
    Wang, Huimu
    Qiu, Tenghai
    Liu, Zhen
    Pu, Zhiqiang
    Yi, Jianqiang
    Yuan, Wanmai
    [J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [40] Multi-Agent Uncertainty Sharing for Cooperative Multi-Agent Reinforcement Learning
    Chen, Hao
    Yang, Guangkai
    Zhang, Junge
    Yin, Qiyue
    Huang, Kaiqi
    [J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,