Reinforcement learning for cooperative actions in a partially observable multi-agent system

被引:0
|
作者
Taniguchi, Yuki [1 ]
Mori, Takeshi [1 ]
Ishii, Shin [1 ]
机构
[1] NAIST, Grad Sch Informat Sci, Takayama, Ikoma 6300192, Japan
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this article, we apply a policy gradient-based reinforcement learning to allowing multiple agents to perform cooperative actions in a partially observable environment. We introduce an auxiliary state variable, an internal state, whose stochastic process is Markov, for extracting important features of multi-agent's dynamics. Computer simulations show that every agent can identify an appropriate internal state model and acquire a good policy; this approach is shown to be more effective than a traditional memory-based method.
引用
收藏
页码:229 / +
页数:2
相关论文
共 50 条
  • [1] Information State Embedding in Partially Observable Cooperative Multi-Agent Reinforcement Learning
    Mao, Weichao
    Zhang, Kaiqing
    Miehling, Erik
    Basar, Tamer
    [J]. 2020 59TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2020, : 6124 - 6131
  • [2] A reinforcement learning scheme for a partially-observable multi-agent game
    Ishii, S
    Fujita, H
    Mitsutake, M
    Yamazaki, T
    Matsuda, J
    Matsuno, Y
    [J]. MACHINE LEARNING, 2005, 59 (1-2) : 31 - 54
  • [3] A Reinforcement Learning Scheme for a Partially-Observable Multi-Agent Game
    Shin Ishii
    Hajime Fujita
    Masaoki Mitsutake
    Tatsuya Yamazaki
    Jun Matsuda
    Yoichiro Matsuno
    [J]. Machine Learning, 2005, 59 : 31 - 54
  • [4] Partially Observable Multi-Agent Deep Reinforcement Learning for Cognitive Resource Management
    Yang, Ning
    Zhang, Haijun
    Berry, Randall
    [J]. 2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,
  • [5] Multi-agent reinforcement learning algorithm to solve a partially-observable multi-agent problem in disaster response
    Lee, Hyun-Rok
    Lee, Taesik
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2021, 291 (01) : 296 - 308
  • [6] The Cooperative Reinforcement Learning in a Multi-Agent Design System
    Liu, Hong
    Wang, Jihua
    [J]. PROCEEDINGS OF THE 2013 IEEE 17TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN (CSCWD), 2013, : 139 - 144
  • [7] Cooperative Multi-Agent Reinforcement Learning in Express System
    Li, Yexin
    Zheng, Yu
    Yang, Qiang
    [J]. CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 805 - 814
  • [8] CHQ: A multi-agent reinforcement learning scheme for partially observable Markov decision processes
    Osada, H
    Fujita, S
    [J]. IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON INTELLIGENT AGENT TECHNOLOGY, PROCEEDINGS, 2004, : 17 - 23
  • [9] Periodic Communication for Distributed Multi-agent Reinforcement Learning under Partially Observable Environment
    Kim, Seonghyun
    Lee, Donghun
    Jang, Ingook
    Kim, Hyunseok
    Son, Youngsung
    [J]. 2019 10TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY CONVERGENCE (ICTC): ICT CONVERGENCE LEADING THE AUTONOMOUS FUTURE, 2019, : 940 - 942
  • [10] CHQ: A multi-agent reinforcement learning scheme for partially observable Markov decision processes
    Osada, H
    Fujita, S
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2005, E88D (05): : 1004 - 1011