Scalable Planning and Learning for Multiagent POMDPs

被引:0
|
作者
Amato, Christopher [1 ]
Oliehoek, Frans A. [2 ,3 ]
机构
[1] MIT, CSAIL, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[2] Univ Amsterdam, Informat Inst, Amsterdam, Netherlands
[3] Univ Liverpool, Dept CS, Liverpool, Merseyside, England
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Online, sample-based planning algorithms for POMDPs have shown great promise in scaling to problems with large state spaces, but they become intractable for large action and observation spaces. This is particularly problematic in multiagent POMDPs where the action and observation space grows exponentially with the number of agents. To combat this intractability, we propose a novel scalable approach based on sample-based planning and factored value functions that exploits structure present in many multiagent settings. This approach applies not only in the planning case, but also in the Bayesian reinforcement learning setting. Experimental results show that we are able to provide high quality solutions to large multiagent planning and learning problems.
引用
收藏
页码:1995 / 2002
页数:8
相关论文
共 50 条
  • [1] Multiagent Planning with Trembling-Hand Perfect Equilibrium in Multiagent POMDPs
    Yabu, Yuichi
    Yokoo, Makoto
    Iwasaki, Atsushi
    [J]. AGENT COMPUTING AND MULTI-AGENT SYSTEMS, 2009, 5044 : 13 - 24
  • [2] Integrated common sense learning and planning in POMDPs
    Washington University, 1 Brookings Dr., St. Louis
    MO
    63130, United States
    [J]. J. Mach. Learn. Res., (1-37):
  • [3] Randomized Coordination Search for Scalable Multiagent Planning
    Ure, N. Kemal
    How, Jonathan P.
    Vian, John
    [J]. PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS (AAMAS'15), 2015, : 1793 - 1794
  • [4] Integrated Common Sense Learning and Planning in POMDPs
    Juba, Brendan
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2016, 17
  • [5] Partition Learning for Multiagent Planning
    Wood, Jared
    Hedrick, J. Karl
    [J]. JOURNAL OF ROBOTICS, 2012, 2012
  • [6] Compositional Learning-based Planning for Vision POMDPs
    Deglurkar, Sampada
    Lim, Michael H.
    Tucker, Johnathan
    Sunberg, Zachary N.
    Faust, Aleksandra
    Tomlin, Claire J.
    [J]. LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 211, 2023, 211
  • [7] Scalable Reinforcement Learning for Multiagent Networked Systems
    Qu, Guannan
    Wierman, Adam
    Li, Na
    [J]. OPERATIONS RESEARCH, 2022, 70 (06) : 3601 - 3628
  • [8] Scalable Decision-Theoretic Planning in Open and Typed Multiagent Systems
    Eck, Adam
    Shah, Maulik
    Doshi, Prashant
    Soh, Leen-Kiat
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 7127 - 7134
  • [9] Planning, learning and coordination in multiagent decision processes
    Boutilier, C
    [J]. THEORETICAL ASPECTS OF RATIONALITY AND KNOWLEDGE, 1996, : 195 - 210
  • [10] Barrier Functions for Multiagent-POMDPs with DTL Specifications
    Ahmadi, Mohamadreza
    Singletary, Andrew
    Burdick, Joel W.
    Ames, Aaron D.
    [J]. 2020 59TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2020, : 1380 - 1385