Planning with Macro-Actions in Decentralized POMDPs

被引:0
|
作者
Amato, Christopher [1 ]
Konidaris, George D. [1 ]
Kaelbling, Leslie P. [1 ]
机构
[1] MIT, CSAIL, 77 Massachusetts Ave, Cambridge, MA 02139 USA
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Decentralized partially observableMarkov decision processes (Dec-POMDPs) are general models for decentralized decision making under uncertainty. However, they typically model a problem at a low level of granularity, where each agent's actions are primitive operations lasting exactly one time step. We address the case where each agent has macroactions: temporally extended actions which may require different amounts of time to execute. We model macroactions as options in a factored Dec-POMDP model, focusing on options which depend only on information available to an individual agent while executing. This enables us to model systems where coordination decisions only occur at the level of deciding which macro-actions to execute, and the macro-actions themselves can then be executed to completion. The core technical difficulty when using options in a Dec-POMDP is that the options chosen by the agents no longer terminate at the same time. We present extensions of two leading Dec-POMDP algorithms for generating a policy with options and discuss the resulting form of optimality. Our results show that these algorithms retain agent coordination while allowing near-optimal solutions to be generated for significantly longer horizons and larger state-spaces than previous Dec-POMDP methods.
引用
收藏
页码:1273 / 1280
页数:8
相关论文
共 50 条
  • [1] Modeling and Planning with Macro-Actions in Decentralized POMDPs
    Amato, Christopher
    Konidaris, George
    Kaelbling, Leslie P.
    How, Jonathan P.
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2019, 64 : 817 - 859
  • [2] Approximate planning in POMDPs with macro-actions
    Theocharous, G
    Kaelbling, LP
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 16, 2004, 16 : 775 - 782
  • [3] Mining useful Macro-actions in Planning
    Castellanos-Paez, Sandra
    Pellier, Damien
    Fiorino, Humbert
    Pesty, Sylvie
    [J]. 2016 THIRD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND PATTERN RECOGNITION (AIPR), 2016,
  • [4] PUMA: Planning under Uncertainty with Macro-Actions
    He, Ruijie
    Brunskill, Emma
    Roy, Nicholas
    [J]. PROCEEDINGS OF THE TWENTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-10), 2010, : 1089 - 1095
  • [5] Efficient Planning under Uncertainty with Macro-actions
    He, Ruijie
    Brunskill, Emma
    Roy, Nicholas
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2011, 40 : 523 - 570
  • [6] Enhancing Temporal Planning by Sequential Macro-Actions
    De Bortoli, Marco
    Chrpa, Lukas
    Gebser, Martin
    Steinbauer-Wagner, Gerald
    [J]. LOGICS IN ARTIFICIAL INTELLIGENCE, JELIA 2023, 2023, 14281 : 595 - 604
  • [7] Implicit Learning of Compiled Macro-Actions for Planning
    Newton, M. A. Hakim
    Levine, John
    [J]. ECAI 2010 - 19TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2010, 215 : 323 - 328
  • [8] MAGIC: Learning Macro-Actions for Online POMDP Planning
    Lee, Yiyuan
    Cai, Panpan
    Hsu, David
    [J]. ROBOTICS: SCIENCE AND SYSTEM XVII, 2021,
  • [9] Exploiting macro-actions and predicting plan length in planning as satisfiability
    Gerevini, Alfonso Emilio
    Saetti, Alessandro
    Vallati, Mauro
    [J]. AI COMMUNICATIONS, 2015, 28 (02) : 323 - 344
  • [10] Exploiting Macro-actions and Predicting Plan Length in Planning as Satisfiability
    Gerevini, Alfonso Emilio
    Saetti, Alessandro
    Vallati, Mauro
    [J]. AI(STAR)IA 2011: ARTIFICIAL INTELLIGENCE AROUND MAN AND BEYOND, 2011, 6934 : 189 - 200