Implicit Learning of Compiled Macro-Actions for Planning

被引:2
|
作者
Newton, M. A. Hakim [1 ,2 ]
Levine, John [3 ]
机构
[1] Griffith Univ, Natl ICT Australia NICTA, Nathan, Qld 4111, Australia
[2] Griffith Univ, IIIS, Nathan, Qld 4111, Australia
[3] Univ Strathclyde, Comp & Informat Sci, Glasgow, Lanark, Scotland
关键词
HEURISTIC-SEARCH; OPERATORS; FF;
D O I
10.3233/978-1-60750-606-5-323
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We build a comprehensive macro-learning system and contribute in three different dimensions that have previously not been addressed adequately. Firstly, we learn macro-sets considering implicitly the interactions between constituent macros. Secondly, we effectively learn macros that are not found in given example plans. Lastly, we improve or reduce degradation of plan-length when macros are used; note, our main objective is to achieve fast planning. Our macro-learning system significantly outperforms a very recent macro-learning method both in solution speed and plan length.
引用
收藏
页码:323 / 328
页数:6
相关论文
共 50 条
  • [1] MAGIC: Learning Macro-Actions for Online POMDP Planning
    Lee, Yiyuan
    Cai, Panpan
    Hsu, David
    [J]. ROBOTICS: SCIENCE AND SYSTEM XVII, 2021,
  • [2] Mining useful Macro-actions in Planning
    Castellanos-Paez, Sandra
    Pellier, Damien
    Fiorino, Humbert
    Pesty, Sylvie
    [J]. 2016 THIRD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND PATTERN RECOGNITION (AIPR), 2016,
  • [3] Learning macro-actions in reinforcement learning
    Randlov, J
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 11, 1999, 11 : 1045 - 1051
  • [4] Planning with Macro-Actions in Decentralized POMDPs
    Amato, Christopher
    Konidaris, George D.
    Kaelbling, Leslie P.
    [J]. AAMAS'14: PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2014, : 1273 - 1280
  • [5] Approximate planning in POMDPs with macro-actions
    Theocharous, G
    Kaelbling, LP
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 16, 2004, 16 : 775 - 782
  • [6] Learning Useful Macro-actions for Planning with N-Grams
    Dulac, Adrien
    Pellier, Damien
    Fiorino, Humbert
    Janiszek, David
    [J]. 2013 IEEE 25TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2013, : 803 - 810
  • [7] PUMA: Planning under Uncertainty with Macro-Actions
    He, Ruijie
    Brunskill, Emma
    Roy, Nicholas
    [J]. PROCEEDINGS OF THE TWENTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-10), 2010, : 1089 - 1095
  • [8] Modeling and Planning with Macro-Actions in Decentralized POMDPs
    Amato, Christopher
    Konidaris, George
    Kaelbling, Leslie P.
    How, Jonathan P.
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2019, 64 : 817 - 859
  • [9] Efficient Planning under Uncertainty with Macro-actions
    He, Ruijie
    Brunskill, Emma
    Roy, Nicholas
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2011, 40 : 523 - 570
  • [10] Enhancing Temporal Planning by Sequential Macro-Actions
    De Bortoli, Marco
    Chrpa, Lukas
    Gebser, Martin
    Steinbauer-Wagner, Gerald
    [J]. LOGICS IN ARTIFICIAL INTELLIGENCE, JELIA 2023, 2023, 14281 : 595 - 604