Value-Based Subgoal Discovery and Path Planning for Reaching Long-Horizon Goals

被引:0
|
作者
Pateria, Shubham [1 ]
Subagdja, Budhitama [1 ]
Tan, Ah-Hwee [1 ]
Quek, Chai [2 ]
机构
[1] Singapore Management Univ, Sch Comp & Informat Syst, Singapore 178902, Singapore
[2] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore
基金
新加坡国家研究基金会;
关键词
Planning; Visualization; Task analysis; Simultaneous localization and mapping; Navigation; Training; Predictive models; Long-horizon goal-reaching; motion planning; path planning; reinforcement learning (RL); subgoal discovery; subgoal graph;
D O I
10.1109/TNNLS.2023.3240004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning to reach long-horizon goals in spatial traversal tasks is a significant challenge for autonomous agents. Recent subgoal graph-based planning methods address this challenge by decomposing a goal into a sequence of shorter-horizon subgoals. These methods, however, use arbitrary heuristics for sampling or discovering subgoals, which may not conform to the cumulative reward distribution. Moreover, they are prone to learning erroneous connections (edges) between subgoals, especially those lying across obstacles. To address these issues, this article proposes a novel subgoal graph-based planning method called learning subgoal graph using value-based subgoal discovery and automatic pruning (LSGVP). The proposed method uses a subgoal discovery heuristic that is based on a cumulative reward (value) measure and yields sparse subgoals, including those lying on the higher cumulative reward paths. Moreover, LSGVP guides the agent to automatically prune the learned subgoal graph to remove the erroneous edges. The combination of these novel features helps the LSGVP agent to achieve higher cumulative positive rewards than other subgoal sampling or discovery heuristics, as well as higher goal-reaching success rates than other state-of-the-art subgoal graph-based planning methods.
引用
收藏
页码:10288 / 10300
页数:13
相关论文
共 50 条
  • [1] Long-Horizon Informative Path Planning with Obstacles and Time Constraints
    Ge, Yaolin
    Olaisen, Andre Julius Hovd
    Eidsvik, Jo
    Jain, R. Praveen
    Johansen, Tor Arne
    [J]. IFAC PAPERSONLINE, 2022, 55 (31): : 124 - 129
  • [2] Deterministic policy optimization with clipped value expansion and long-horizon planning
    Gao, Shiqing
    Shi, Haibo
    Wang, Fang
    Wang, Zijian
    Zhang, Siyu
    Li, Yunxia
    Sun, Yaoru
    [J]. NEUROCOMPUTING, 2022, 483 : 299 - 310
  • [3] Structured Motion Generation with Predictive Learning: Proposing Subgoal for Long-Horizon Manipulation
    Saito, Namiko
    Moura, Joao
    Ogata, Tetsuya
    Aoyama, Marina Y.
    Murata, Shingo
    Sugano, Shigeki
    Vijayakumar, Sethu
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 9566 - 9572
  • [4] Collaborative foresight: Complementing long-horizon strategic planning
    Weigand, Kirk
    Flanagan, Thomas
    Dye, Kevin
    Jones, Peter
    [J]. TECHNOLOGICAL FORECASTING AND SOCIAL CHANGE, 2014, 85 : 134 - 152
  • [5] ISPIL: Interactive Sub-Goal-Planning Imitation Learning for Long-Horizon Tasks With Diverse Goals
    Ochoa, Cynthia
    Oh, Hanbit
    Kwon, Yuhwan
    Domae, Yukiyasu
    Matsubara, Takamitsu
    [J]. IEEE Access, 2024, 12 : 197616 - 197631
  • [6] Robust Task and Motion Planning for Long-Horizon Architectural Construction Planning
    Hartmann, Valentin N.
    Oguz, Ozgur S.
    Driess, Danny
    Toussaint, Marc
    Menges, Achim
    [J]. 2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 6886 - 6893
  • [7] Optimal Value and Growth Tilts in Long-Horizon Portfolios
    Jurek, Jakub W.
    Viceira, Luis M.
    [J]. REVIEW OF FINANCE, 2011, 15 (01) : 29 - 74
  • [8] Long-horizon Robotic Search and Classification using Sampling-based Motion Planning
    Hollinger, Geoffrey A.
    [J]. ROBOTICS: SCIENCE AND SYSTEMS XI, 2015,
  • [9] Long-Horizon Multi-Robot Rearrangement Planning for Construction Assembly
    Hartmann, Valentin N.
    Orthey, Andreas
    Driess, Danny
    Oguz, Ozgur S.
    Toussaint, Marc
    [J]. IEEE TRANSACTIONS ON ROBOTICS, 2023, 39 (01) : 239 - 252
  • [10] Long-Horizon Visual Planning with Goal-Conditioned Hierarchical Predictors
    Pertsch, Karl
    Rybkin, Oleh
    Ebert, Frederik
    Finn, Chelsea
    Jayaraman, Dinesh
    Levine, Sergey
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33