Long-Horizon Visual Planning with Goal-Conditioned Hierarchical Predictors

被引:0
|
作者
Pertsch, Karl [1 ]
Rybkin, Oleh [2 ]
Ebert, Frederik [3 ]
Finn, Chelsea [4 ]
Jayaraman, Dinesh [2 ]
Levine, Sergey [3 ]
机构
[1] USC, Los Angeles, CA 90007 USA
[2] UPenn, Philadelphia, PA USA
[3] Univ Calif Berkeley, Berkeley, CA USA
[4] Stanford Univ, Stanford, CA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The ability to predict and plan into the future is fundamental for agents acting in the world. To reach a faraway goal, we predict trajectories at multiple timescales, first devising a coarse plan towards the goal and then gradually filling in details. In contrast, current learning approaches for visual prediction and planning fail on long-horizon tasks as they generate predictions (1) without considering goal information, and (2) at the finest temporal resolution, one step at a time. In this work we propose a framework for visual prediction and planning that is able to overcome both of these limitations. First, we formulate the problem of predicting towards a goal and propose the corresponding class of latent space goal-conditioned predictors (GCPs). GCPs significantly improve planning efficiency by constraining the search space to only those trajectories that reach the goal. Further, we show how GCPs can be naturally formulated as hierarchical models that, given two observations, predict an observation between them, and by recursively subdividing each part of the trajectory generate complete sequences. This divide-and-conquer strategy is effective at long-term prediction, and enables us to design an effective hierarchical planning algorithm that optimizes trajectories in a coarse-to-fine manner. We show that by using both goal-conditioning and hierarchical prediction, GCPs enable us to solve visual planning tasks with much longer horizon than previously possible.
引用
收藏
页数:13
相关论文
共 48 条
  • [1] Successor Feature Landmarks for Long-Horizon Goal-Conditioned Reinforcement Learning
    Hoang, Christopher
    Sohn, Sungryull
    Choi, Jongwook
    Carvalho, Wilka
    Lee, Honglak
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021,
  • [2] Planning with Goal-Conditioned Policies
    Nasiriany, Soroush
    Pong, Vitchyr H.
    Lin, Steven
    Levine, Sergey
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [3] Goal exploration augmentation via pre-trained skills for sparse-reward long-horizon goal-conditioned reinforcement learning
    Lisheng Wu
    Ke Chen
    [J]. Machine Learning, 2024, 113 : 2527 - 2557
  • [4] Hierarchical Planning Through Goal-Conditioned Offline Reinforcement Learning
    Li, Jinning
    Tang, Chen
    Tomizuka, Masayoshi
    Zhan, Wei
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (04) : 10216 - 10223
  • [5] Goal exploration augmentation via pre-trained skills for sparse-reward long-horizon goal-conditioned reinforcement learning
    Wu, Lisheng
    Chen, Ke
    [J]. MACHINE LEARNING, 2024, 113 (05) : 2527 - 2557
  • [6] Sample Complexity of Goal-Conditioned Hierarchical Reinforcement Learning
    Robert, Arnaud
    Pike-Burke, Ciara
    Faisal, A. Aldo
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [7] Goal-conditioned Offline Planning from Curious Exploration
    Bagatella, Marco
    Martius, Georg
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [8] Hierarchical Planning for Long-Horizon Manipulation with Geometric and Symbolic Scene Graphs
    Zhu, Yifeng
    Tremblay, Jonathan
    Birchfield, Stan
    Zhu, Yuke
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 6541 - 6548
  • [9] Generating Goal-conditioned Sub-goals for Hierarchical Learning
    Choi, Jinwoo
    Seo, Seung-Woo
    [J]. 2022 INTERNATIONAL CONFERENCE ON ELECTRONICS, INFORMATION, AND COMMUNICATION (ICEIC), 2022,
  • [10] Goal-Conditioned Reinforcement Learning With Disentanglement-Based Reachability Planning
    Qian, Zhifeng
    You, Mingyu
    Zhou, Hongjun
    Xu, Xuanhui
    He, Bin
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (08): : 4721 - 4728