Scene Memory Transformer for Embodied Agents in Long-Horizon Tasks

被引:93
|
作者
Fang, Kuan [1 ]
Toshev, Alexander [2 ]
Li Fei-Fei [1 ]
Savarese, Silvio [1 ]
机构
[1] Stanford Univ, Stanford, CA 94305 USA
[2] Google Brain, Mountain View, CA USA
关键词
NAVIGATION; VISION;
D O I
10.1109/CVPR.2019.00063
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many robotic applications require the agent to perform long-horizon tasks in partially observable environments. In such applications, decision making at any step can depend on observations received far in the past. Hence, being able to properly memorize and utilize the long-term history is crucial. In this work, we propose a novel memory-based policy, named Scene Memory Transformer (SMT). The proposed policy embeds and adds each observation to a memory and uses the attention mechanism to exploit spatio-temporal dependencies. This model is generic and can be efficiently trained with reinforcement learning over long episodes. On a range of visual navigation tasks, SMT demonstrates superior performance to existing reactive and memory-based policies by a margin.
引用
收藏
页码:538 / 547
页数:10
相关论文
共 50 条
  • [1] Skill Learning for Long-Horizon Sequential Tasks
    Alves, Joao
    Lau, Nuno
    Silva, Filipe
    [J]. PROGRESS IN ARTIFICIAL INTELLIGENCE, EPIA 2022, 2022, 13566 : 713 - 724
  • [2] ERRA: An Embodied Representation and Reasoning Architecture for Long-Horizon Language-Conditioned Manipulation Tasks
    Zhao, Chao
    Yuan, Shuai
    Jiang, Chunli
    Cai, Junhao
    Yu, Hongyu
    Wang, Michael Yu
    Chen, Qifeng
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (06) : 3230 - 3237
  • [3] Hierarchical Learning from Demonstrations for Long-Horizon Tasks
    Li, Boyao
    Li, Jiayi
    Lu, Tao
    Cai, Yinghao
    Wang, Shuo
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 4545 - 4551
  • [4] Programming-by-Demonstration for Long-Horizon Robot Tasks
    Patton, Noah
    Rahmani, Kia
    Missula, Meghana
    Biswas, Joydeep
    Dillig, Isil
    [J]. PROCEEDINGS OF THE ACM ON PROGRAMMING LANGUAGES-PACMPL, 2024, 8 (POPL): : 512 - 545
  • [5] Long-Horizon Returns
    Fama, Eugene F.
    French, Kenneth R.
    [J]. REVIEW OF ASSET PRICING STUDIES, 2018, 8 (02): : 232 - 252
  • [6] Hierarchical Planning for Long-Horizon Manipulation with Geometric and Symbolic Scene Graphs
    Zhu, Yifeng
    Tremblay, Jonathan
    Birchfield, Stan
    Zhu, Yuke
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 6541 - 6548
  • [7] IKEA Furniture Assembly Environment for Long-Horizon Complex Manipulation Tasks
    Lee, Youngwoon
    Hu, Edward S.
    Lim, Joseph J.
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 6343 - 6349
  • [8] Expert Demonstration Collection of Long-Horizon Construction Tasks in Virtual Reality
    Li, Rui
    Zou, Zhengbo
    [J]. COMPUTING IN CIVIL ENGINEERING 2023-VISUALIZATION, INFORMATION MODELING, AND SIMULATION, 2024, : 239 - 247
  • [9] A Framework of Robot Skill Learning From Complex and Long-Horizon Tasks
    Wu, Hongmin
    Yan, Wu
    Xu, Zhihao
    Cheng, Taobo
    Zhou, Xuefeng
    [J]. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2022, 19 (04) : 3628 - 3638
  • [10] Uncertainty-aware hierarchical reinforcement learning for long-horizon tasks
    Hu, Wenning
    Wang, Hongbin
    He, Ming
    Wang, Nianbin
    [J]. APPLIED INTELLIGENCE, 2023, 53 (23) : 28555 - 28569