Long-Short Term Policy for Visual Object Navigation

被引:0
|
作者
Bai, Yubing [1 ,2 ]
Song, Xinhang [1 ,2 ]
Li, Weijie [1 ,2 ]
Zhang, Sixian [1 ,2 ]
Jiang, Shuqiang [1 ,2 ,3 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Chinese Acad Sci, Inst Intelligent Comp Technol, Suzhou, Peoples R China
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
D O I
10.1109/IROS55552.2023.10341652
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The goal of visual object navigation for an agent is to find the target objects accurately. Recent works mainly focus on the feature of embedding, attempting to learn better features with different variants, such as object distribution and graph representations. However, some typical navigation problems in complex environments, such as partially known and obstacle problems, may not be effectively addressed by previous feature embedding methods. In this paper, we propose a framework with a long-short objective policy, where the hidden states are classified according to the navigation objectives at that moment and separately rewarded. Specifically, we consider two objectives: the long-term objective is to go closer to the target, and the short-term objective is for obstacle avoidance and exploration. To alleviate the effect of long-term and short-term alternation, we build a state memory and propose an adjustment gate to update the state memory. Finally, all past hidden states are reweighted and combined for action prediction with an action-boosting gate. Experimental results on RoboTHOR show that the proposed method can significantly outperform the state-of-the-art.
引用
收藏
页码:9035 / 9042
页数:8
相关论文
共 50 条
  • [1] Long-short Term Prediction for Occluded Multiple Object Tracking
    Sun, Zhihong
    Chen, Jun
    Mukherjee, Mithun
    Ruan, Weijian
    Liang, Chao
    Yu, Yi
    Zhang, Dan
    [J]. 2021 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2021,
  • [2] Visual Tracking With Long-Short Term Based Correlation Filter
    Yang, Yuxiang
    Xing, Weiwei
    Zhang, Shunli
    Gao, Limin
    Yu, Qi
    Che, Xiaoping
    Lu, Wei
    [J]. IEEE ACCESS, 2020, 8 : 20257 - 20269
  • [3] Visual Relocalization using Long-Short Term Memory Fully Convolutional Network
    Zhou, Lipu
    [J]. 2017 IEEE 29TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2017), 2017, : 602 - 609
  • [4] Visual Attention Based on Long-Short Term Memory Model for Image Caption Generation
    Qu, Shiru
    Xi, Yuling
    Ding, Songtao
    [J]. 2017 29TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2017, : 4789 - 4794
  • [5] Supervised Guiding Long-Short Term Memory for Image Caption Generation based on Object Classes
    Wang, Jian
    Cao, Zhiguo
    Xiao, Yang
    Qi, Xinyuan
    [J]. MIPPR 2017: PATTERN RECOGNITION AND COMPUTER VISION, 2017, 10609
  • [6] Online Portfolio Selection with Long-Short Term Forecasting
    Li R.
    Liu J.
    [J]. Operations Research Forum, 3 (4)
  • [7] Generating Video Description with Long-Short Term Memory
    Li, Shuohao
    Zhang, Jun
    Guo, Qiang
    Lei, Jun
    Tu, Dan
    [J]. 2016 INTERNATIONAL CONFERENCE ON IMAGE, VISION AND COMPUTING (ICIVC 2016), 2016, : 73 - 78
  • [8] Effect of short-term and long-short systemic corticosteroid on the electroretinogram
    Lam, BL
    Chang, Q
    Hamasaki, DI
    Davis, JL
    [J]. INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2000, 41 (04) : S242 - S242
  • [9] Multi Long-Short Term Memory Models for Short Term Traffic Flow Prediction
    Xue, Zelong
    Xue, Yang
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2018, E101D (12): : 3272 - 3275
  • [10] Short Term Prediction of Wind Speed Based on Long-Short Term Memory Networks
    Salman, Umar T.
    Rehman, Shafiqur
    Alawode, Basit
    Alhems, Luai M.
    [J]. FME TRANSACTIONS, 2021, 49 (03): : 643 - 652