Improving Target-driven Visual Navigation with Attention on 3D Spatial Relationships

被引:16
|
作者
Lyu, Yunlian [1 ]
Shi, Yimin [1 ]
Zhang, Xianggang [2 ]
机构
[1] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Xiyuan Ave, Chengdu 611731, Sichuan, Peoples R China
[2] Univ Elect Sci & Technol China, Sch Aeronaut & Astronaut, Xiyuan Ave, Chengdu 611731, Sichuan, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep reinforcement learning; Graph convolutional networks; Visual navigation; 3D scenes; LANGUAGE;
D O I
10.1007/s11063-022-10796-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Embodied Artificial Intelligence has become popular in recent years. Its task shifts from focusing on internet images to active settings, involving an embodied agent to perceive and act within 3D environments. In this paper, we study the Target-driven Visual Navigation (TDVN) in 3D indoor scenes using deep reinforcement learning techniques. The generalization of TDVN is a long-standing ill-posed issue, where the agent is expected to transfer intelligent knowledge from training domains to unseen domains. To address this issue, we propose a model that combines visual and relational graph features to learn the navigation policy. Graph convolutional networks are used to obtain graph features, which encodes spatial relations between objects. We also adopt a Target Skill Extension module to generate sub-targets, in order to allow the agent to learn from its failures. For evaluation, we perform experiments in the AI2-THOR. Experimental results show that our proposed model outperforms baselines under various metrics.
引用
收藏
页码:3979 / 3998
页数:20
相关论文
共 50 条
  • [1] Improving Target-driven Visual Navigation with Attention on 3D Spatial Relationships
    Yunlian Lyu
    Yimin Shi
    Xianggang Zhang
    [J]. Neural Processing Letters, 2022, 54 : 3979 - 3998
  • [2] INDOOR TARGET-DRIVEN VISUAL NAVIGATION BASED ON SPATIAL SEMANTIC INFORMATION
    Yan, Jiaojie
    Zhang, Qieshi
    Cheng, Jun
    Ren, Ziliang
    Li, Tian
    Yang, Zhuo
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 571 - 575
  • [3] Target-Driven Visual Navigation Using Causal Intervention
    Zhao, Xinzhou
    Wang, Tian
    Liu, Kexin
    Zhang, Baochang
    Li, Ce
    Snoussi, Hichem
    [J]. 2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 3508 - 3513
  • [4] Target-Driven Visual Navigation by Using Causal Intervention
    Zhao, Xinzhou
    Wang, Tian
    Li, Yanjing
    Zhang, Baochang
    Liu, Kexin
    Liu, Deyuan
    Wang, Chuanyun
    Snoussi, Hichem
    [J]. IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (01): : 1294 - 1304
  • [5] Visual Context Embedding and Deadlock Processing for Target-driven Visual Navigation
    Kim, Jin-Hwan
    Choi, Jeong-Hyun
    Kim, Incheol
    [J]. Journal of Institute of Control, Robotics and Systems, 2023, 29 (01): : 35 - 47
  • [6] Target-driven Indoor Visual Navigation Using Inverse Reinforcement Learning
    Wang, Xitong
    Fang, Qiang
    Xu, Xin
    [J]. 2020 INTERNATIONAL CONFERENCE ON IMAGE, VIDEO PROCESSING AND ARTIFICIAL INTELLIGENCE, 2020, 11584
  • [7] Towards Generalization in Target-Driven Visual Navigation by Using Deep Reinforcement Learning
    Devo, Alessandro
    Mezzetti, Giacomo
    Costante, Gabriele
    Fravolini, Mario L.
    Valigi, Paolo
    [J]. IEEE TRANSACTIONS ON ROBOTICS, 2020, 36 (05) : 1546 - 1561
  • [8] Towards Target-Driven Visual Navigation in Indoor Scenes via Generative Imitation Learning
    Wu, Qiaoyun
    Gong, Xiaoxi
    Xu, Kai
    Manocha, Dinesh
    Dong, Jingxuan
    Wang, Jun
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (01) : 175 - 182
  • [9] Target-driven visual navigation in indoor scenes using reinforcement learning and imitation learning
    Fang, Qiang
    Xu, Xin
    Wang, Xitong
    Zeng, Yujun
    [J]. CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2022, 7 (02) : 167 - 176
  • [10] A New Representation of Universal Successor Features for Enhancing the Generalization of Target-Driven Visual Navigation
    Hu, Jiaocheng
    Ma, Yuexin
    Jiang, Haiyun
    He, Shaofeng
    Liu, Gelu
    Weng, Qizhen
    Zhu, Xiangwei
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (11): : 9510 - 9517