Instance-aware Exploration-Verification-Exploitation for Instance ImageGoal Navigation

被引:0
|
作者
Lei, Xiaohan [1 ]
Wang, Min [2 ]
Zhou, Wengang [1 ,2 ]
Li, Li [1 ]
Li, Houqiang [1 ,2 ]
机构
[1] Univ Sci & Technol China, MoE Key Lab Brain Inspired Intelligent Percept &, Hefei, Anhui, Peoples R China
[2] Hefei Comprehens Natl Sci Ctr, Inst Artificial Intelligence, Hefei, Anhui, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1109/CVPR52733.2024.01545
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As a new embodied vision task, Instance ImageGoal Navigation (IIN) aims to navigate to a specified object depicted by a goal image in an unexplored environment. The main challenge of this task lies in identifying the target object from different viewpoints while rejecting similar distractors. Existing ImageGoal Navigation methods usually adopt the simple Exploration-Exploitation framework and ignore the identification of specific instance during navigation. In this work, we propose to imitate the human behaviour of "getting closer to confirm" when distinguishing objects from a distance. Specifically, we design a new modular navigation framework named Instance-aware Exploration-Verification- Exploitation (IEVE) for instance-level image goal navigation. Our method allows for active switching among the exploration, verification, and exploitation actions, thereby facilitating the agent in making reasonable decisions under different situations. On the challenging HabitatMatterport 3D semantic (HM3D-SEM) dataset, our method surpasses previous state-of-the-art work, with a classical segmentation model (0.684 vs. 0.561 success) or a robust model (0.702 vs. 0.561 success). Our code will be made publicly available at https://github.com/XiaohanLei/IEVE.
引用
收藏
页码:16329 / 16339
页数:11
相关论文
共 50 条
  • [21] MARS: An Instance-Aware, Modular and Realistic Simulator for Autonomous Driving
    Wu, Zirui
    Liu, Tianyu
    Luo, Liyi
    Zhong, Zhide
    Chen, Jianteng
    Xiao, Hongmin
    Hou, Chao
    Lou, Haozhe
    Chen, Yuantao
    Yang, Runyi
    Huang, Yuxin
    Ye, Xiaoyu
    Yan, Zike
    Shi, Yongliang
    Liao, Yiyi
    Zhao, Hao
    ARTIFICIAL INTELLIGENCE, CICAI 2023, PT I, 2024, 14473 : 3 - 15
  • [22] Instance-aware Contrastive Learning for Occluded Human Mesh Reconstruction
    Gwon, Mi-Gyeong
    Um, Gi-Mun
    Cheong, Won-Sik
    Kim, Wonjun
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 10553 - 10562
  • [23] InsMOS: Instance-Aware Moving Object Segmentation in LiDAR Data
    Wang, Neng
    Shi, Chenghao
    Guo, Ruibin
    Lu, Huimin
    Zheng, Zhiqiang
    Chen, Xieyuanli
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 7598 - 7605
  • [24] InstaFormer: Instance-Aware Image-to-Image Translation with Transformer
    Kim, Soohyun
    Baek, Jongbeom
    Park, Jihye
    Kim, Gyeongnyeon
    Kim, Seungryong
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 18300 - 18310
  • [25] Instance-Aware Hashing for Multi-Label Image Retrieval
    Lai, Hanjiang
    Yan, Pan
    Shu, Xiangbo
    Wei, Yunchao
    Yan, Shuicheng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (06) : 2469 - 2479
  • [26] Progressive Instance-Aware Feature Learning for Compositional Action Recognition
    Yan, Rui
    Xie, Lingxi
    Shu, Xiangbo
    Zhang, Liyan
    Tang, Jinhui
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (08) : 10317 - 10330
  • [27] Deep Correlation Filter Tracking With Shepherded Instance-Aware Proposals
    Liang, Yanjie
    Wu, Qiangqiang
    Liu, Yi
    Yan, Yan
    Wang, Hanzi
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (08) : 11408 - 11421
  • [28] Artistic Instance-Aware Image Filtering by Convolutional Neural Networks
    Tehrani, Milad
    Bagheri, Mahnoosh
    Ahmadi, Mahdi
    Norouzi, Alireza
    Karimi, Nader
    Samavi, Shadrokh
    2018 9TH INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS (IST), 2018, : 710 - 714
  • [29] Instance-aware Image and Sentence Matching with Selective Multimodal LSTM
    Huang, Yan
    Wang, Wei
    Wang, Liang
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 7254 - 7262
  • [30] INSTANCE-AWARE SIMPLIFICATION OF 3D POLYGONAL MESHES
    Azim, Tahir
    Cheslack-Postava, Ewen
    Levis, Philip
    2015 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO (ICME), 2015,