Instance-aware Exploration-Verification-Exploitation for Instance ImageGoal Navigation

被引：0

作者：

Lei, Xiaohan ^{[1
]}

Wang, Min ^{[2
]}

Zhou, Wengang ^{[1
,2
]}

Li, Li ^{[1
]}

Li, Houqiang ^{[1
,2
]}

机构：

[1] Univ Sci & Technol China, MoE Key Lab Brain Inspired Intelligent Percept &, Hefei, Anhui, Peoples R China

[2] Hefei Comprehens Natl Sci Ctr, Inst Artificial Intelligence, Hefei, Anhui, Peoples R China

来源：

2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2024年

基金：

中国国家自然科学基金;

关键词：

D O I：

10.1109/CVPR52733.2024.01545

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

As a new embodied vision task, Instance ImageGoal Navigation (IIN) aims to navigate to a specified object depicted by a goal image in an unexplored environment. The main challenge of this task lies in identifying the target object from different viewpoints while rejecting similar distractors. Existing ImageGoal Navigation methods usually adopt the simple Exploration-Exploitation framework and ignore the identification of specific instance during navigation. In this work, we propose to imitate the human behaviour of "getting closer to confirm" when distinguishing objects from a distance. Specifically, we design a new modular navigation framework named Instance-aware Exploration-Verification- Exploitation (IEVE) for instance-level image goal navigation. Our method allows for active switching among the exploration, verification, and exploitation actions, thereby facilitating the agent in making reasonable decisions under different situations. On the challenging HabitatMatterport 3D semantic (HM3D-SEM) dataset, our method surpasses previous state-of-the-art work, with a classical segmentation model (0.684 vs. 0.561 success) or a robust model (0.702 vs. 0.561 success). Our code will be made publicly available at https://github.com/XiaohanLei/IEVE.

引用

页码：16329 / 16339

页数：11

共 50 条

[21] MARS: An Instance-Aware, Modular and Realistic Simulator for Autonomous Driving
Wu, Zirui
Liu, Tianyu
Luo, Liyi
Zhong, Zhide
Chen, Jianteng
Xiao, Hongmin
Hou, Chao
Lou, Haozhe
Chen, Yuantao
Yang, Runyi
Huang, Yuxin
Ye, Xiaoyu
Yan, Zike
Shi, Yongliang
Liao, Yiyi
Zhao, Hao
ARTIFICIAL INTELLIGENCE, CICAI 2023, PT I, 2024, 14473 : 3 - 15
[22] Instance-aware Contrastive Learning for Occluded Human Mesh Reconstruction
Gwon, Mi-Gyeong
Um, Gi-Mun
Cheong, Won-Sik
Kim, Wonjun
2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 10553 - 10562
[23] InsMOS: Instance-Aware Moving Object Segmentation in LiDAR Data
Wang, Neng
Shi, Chenghao
Guo, Ruibin
Lu, Huimin
Zheng, Zhiqiang
Chen, Xieyuanli
2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 7598 - 7605
[24] InstaFormer: Instance-Aware Image-to-Image Translation with Transformer
Kim, Soohyun
Baek, Jongbeom
Park, Jihye
Kim, Gyeongnyeon
Kim, Seungryong
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 18300 - 18310
[25] Instance-Aware Hashing for Multi-Label Image Retrieval
Lai, Hanjiang
Yan, Pan
Shu, Xiangbo
Wei, Yunchao
Yan, Shuicheng
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (06) : 2469 - 2479
[26] Progressive Instance-Aware Feature Learning for Compositional Action Recognition
Yan, Rui
Xie, Lingxi
Shu, Xiangbo
Zhang, Liyan
Tang, Jinhui
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (08) : 10317 - 10330
[27] Deep Correlation Filter Tracking With Shepherded Instance-Aware Proposals
Liang, Yanjie
Wu, Qiangqiang
Liu, Yi
Yan, Yan
Wang, Hanzi
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (08) : 11408 - 11421
[28] Artistic Instance-Aware Image Filtering by Convolutional Neural Networks
Tehrani, Milad
Bagheri, Mahnoosh
Ahmadi, Mahdi
Norouzi, Alireza
Karimi, Nader
Samavi, Shadrokh
2018 9TH INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS (IST), 2018, : 710 - 714
[29] Instance-aware Image and Sentence Matching with Selective Multimodal LSTM
Huang, Yan
Wang, Wei
Wang, Liang
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 7254 - 7262
[30] INSTANCE-AWARE SIMPLIFICATION OF 3D POLYGONAL MESHES
Azim, Tahir
Cheslack-Postava, Ewen
Levis, Philip
2015 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO (ICME), 2015,

← 1 2 3 4 5 →