Revisit Anything: Visual Place Recognition via Image Segment Retrieval

Cited by: 0
Authors
Garg, Kartik [1]
Shubodh, Sai [2]
Kolathaya, Shishir [1]
Krishna, Madhava [2]
Garg, Sourav [3]
Affiliations
[1] Indian Institute of Science (IISc), Bengaluru, India
[2] International Institute of Information Technology, Hyderabad, India
[3] University of Adelaide, Adelaide, SA, Australia
Source
Keywords
Visual Place Recognition; Image Segmentation; Robotics; SCALE
DOI
10.1007/978-3-031-73113-6_19
Chinese Library Classification number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Accurately recognizing a revisited place is crucial for embodied agents to localize and navigate. This requires visual representations to be distinct, despite strong variations in camera viewpoint and scene appearance. Existing visual place recognition pipelines encode the whole image and search for matches. This poses a fundamental challenge in matching two images of the same place captured from different camera viewpoints: the similarity of what overlaps can be dominated by the dissimilarity of what does not overlap. We address this by encoding and searching for image segments instead of whole images. We propose to use open-set image segmentation to decompose an image into 'meaningful' entities (i.e., things and stuff). This enables us to create a novel image representation as a collection of multiple overlapping subgraphs connecting a segment with its neighboring segments, dubbed SuperSegment. Furthermore, to efficiently encode these SuperSegments into compact vector representations, we propose a novel factorized representation of feature aggregation. We show that retrieving these partial representations leads to significantly higher recognition recall than typical whole-image-based retrieval. Our segment-based approach, dubbed SegVLAD, sets a new state-of-the-art in place recognition on a diverse selection of benchmark datasets, while being applicable to both generic and task-specialized image encoders. Finally, we demonstrate the potential of our method to "revisit anything" by evaluating it on an object instance retrieval task, which bridges the two disparate areas of research: visual place recognition and object-goal navigation, through their common aim of recognizing goal objects specific to a place. Source code: https://github.com/AnyLoc/Revisit-Anything.
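The abstract's central observation, that the similarity of the overlapping content can be swamped by the dissimilarity of everything else, can be illustrated with a small numerical toy. The sketch below is not SegVLAD and does not reproduce the paper's factorized aggregation. It assumes hand-made 6-dimensional segment descriptors, and a plain sum-then-normalize step stands in for whole-image encoding. All function names and descriptor values are hypothetical.

```python
import math

def normalize(v):
    """Scale a vector to unit length."""
    n = math.sqrt(sum(x * x for x in v))
    return [x / n for x in v]

def cosine(a, b):
    """Cosine similarity between two vectors."""
    a, b = normalize(a), normalize(b)
    return sum(x * y for x, y in zip(a, b))

def whole_image_descriptor(segments):
    """Stand-in for whole-image encoding: sum the segment descriptors, then normalize."""
    dim = len(segments[0])
    return normalize([sum(seg[i] for seg in segments) for i in range(dim)])

def best_segment_similarity(query_segments, ref_segments):
    """Segment-level matching: score an image pair by its best-matching segment pair."""
    return max(cosine(q, r) for q in query_segments for r in ref_segments)

def basis(i, dim=6):
    """Toy descriptor: a one-hot vector, so distinct segments are orthogonal."""
    return [1.0 if j == i else 0.0 for j in range(dim)]

# Query and same-place reference share exactly one segment (index 0);
# the rest of each view shows different content because the viewpoint changed.
query = [basis(0), basis(1), basis(2)]
same_place = [basis(0), basis(3), basis(4)]

# A different place whose segments all look vaguely like the query's content.
blend = normalize([1.0, 1.0, 1.0, 0.0, 0.0, 1.0])
distractor = [blend, blend, blend]

whole_same = cosine(whole_image_descriptor(query), whole_image_descriptor(same_place))
whole_distractor = cosine(whole_image_descriptor(query), whole_image_descriptor(distractor))
seg_same = best_segment_similarity(query, same_place)
seg_distractor = best_segment_similarity(query, distractor)

print(f"whole-image: same place {whole_same:.3f}, distractor {whole_distractor:.3f}")
print(f"segment:     same place {seg_same:.3f}, distractor {seg_distractor:.3f}")
```

In this toy setup the whole-image comparison ranks the distractor above the true revisit, while the segment-level comparison ranks the true revisit first because the one shared segment matches exactly. Real descriptors are far noisier, so this shows only the intuition, not expected benchmark behavior.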
Pages: 326-343
Page count: 18