Revisit Anything: Visual Place Recognition via Image Segment Retrieval

Cited by: 0
Authors
Garg, Kartik [1]
Shubodh, Sai [2]
Kolathaya, Shishir [1]
Krishna, Madhava [2]
Garg, Sourav [3]
Affiliations
[1] Indian Inst Sci IISc, Bengaluru, India
[2] Int Inst Informat Technol, Hyderabad, India
[3] Univ Adelaide, Adelaide, SA, Australia
Source
Keywords
Visual Place Recognition; Image Segmentation; Robotics; SCALE;
DOI
10.1007/978-3-031-73113-6_19
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Accurately recognizing a revisited place is crucial for embodied agents to localize and navigate. This requires visual representations to be distinct, despite strong variations in camera viewpoint and scene appearance. Existing visual place recognition pipelines encode the whole image and search for matches. This poses a fundamental challenge in matching two images of the same place captured from different camera viewpoints: the similarity of what overlaps can be dominated by the dissimilarity of what does not overlap. We address this by encoding and searching for image segments instead of the whole images. We propose to use open-set image segmentation to decompose an image into 'meaningful' entities (i.e., things and stuff). This enables us to create a novel image representation as a collection of multiple overlapping subgraphs connecting a segment with its neighboring segments, dubbed SuperSegment. Furthermore, to efficiently encode these SuperSegments into compact vector representations, we propose a novel factorized representation of feature aggregation. We show that retrieving these partial representations leads to significantly higher recognition recall than the typical whole image based retrieval. Our segments-based approach, dubbed SegVLAD, sets a new state-of-the-art in place recognition on a diverse selection of benchmark datasets, while being applicable to both generic and task-specialized image encoders. Finally, we demonstrate the potential of our method to "revisit anything" by evaluating our method on an object instance retrieval task, which bridges the two disparate areas of research: visual place recognition and object-goal navigation, through their common aim of recognizing goal objects specific to a place. Source code: https://github.com/AnyLoc/Revisit-Anything.
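The abstract's idea of describing each segment together with its neighboring segments can be illustrated with a small sketch. This is not the authors' implementation (see the linked repository for that). It assumes per-pixel features, binary segment masks, and a segment adjacency matrix are already available, and it substitutes plain sum pooling for the paper's feature aggregation. The function name `supersegment_descriptors` and its parameters are hypothetical. The factorization shown here is one possible reading: pool pixels into segments once, then combine segments into neighborhoods with a second matrix product.

```python
import numpy as np

def supersegment_descriptors(pixel_feats, seg_masks, adjacency, order=1):
    """Illustrative sketch only (hypothetical helper, not the paper's code).

    pixel_feats: (n_pixels, d) array of per-pixel features.
    seg_masks:   (n_segs, n_pixels) binary array, 1 where a pixel is in a segment.
    adjacency:   (n_segs, n_segs) binary symmetric array of neighboring segments.
    order:       number of hops used to grow each segment's neighborhood.
    Returns an (n_segs, d) array of L2-normalized neighborhood descriptors.
    """
    n_segs = seg_masks.shape[0]
    # reach[i, j] == 1 when segment j lies within `order` hops of segment i
    # (every segment reaches itself).
    reach = np.eye(n_segs, dtype=np.int64)
    hop = np.eye(n_segs, dtype=np.int64) + (adjacency > 0)
    for _ in range(order):
        reach = ((reach @ hop) > 0).astype(np.int64)
    # Step 1: sum-pool pixel features into one vector per segment.
    seg_feats = seg_masks.astype(pixel_feats.dtype) @ pixel_feats
    # Step 2: sum segment vectors over each neighborhood.
    super_feats = reach.astype(pixel_feats.dtype) @ seg_feats
    # L2-normalize so descriptors can be compared by dot product.
    norms = np.linalg.norm(super_feats, axis=1, keepdims=True)
    return super_feats / np.maximum(norms, 1e-12)
```

With `order=0` each descriptor covers a single segment. Larger values of `order` grow the neighborhood, so a descriptor covers more of the image.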
Pages: 326-343
Number of pages: 18