MixVPR: Feature Mixing for Visual Place Recognition

Cited by: 72
Authors
Ali-bey, Amar [1 ]
Chaib-draa, Brahim [1 ]
Giguere, Philippe [1 ]
Affiliations
[1] Univ Laval, Quebec City, PQ, Canada
Keywords
MODEL;
DOI
10.1109/WACV56688.2023.00301
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Visual Place Recognition (VPR) is a crucial part of mobile robotics and autonomous driving as well as other computer vision tasks. It refers to the process of identifying a place depicted in a query image using only computer vision. At large scale, repetitive structures, weather and illumination changes pose a real challenge, as appearances can drastically change over time. Along with tackling these challenges, an efficient VPR technique must also be practical in real-world scenarios where latency matters. To address this, we introduce MixVPR, a new holistic feature aggregation technique that takes feature maps from pre-trained backbones as a set of global features. Then, it incorporates a global relationship between elements in each feature map in a cascade of feature mixing, eliminating the need for local or pyramidal aggregation as done in NetVLAD or TransVPR. We demonstrate the effectiveness of our technique through extensive experiments on multiple large-scale benchmarks. Our method outperforms all existing techniques by a large margin while having less than half the number of parameters compared to CosPlace and NetVLAD. We achieve a new all-time high recall@1 score of 94.6% on Pitts250k-test, 88.0% on MapillarySLS, and more importantly, 58.4% on Nordland. Finally, our method outperforms two-stage retrieval techniques such as Patch-NetVLAD, TransVPR and SuperGLUE all while being orders of magnitude faster.
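The recall@1 figures quoted in the abstract follow the usual retrieval convention: a query counts as a hit if at least one of its top-K retrieved reference images is a true match. The sketch below illustrates that metric only; it is not the authors' evaluation code, and the function name `recall_at_k`, the data layout, and the toy values are all assumptions made for illustration.

```python
from typing import Sequence, Set


def recall_at_k(
    ranked_ids: Sequence[Sequence[int]],
    positives: Sequence[Set[int]],
    k: int = 1,
) -> float:
    """Fraction of queries with at least one true match in the top-k results.

    ranked_ids[i] lists reference-image ids for query i, best match first.
    positives[i] is the set of reference ids considered correct for query i.
    Both layouts are hypothetical, chosen only to keep the example small.
    """
    if len(ranked_ids) != len(positives):
        raise ValueError("ranked_ids and positives must have the same length")
    if not ranked_ids:
        raise ValueError("at least one query is required")

    hits = 0
    for retrieved, correct in zip(ranked_ids, positives):
        # A query is a hit if any of its first k retrieved ids is correct.
        if any(ref_id in correct for ref_id in retrieved[:k]):
            hits += 1
    return hits / len(ranked_ids)


# Toy data (invented): four queries, three retrieved references each.
ranked = [[3, 7, 1], [5, 2, 9], [4, 0, 8], [6, 1, 2]]
truth = [{3}, {2}, {8, 0}, {9}]

r1 = recall_at_k(ranked, truth, k=1)
r2 = recall_at_k(ranked, truth, k=2)
```

In practice the ranked lists would come from a nearest-neighbour search over global descriptors, and the positive sets from a benchmark's ground-truth criterion (for example a distance threshold); both are outside the scope of this sketch.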
Pages: 2997-3006
Number of pages: 10