MixVPR: Feature Mixing for Visual Place Recognition

被引:72
|
作者
Ali-bey, Amar [1 ]
Chaib-draa, Brahim [1 ]
Giguere, Philippe [1 ]
机构
[1] Univ Laval, Quebec City, PQ, Canada
关键词
MODEL;
D O I
10.1109/WACV56688.2023.00301
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Visual Place Recognition (VPR) is a crucial part of mobile robotics and autonomous driving as well as other computer vision tasks. It refers to the process of identifying a place depicted in a query image using only computer vision. At large scale, repetitive structures, weather and illumination changes pose a real challenge, as appearances can drastically change over time. Along with tackling these challenges, an efficient VPR technique must also be practical in real-world scenarios where latency matters. To address this, we introduce MixVPR, a new holistic feature aggregation technique that takes feature maps from pre-trained backbones as a set of global features. Then, it incorporates a global relationship between elements in each feature map in a cascade of feature mixing, eliminating the need for local or pyramidal aggregation as done in NetVLAD or TransVPR. We demonstrate the effectiveness of our technique through extensive experiments on multiple large-scale benchmarks. Our method outperforms all existing techniques by a large margin while having less than half the number of parameters compared to CosPlace and NetVLAD. We achieve a new all-time high recall@1 score of 94.6% on Pitts250k-test, 88.0% on MapillarySLS, and more importantly, 58.4% on Nordland. Finally, our method outperforms two-stage retrieval techniques such as Patch-NetVLAD, TransVPR and SuperGLUE all while being orders of magnitude faster.
引用
收藏
页码:2997 / 3006
页数:10
相关论文
共 50 条
  • [1] Enhancing Visual Place Recognition With Hybrid Attention Mechanisms in MixVPR
    Hu, Jun
    Nie, Jiwei
    Ning, Zuotao
    Feng, Chaolu
    Wang, Luyang
    Li, Jingyao
    Cheng, Shuai
    IEEE ACCESS, 2024, 12 : 159847 - 159859
  • [2] MS-MixVPR: Multi-scale Feature Mixing Approach for Long-Term Place Recognition
    Quach M.-D.
    Vo D.-M.
    Pham H.-A.
    SN Computer Science, 5 (6)
  • [3] MIXVPR++: Enhanced Visual Place Recognition With Hierarchical-Region Feature-Mixer and Adaptive Gabor Texture Fuser
    Nie, Jiwei
    Xue, Dingyu
    Pan, Feng
    Cheng, Shuai
    Liu, Wei
    Hu, Jun
    Ning, Zuotao
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (01): : 580 - 587
  • [4] DINO-Mix enhancing visual place recognition with foundational vision model and feature mixing
    Huang, Gaoshuang
    Zhou, Yang
    Hu, Xiaofei
    Zhang, Chenglong
    Zhao, Luying
    Gan, Wenjian
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [5] Adding Cues to Binary Feature Descriptors for Visual Place Recognition
    Schlegel, Dominik
    Grisetti, Giorgio
    2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 5488 - 5494
  • [6] Unsupervised Feature Learning for Visual Place Recognition in Changing Environments
    Zhao, Dongye
    Si, Bailu
    Tang, Fengzhen
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [7] A Multi-Domain Feature Learning Method for Visual Place Recognition
    Yin, Peng
    Xu, Lingyun
    Li, Xueqian
    Yin, Chen
    Li, Yingli
    Srivatsan, Rangaprasad Arun
    Li, Lu
    Ji, Jianmin
    He, Yuqing
    2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 319 - 324
  • [8] Explicit feature disentanglement for visual place recognition across appearance changes
    Tang, Li
    Wang, Yue
    Tan, Qimeng
    Xiong, Rong
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2021, 18 (06)
  • [9] Place Recognition Based Visual Localization Using LBP Feature and SVM
    Qiao, Yongliang
    Cappelle, Cindy
    Ruichek, Yassine
    ADVANCES IN ARTIFICIAL INTELLIGENCE AND ITS APPLICATIONS, MICAI 2015, PT II, 2015, 9414 : 393 - 404
  • [10] Salient Feature Selection for CNN-Based Visual Place Recognition
    Chen, Yutian
    Gan, Wenyan
    Jiao, Shanshan
    Xu, Youwei
    Feng, Yuntian
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2018, E101D (12) : 3102 - 3107