CoCALC: A Self-Supervised Visual Place Recognition Approach Combining Appearance and Geometric Information

被引:1
|
作者
Li, Kangyu [1 ]
Wang, Xifeng [2 ]
Shi, Leilei [1 ]
Geng, Niuniu [1 ]
机构
[1] Machinery Technol Dev Co Ltd, Beijing, Peoples R China
[2] China Acad Machinery Sci & Technol, Beijing, Peoples R China
关键词
Convolutional neural network; robotic vision; visual place recognition; visual simultaneous localization and mapping; LOOP-CLOSURE DETECTION; ROBOT LOCALIZATION; IMAGE FEATURES; LARGE-SCALE; FAB-MAP; SLAM; EFFICIENT; MODEL; BAGS;
D O I
10.1109/ACCESS.2023.3246803
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Visual place recognition (VPR) is considered among the most complicated tasks in SLAM due to the multiple challenges of drastic variations in both appearance and viewpoint. To address this issue, this article presents a self-supervised and lightweight VPR approach (namely CoCALC) that fully utilizes the appearance and geometric information provided by images. The main thing that makes CoCALC ultra-lightweight (only 0.27 MB) is our use of Depthwise Separable Convolution (DSC), a simple but effective architecture that enables our model to generate a more robust image representation. The network trained specifically for VPR can efficiently extract deep convolutional features from salient image regions that have relatively higher entropy, thereby expanding its applications on resource-limited platforms without GPUs. To further eliminate the negative consequences of the high percent false matches, a novel band-matrix-based geometric check is employed to filter out the incorrect matching of image patches, and the impact of different bandwidths on the recall rate is discussed. Results on several benchmark datasets confirm that the proposed CoCALC can yield state-of-the-art performance and superior generalization with acceptable efficiency.
引用
收藏
页码:17207 / 17217
页数:11
相关论文
共 50 条
  • [1] Incorporating Visual Information in Audio Based Self-Supervised Speaker Recognition
    Cai, Danwei
    Wang, Weiqing
    Li, Ming
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 1422 - 1435
  • [2] Self-supervised Visual Odometry Based on Geometric Consistency
    Song, Rujun
    Liu, Jiaqi
    Liao, Kaisheng
    Xiao, Zhuoling
    Yan, Bo
    2023 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS, 2023,
  • [3] GeoAdapt: Self-Supervised Test-Time Adaptation in LiDAR Place Recognition Using Geometric Priors
    Knights, Joshua
    Hausler, Stephen
    Sridharan, Sridha
    Fookes, Clinton
    Moghadam, Peyman
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (01) : 915 - 922
  • [4] Self-Supervised Domain Calibration and Uncertainty Estimation for Place Recognition
    Lajoie, Pierre-Yves
    Beltrame, Giovanni
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (02) : 792 - 799
  • [5] Self-supervised Geometric Perception
    Yang, Heng
    Dong, Wei
    Carlone, Luca
    Koltun, Vladlen
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 14345 - 14356
  • [6] Self-Supervised Deep Visual Odometry Based on Geometric Attention Model
    Dai, Jiajia
    Gong, Xiaoxi
    Li, Yida
    Wang, Jun
    Wei, Mingqiang
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (03) : 3157 - 3166
  • [7] Robust Self-Supervised Audio-Visual Speech Recognition
    Shi, Bowen
    Hsu, Wei-Ning
    Mohamed, Abdelrahman
    INTERSPEECH 2022, 2022, : 2118 - 2122
  • [8] Self-supervised learning for visual tracking and recognition of human hand
    Wu, Y
    Huang, TS
    SEVENTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-2001) / TWELFTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE (IAAI-2000), 2000, : 243 - 248
  • [9] Geometric Consistency for Self-Supervised End-to-End Visual Odometry
    Iyer, Ganesh
    Murthy, J. Krishna
    Gupta, Gunshi
    Krishna, K. Madhava
    Paull, Liam
    PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, : 380 - 388
  • [10] Contrastive Information Maximization Clustering for Self-Supervised Speaker Recognition
    Fatban, Abderrahim
    Alam, Jahangir
    2024 IEEE CONFERENCE ON ARTIFICIAL INTELLIGENCE, CAI 2024, 2024, : 383 - 388