LCPR: A Multi-Scale Attention-Based LiDAR-Camera Fusion Network for Place Recognition

Cited by: 2
Authors
Zhou, Zijie [1]
Xu, Jingyi [2]
Xiong, Guangming [1]
Ma, Junyi [1, 3]
Affiliations
[1] Beijing Inst Technol, Beijing 100081, Peoples R China
[2] Shanghai Jiao Tong Univ, Shanghai 200240, Peoples R China
[3] HAOMOAI Technol Co Ltd, Beijing 100081, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Laser radar; Point cloud compression; Image coding; Feature extraction; Cameras; Image recognition; Three-dimensional displays; Place recognition; SLAM; sensor fusion; deep learning; DISTINCTIVE IMAGE FEATURES;
DOI
10.1109/LRA.2023.3346753
Chinese Library Classification (CLC) number
TP24 [Robotics];
Discipline classification code
080202 ; 1405 ;
Abstract
Place recognition is one of the most crucial modules for autonomous vehicles to identify places that were previously visited in GPS-invalid environments. Sensor fusion is considered an effective method to overcome the weaknesses of individual sensors. In recent years, multimodal place recognition fusing information from multiple sensors has gathered increasing attention. However, most existing multimodal place recognition methods only use limited field-of-view camera images, which leads to an imbalance between features from different modalities and limits the effectiveness of sensor fusion. In this letter, we present a novel neural network named LCPR for robust multimodal place recognition, which fuses LiDAR point clouds with multi-view RGB images to generate discriminative and yaw-rotation invariant representations of the environment. A multi-scale attention-based fusion module is proposed to fully exploit the panoramic views from different modalities of the environment and their correlations. We evaluate our method on the nuScenes dataset, and the experimental results show that our method can effectively utilize multi-view camera and LiDAR data to improve the place recognition performance while maintaining strong robustness to viewpoint changes.
Pages: 1342-1349
Number of pages: 8
Related papers
50 in total
  • [31] Attention-based multi-scale feature fusion network for myopia grading using optical coherence tomography images
    Huang, Gengyou
    Wen, Yang
    Qian, Bo
    Bi, Lei
    Chen, Tingli
    Sheng, Bin
    VISUAL COMPUTER, 2024, 40 (09): : 6627 - 6638
  • [32] Road Detection through CRF based LiDAR-Camera Fusion
    Gu, Shuo
    Zhang, Yigong
    Tang, Jinhui
    Yang, Jiang
    Kong, Hui
    2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 3832 - 3838
  • [33] Facial Expression Recognition Based on Multi-scale Feature Fusion Convolutional Neural Network and Attention Mechanism
    Wu, Yana
    Jia, Kebin
    Sun, Zhonghua
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2021, PT II, 2021, 13020 : 324 - 335
  • [34] LiDAR-Camera Fusion Based High-Resolution Network for Efficient Road Segmentation
    Huang, Shuhao
    Xiong, Guangming
    Zhu, Baochang
    Gong, Jianwei
    Chen, Huiyan
    PROCEEDINGS OF 2020 3RD INTERNATIONAL CONFERENCE ON UNMANNED SYSTEMS (ICUS), 2020, : 830 - 835
  • [35] AM-MSFF: A Pest Recognition Network Based on Attention Mechanism and Multi-Scale Feature Fusion
    Zhang, Meng
    Yang, Wenzhong
    Chen, Danny
    Fu, Chenghao
    Wei, Fuyuan
    ENTROPY, 2024, 26 (05)
  • [36] Pedestrian Attribute Recognition Algorithm Based on Multi-Scale Attention Network
    Li Na
    Wu Yangyang
    Liu Ying
    Xing Jin
    LASER & OPTOELECTRONICS PROGRESS, 2021, 58 (04)
  • [37] Multi-Scale Bilateral Attention Fusion Network For Pansharpening
    Guo Z.
    Li J.
    Lei J.
    Liu J.
    Zhou S.
    Wang B.
    Kasabov N.K.
    IEEE Transactions on Artificial Intelligence, 2024, 5 (11): : 1 - 15
  • [38] Attention-based Pyramid Aggregation Network for Visual Place Recognition
    Zhu, Yingying
    Wang, Jiong
    Xie, Lingxi
    Zheng, Liang
    PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 99 - 107
  • [39] A Mutual Guidance Attention-Based Multi-Level Fusion Network for Hyperspectral and LiDAR Classification
    Zhang, Tongzhen
    Xiao, Song
    Dong, Wenqian
    Qu, Jiahui
    Yang, Yufei
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [40] Multi-scale fusion visual attention network for facial micro-expression recognition
    Pan, Hang
    Yang, Hongling
    Xie, Lun
    Wang, Zhiliang
    FRONTIERS IN NEUROSCIENCE, 2023, 17