LCPR: A Multi-Scale Attention-Based LiDAR-Camera Fusion Network for Place Recognition

Cited by: 2
Authors
Zhou, Zijie [1]
Xu, Jingyi [2]
Xiong, Guangming [1]
Ma, Junyi [1,3]
Affiliations
[1] Beijing Inst Technol, Beijing 100081, Peoples R China
[2] Shanghai Jiao Tong Univ, Shanghai 200240, Peoples R China
[3] HAOMOAI Technol Co Ltd, Beijing 100081, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Laser radar; Point cloud compression; Image coding; Feature extraction; Cameras; Image recognition; Three-dimensional displays; Place recognition; SLAM; sensor fusion; deep learning; DISTINCTIVE IMAGE FEATURES;
DOI
10.1109/LRA.2023.3346753
Chinese Library Classification number
TP24 [Robotics]
Discipline classification codes
080202; 1405
Abstract
Place recognition is one of the most crucial modules for autonomous vehicles to identify places that were previously visited in GPS-invalid environments. Sensor fusion is considered an effective method to overcome the weaknesses of individual sensors. In recent years, multimodal place recognition fusing information from multiple sensors has gathered increasing attention. However, most existing multimodal place recognition methods only use limited field-of-view camera images, which leads to an imbalance between features from different modalities and limits the effectiveness of sensor fusion. In this letter, we present a novel neural network named LCPR for robust multimodal place recognition, which fuses LiDAR point clouds with multi-view RGB images to generate discriminative and yaw-rotation invariant representations of the environment. A multi-scale attention-based fusion module is proposed to fully exploit the panoramic views from different modalities of the environment and their correlations. We evaluate our method on the nuScenes dataset, and the experimental results show that our method can effectively utilize multi-view camera and LiDAR data to improve the place recognition performance while maintaining strong robustness to viewpoint changes.
Pages: 1342-1349
Number of pages: 8
Related papers
50 in total
  • [1] MSANet: LiDAR-Camera Online Calibration with Multi-Scale Fusion and Attention Mechanisms
    Xiong, Fengguang
    Zhang, Zhiqiang
    Kong, Yu
    Shen, Chaofan
    Hu, Mingyue
    Kuang, Liqun
    Han, Xie
    REMOTE SENSING, 2024, 16 (22)
  • [2] Cross Attention-Based Multi-Scale Convolutional Fusion Network for Hyperspectral and LiDAR Joint Classification
    Ge, Haimiao
    Wang, Liguo
    Pan, Haizhu
    Liu, Yanzhong
    Li, Cheng
    Lv, Dan
    Ma, Huiyu
    REMOTE SENSING, 2024, 16 (21)
  • [3] FE-Fusion-VPR: Attention-Based Multi-Scale Network Architecture for Visual Place Recognition by Fusing Frames and Events
    Hou, Kuanxu
    Kong, Delei
    Jiang, Junjie
    Zhuang, Hao
    Huang, Xinjie
    Fang, Zheng
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (06) : 3526 - 3533
  • [4] Enet-CRF-Lidar: Lidar and Camera Fusion for Multi-Scale Object Recognition
    Deng, Qitian
    Li, Xu
    Ni, Peizhou
    Li, Honghai
    Zheng, Zhiyong
    IEEE ACCESS, 2019, 7 : 174335 - 174344
  • [5] Multi-Scale Spatial Transformer Network for LiDAR-Camera 3D Object Detection
    Wang, Zhifan
    Zhang, Xiaohong
    Wang, Shidong
    Xin, Tong
    Zhang, Haofeng
    Lu, Jianfeng
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021
  • [6] An attention-based multi-scale convolution network for intelligent underwater acoustic signal recognition
    Zhou, Aolong
    Li, Xiaoyong
    Zhang, Wen
    Zhao, Chengwu
    Ren, Kaijun
    Ma, Yanxin
    Song, Junqiang
    OCEAN ENGINEERING, 2023, 287
  • [7] GridDehazeNet: Attention-Based Multi-Scale Network for Image Dehazing
    Liu, Xiaohong
    Ma, Yongrui
    Shi, Zhihao
    Chen, Jun
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019 : 7313 - 7322
  • [8] Channel Attention in LiDAR-camera Fusion for Lane Line Segmentation
    Zhang, Xinyu
    Li, Zhiwei
    Gao, Xin
    Jin, Dafeng
    Li, Jun
    PATTERN RECOGNITION, 2021, 118
  • [9] Multi-scale attention-based lightweight network with dilated convolutions for infrared and visible image fusion
    Li, Fuquan
    Zhou, Yonghui
    Chen, YanLi
    Li, Jie
    Dong, ZhiCheng
    Tan, Mian
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (01) : 705 - 719