MSGFusion: Muti-scale Semantic Guided LiDAR-Camera Fusion for 3D Object Detection

被引:0
|
作者
Zhu, Huming [1 ]
Xue, Yiyu [1 ]
Cheng, Xinyue [1 ]
Hou, Biao [1 ]
机构
[1] Xidian Univ, Sch Artificial Intelligence, Xian, Peoples R China
关键词
3D object detection; multi-modal fusion; multiscale; semantic segmentation;
D O I
10.1109/IJCNN60899.2024.10651407
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
3D object detection is a key technology in automatic driving perception, which can provide the basis for safe and reliable autonomous driving. Aiming at the problem of false positive of low resolution object in point clouds, we present Multi-scale Semantic Guided LiDAR-Camera Fusion for 3D Object Detection(MSGFusion), which deeply fuses the features of image and LiDAR points. Specifically, we design multi-scale DenseFusion, which serially aggregate images features, pointwise features and voxel-wise feature volumes at different scales. At the same time, we design a new Image-based Predicted Keypoint Weighting(I-PKW). It predicts the object points based on the predicted foreground score map. Given the 3D proposals generated by the voxel CNN, we propose RoI-Pillar pooling. It abstracts the feature by aggregating the keypoints in the RoI by pillars. Compared with RoI-grid pooling, pillar-based feature encoding is more consistent with the distribution of fused feature keypoints to accurately regress the classification confidence and bounding box. Extensive experiments on the KITTI dataset show the superiority of MSGFusion.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] A LiDAR-Camera Fusion 3D Object Detection Algorithm
    Liu, Leyuan
    He, Jian
    Ren, Keyan
    Xiao, Zhonghua
    Hou, Yibin
    INFORMATION, 2022, 13 (04)
  • [2] SupFusion: Supervised LiDAR-Camera Fusion for 3D Object Detection
    Qin, Yiran
    Wang, Chaoqun
    Kang, Zijian
    Ma, Ningning
    Li, Zhen
    Zhang, Ruimao
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 21957 - 21967
  • [3] LiDAR-camera fusion: Dual transformer enhancement for 3D object detection
    Chen, Mu
    Liu, Pengfei
    Zhao, Huaici
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 120
  • [4] Rethinking the Late Fusion of LiDAR-Camera Based 3D Object Detection
    Yu, Lehang
    Zhang, Jing
    Liu, Zhong
    Yue, Haosong
    Chen, Weihai
    2024 IEEE 19TH CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, ICIEA 2024, 2024,
  • [5] Cross-Supervised LiDAR-Camera Fusion for 3D Object Detection
    Zuo, Chao Jie
    Gu, Cao Yu
    Guo, Yi Kun
    Miao, Xiao Dong
    IEEE ACCESS, 2025, 13 : 10447 - 10458
  • [6] Detecting as Labeling: Rethinking LiDAR-Camera Fusion in 3D Object Detection
    Huang, Junjie
    Ye, Yun
    Liang, Zhujin
    Shan, Yi
    Du, Dalong
    COMPUTER VISION-ECCV 2024, PT XXII, 2025, 15080 : 439 - 455
  • [7] TransFusion: Robust LiDAR-Camera Fusion for 3D Object Detection with Transformers
    Bai, Xuyang
    Hu, Zeyu
    Zhu, Xinge
    Huang, Qingqiu
    Chen, Yilun
    Fu, Hangbo
    Tai, Chiew-Lan
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 1080 - 1089
  • [8] LiDAR-Camera Fusion in Perspective View for 3D Object Detection in Surface Mine
    Ai, Yunfeng
    Yang, Xue
    Song, Ruiqi
    Cui, Chenglin
    Li, Xinqing
    Cheng, Qi
    Tian, Bin
    Chen, Long
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (02): : 3721 - 3730
  • [9] FGFusion: Fine-Grained Lidar-Camera Fusion for 3D Object Detection
    Yin, Zixuan
    Sun, Han
    Liu, Ningzhong
    Zhou, Huiyu
    Shen, Jiaquan
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT III, 2024, 14427 : 505 - 517
  • [10] Deep structural information fusion for 3D object detection on LiDAR-camera system
    An, Pei
    Liang, Junxiong
    Yu, Kun
    Fang, Bin
    Ma, Jie
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2022, 214