SupFusion: Supervised LiDAR-Camera Fusion for 3D Object Detection

被引:4
|
作者
Qin, Yiran [1 ]
Wang, Chaoqun [1 ]
Kang, Zijian [2 ]
Ma, Ningning [2 ]
Li, Zhen [3 ]
Zhang, Ruimao [1 ]
机构
[1] Chinese Univ Hong Kong, Shenzhen CUHK Shenzhen, Shenzhen Res Inst Big Data, Sch Data Sci, Shenzhen, Peoples R China
[2] NIO, Shanghai, Peoples R China
[3] Chinese Univ Hong Kong, Future Intelligent Network Res Inst, Sch Sci & Engn, Shenzhen CUHK Shenzhen, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1109/ICCV51070.2023.02012
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
LiDAR-Camera fusion-based 3D detection is a critical task for automatic driving. In recent years, many LiDARCamera fusion approaches sprung up and gained promising performances compared with single-modal detectors, but always lack carefully designed and effective supervision for the fusion process. In this paper, we propose a novel training strategy called SupFusion, which provides an auxiliary feature level supervision for effective LiDAR-Camera fusion and significantly boosts detection performance. Our strategy involves a data enhancement method named Polar Sampling, which densifies sparse objects and trains an assistant model to generate high-quality features as the supervision. These features are then used to train the LiDAR-Camera fusion model, where the fusion feature is optimized to simulate the generated high-quality features. Furthermore, we propose a simple yet effective deep fusion module, which contiguously gains superior performance compared with previous fusion methods with SupFusion strategy. In such a manner, our proposal shares the following advantages. Firstly, SupFusion introduces auxiliary feature-level supervision which could boost LiDAR-Camera detection performance without introducing extra inference costs. Secondly, the proposed deep fusion could continuously improve the detector's abilities. Our proposed SupFusion and deep fusion module is plug-and-play, we make extensive experiments to demonstrate its effectiveness. Specifically, we gain around 2% 3D mAP improvements on KITTI benchmark based on multiple LiDAR-Camera 3D detectors. Our code is available at https://github.com/IranQin/SupFusion.
引用
收藏
页码:21957 / 21967
页数:11
相关论文
共 50 条
  • [1] A LiDAR-Camera Fusion 3D Object Detection Algorithm
    Liu, Leyuan
    He, Jian
    Ren, Keyan
    Xiao, Zhonghua
    Hou, Yibin
    [J]. INFORMATION, 2022, 13 (04)
  • [2] LiDAR-camera fusion: Dual transformer enhancement for 3D object detection
    Chen, Mu
    Liu, Pengfei
    Zhao, Huaici
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 120
  • [3] TransFusion: Robust LiDAR-Camera Fusion for 3D Object Detection with Transformers
    Bai, Xuyang
    Hu, Zeyu
    Zhu, Xinge
    Huang, Qingqiu
    Chen, Yilun
    Fu, Hangbo
    Tai, Chiew-Lan
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 1080 - 1089
  • [4] LiDAR-Camera Fusion in Perspective View for 3D Object Detection in Surface Mine
    Ai, Yunfeng
    Yang, Xue
    Song, Ruiqi
    Cui, Chenglin
    Li, Xinqing
    Cheng, Qi
    Tian, Bin
    Chen, Long
    [J]. IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (02): : 3721 - 3730
  • [5] FGFusion: Fine-Grained Lidar-Camera Fusion for 3D Object Detection
    Yin, Zixuan
    Sun, Han
    Liu, Ningzhong
    Zhou, Huiyu
    Shen, Jiaquan
    [J]. PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT III, 2024, 14427 : 505 - 517
  • [6] FusionRCNN: LiDAR-Camera Fusion for Two-Stage 3D Object Detection
    Xu, Xinli
    Dong, Shaocong
    Xu, Tingfa
    Ding, Lihe
    Wang, Jie
    Jiang, Peng
    Song, Liqiang
    Li, Jianan
    [J]. REMOTE SENSING, 2023, 15 (07)
  • [7] Deep structural information fusion for 3D object detection on LiDAR-camera system
    An, Pei
    Liang, Junxiong
    Yu, Kun
    Fang, Bin
    Ma, Jie
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2022, 214
  • [8] DeepFusion: Lidar-Camera Deep Fusion for Multi-Modal 3D Object Detection
    Li, Yingwei
    Yu, Adams Wei
    Meng, Tianjian
    Caine, Ben
    Ngiam, Jiquan
    Peng, Daiyi
    Shen, Junyang
    Lu, Yifeng
    Zhou, Denny
    Le, Quoc, V
    Yuille, Alan
    Tan, Mingxing
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 17161 - 17170
  • [9] URFormer: Unified Representation LiDAR-Camera 3D Object Detection with Transformer
    Zhang, Guoxin
    Xie, Jun
    Liu, Lin
    Wang, Zhepeng
    Yang, Kuihe
    Song, Ziying
    [J]. PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT III, 2024, 14427 : 401 - 413
  • [10] FS-Net: LiDAR-Camera Fusion With Matched Scale for 3D Object Detection in Autonomous Driving
    Zhang, Lei
    Li, Xu
    Tang, Kaichen
    Jiang, Yunzhe
    Yang, Liu
    Zhang, Yonggang
    Chen, Xianyi
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (11) : 12154 - 12165