SGFNet: Segmentation Guided Fusion Network for 3D Object Detection

被引:2
|
作者
Wang, Yunlong [1 ]
Jiang, Kun [1 ]
Wen, Tuopu [1 ]
Jiao, Xinyu [1 ]
Wijaya, Benny [1 ]
Miao, Jinyu [1 ]
Shi, Yining [1 ]
Fu, Zheng [1 ]
Yang, Mengmeng [1 ]
Yang, Diange [1 ]
机构
[1] Tsinghua Univ, Sch Vehicle & Mobil, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
Autonomous vehicle navigation; sensor fusion; deep learning for visual perception;
D O I
10.1109/LRA.2023.3326697
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
The self-driving application requires accurate 3D object detection as it is essential in several tasks, such as path and motion planning. However, up until this point, fusion-based detectors with cameras and LiDAR sensors have always been inferior to LiDAR-only detectors. This can be attributed to the dual scene representation problem caused by the differentiated modality of LiDAR points and images. Moreover, the projection of the image pixels is not guaranteed to reach its point cloud counterparts due to the sparsity of the points, losing image content in the fusion process. Bearing these in mind, we propose Segmentation Guided Fusion Network (SGFNet), an efficient multi-sensor fusion-based 3D object detector. It first separates feature extractions of images and points with unified high-dimensional feature representation through the novel-proposed auxiliary foreground segmentation head, and then projects hierarchical feature maps instead of the raw image pixels onto points to obtain the unified feature map, achieving consistent data modality. Such image feature maps are with several spatial resolutions to keep more image content during the projection process. Finally, the unified feature map is fed into a fusion-based region proposal module and bounding box regression head to generate accurate 3D bounding boxes. Extensive experiments conducted on KITTI and nuScenes datasets demonstrate that SGFNet achieves competitive performance on fusion-based 3D object detection tasks and reports a novel state-of-the-art in terms of 3D average precision metric.
引用
收藏
页码:8239 / 8246
页数:8
相关论文
共 50 条
  • [1] SGF3D: Similarity-guided fusion network for 3D object detection
    Li, Chunzheng
    Wang, Gaihua
    Long, Qian
    Zhou, Zhengshu
    [J]. IMAGE AND VISION COMPUTING, 2024, 142
  • [2] SGF3D: Similarity-guided fusion network for 3D object detection
    Li, Chunzheng
    Wang, Gaihua
    Long, Qian
    Zhou, Zhengshu
    [J]. Image and Vision Computing, 2024, 142
  • [3] A multilevel fusion network for 3D object detection
    Xia, Chunlong
    Wei, Ping
    Wei, Wenwen
    Zheng, Nanning
    [J]. NEUROCOMPUTING, 2021, 437 : 107 - 117
  • [4] A multilevel fusion network for 3D object detection
    Xia, Chunlong
    Wei, Ping
    Wei, Wenwen
    Zheng, Nanning
    [J]. Neurocomputing, 2021, 437 : 107 - 117
  • [5] Globally Guided Progressive Fusion Network for 3D Pancreas Segmentation
    Fang, Chaowei
    Li, Guanbin
    Pan, Chengwei
    Li, Yiming
    Yu, Yizhou
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2019, PT II, 2019, 11765 : 210 - 218
  • [6] Sensor Fusion for Joint 3D Object Detection and Semantic Segmentation
    Meyer, Gregory P.
    Charland, Jake
    Hegde, Darshan
    Laddha, Ankit
    Vallespi-Gonzalez, Carlos
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 1230 - 1237
  • [7] SGFNet: Semantic-Guided Fusion Network for RGB-Thermal Semantic Segmentation
    WangLi, Yike
    Li, Gongyang
    Liu, Zhi
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (12) : 7737 - 7748
  • [8] Transformer guided progressive fusion network for 3D pancreas and pancreatic mass segmentation
    Qu, Taiping
    Li, Xiuli
    Wang, Xiheng
    Deng, Wenyi
    Mao, Li
    He, Ming
    Li, Xiao
    Wang, Yun
    Liu, Zaiyi
    Zhang, Longjiang
    Jin, Zhengyu
    Xue, Huadan
    Yu, Yizhou
    [J]. MEDICAL IMAGE ANALYSIS, 2023, 86
  • [9] MF-Net: Meta Fusion Network for 3D object detection
    Meng, Zhaoxin
    Luo, Guiyang
    Yuan, Quan
    Li, Jinglin
    Yang, Fangchun
    [J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [10] Cascaded Cross-Modality Fusion Network for 3D Object Detection
    Chen, Zhiyu
    Lin, Qiong
    Sun, Jing
    Feng, Yujian
    Liu, Shangdong
    Liu, Qiang
    Ji, Yimu
    Xu, He
    [J]. SENSORS, 2020, 20 (24) : 1 - 14