S2CNet: Semantic and Structure Completion Network for 3D Object Detection

被引:0
|
作者
Shi, Chao [1 ]
Zhang, Chongyang [1 ,2 ]
Luo, Yan [1 ]
Qian, Zefeng [1 ]
Zhao, Muming [3 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China
[2] Shanghai Jiao Tong Univ, AI Inst, MoE Key Lab Artificial Intelligence, Shanghai 200240, Peoples R China
[3] Beijing Forestry Univ, Sch Informat Sci & Technol, Beijing 200240, Peoples R China
关键词
Feature extraction; Semantics; Proposals; Three-dimensional displays; Point cloud compression; Detectors; Object detection; 3D object detection; point cloud; feature completion; autonomous driving;
D O I
10.1109/TITS.2024.3429139
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
LiDAR has become one of the primary 3D object detection sensors in autonomous driving. However, due to the inherent sparsity of point clouds, certain objects exhibit structure incompleteness in occluded and distant areas, which hampers the accurate perception of objects in 3D space. To tackle this challenge, we propose Semantic and Structure Completion Network (S(2)CNet) for 3D object detection. Concretely, we design the Semantic Completion (SeC) module to generate semantic features in Bird's-Eye-View (BEV) space, utilizing a teacher-student paradigm. Notably, we adopt a coarse-to-fine guidance strategy to encourage student network to generate semantic features specifically within foreground regions. This ensures that the student network focuses on the generation of foreground object features. Besides, we introduce an attention-based module to adaptively fuse the generated features and raw features. SeC module faces particular limitation when dealing with objects containing only a few points, in such case, the network is prone to generating low quality proposals with inaccurate localization. Complementary to SeC module, we introduce the Structure Completion (StC) module, in which a group of structural proposals are obtained by traversing most structures in a structure-guided manner, and thus at least one proposal with ground truth similar structure can be guaranteed. Extensive experiments on the KITTI and nuScenes benchmarks demonstrate the effectiveness of our method, especially for the hard setting objects with fewer points.
引用
收藏
页码:17134 / 17146
页数:13
相关论文
共 50 条
  • [21] SCP: SCENE COMPLETION PRE-TRAINING FOR 3D OBJECT DETECTION
    Shan, Yiming
    Xia, Yan
    Chen, Yuhong
    Cremers, Daniel
    GEOSPATIAL WEEK 2023, VOL. 48-1, 2023, : 41 - 46
  • [22] RGBD Based Dimensional Decomposition Residual Network for 3D Semantic Scene Completion
    Li, Jie
    Liu, Yu
    Gong, Dong
    Shi, Qinfeng
    Yuan, Xia
    Zhao, Chunxia
    Reid, Ian
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 7685 - 7694
  • [23] Camera-Based 3D Semantic Scene Completion With Sparse Guidance Network
    Mei, Jianbiao
    Yang, Yu
    Wang, Mengmeng
    Zhu, Junyu
    Ra, Jongwon
    Ma, Yukai
    Li, Laijian
    Liu, Yong
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 5468 - 5481
  • [24] Volatile ZnPhen(S2CNEt(2))(2) and MnPhen(S(2)CNEt(2))(6) complexes and the MnZn(2)Phen(3)(S(2)CNEt(2))(6) phase: Thermal behavior and crystal and molecular structures of two modifications of ZnPhen(S(2)CNEt(2))(2)
    Zemskova, SM
    Glinskaya, LA
    Klevtsova, RF
    Gromilov, SA
    Durasov, VB
    Nadolinnyi, VA
    Larionov, SV
    JOURNAL OF STRUCTURAL CHEMISTRY, 1995, 36 (03) : 484 - 495
  • [25] Two Stream 3D Semantic Scene Completion
    Garbade, Martin
    Chen, Yueh-Tung
    Sawatzky, Johann
    Gall, Juergen
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 416 - 425
  • [26] MonoScene: Monocular 3D Semantic Scene Completion
    Anh-Quan Cao
    de Charette, Raoul
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 3981 - 3991
  • [27] LMSCNet: Lightweight Multiscale 3D Semantic Completion
    Roldao, Luis
    de Charette, Raoul
    Verroust-Blondet, Anne
    2020 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2020), 2020, : 111 - 119
  • [28] A GEOMETRIC CONVOLUTIONAL NEURAL NETWORK FOR 3D OBJECT DETECTION
    Lu, Yawen
    Guo, Qianyu
    Lu, Guoyu
    2019 7TH IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (IEEE GLOBALSIP), 2019,
  • [29] A New Monocular 3D Object Detection with Neural Network
    Hong, Weijie
    Liu, Yiguang
    Zheng, Yunan
    Wang, Ying
    Shi, Xuelei
    PATTERN RECOGNITION AND COMPUTER VISION (PRCV 2018), PT IV, 2018, 11259 : 174 - 185
  • [30] VENet: Voting Enhancement Network for 3D Object Detection
    Xie, Qian
    Lai, Yu-Kun
    Wu, Jing
    Wang, Zhoutao
    Lu, Dening
    Wei, Mingqiang
    Wang, Jun
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 3692 - 3701