S2CNet: Semantic and Structure Completion Network for 3D Object Detection

被引:0
|
作者
Shi, Chao [1 ]
Zhang, Chongyang [1 ,2 ]
Luo, Yan [1 ]
Qian, Zefeng [1 ]
Zhao, Muming [3 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China
[2] Shanghai Jiao Tong Univ, AI Inst, MoE Key Lab Artificial Intelligence, Shanghai 200240, Peoples R China
[3] Beijing Forestry Univ, Sch Informat Sci & Technol, Beijing 200240, Peoples R China
关键词
Feature extraction; Semantics; Proposals; Three-dimensional displays; Point cloud compression; Detectors; Object detection; 3D object detection; point cloud; feature completion; autonomous driving;
D O I
10.1109/TITS.2024.3429139
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
LiDAR has become one of the primary 3D object detection sensors in autonomous driving. However, due to the inherent sparsity of point clouds, certain objects exhibit structure incompleteness in occluded and distant areas, which hampers the accurate perception of objects in 3D space. To tackle this challenge, we propose Semantic and Structure Completion Network (S(2)CNet) for 3D object detection. Concretely, we design the Semantic Completion (SeC) module to generate semantic features in Bird's-Eye-View (BEV) space, utilizing a teacher-student paradigm. Notably, we adopt a coarse-to-fine guidance strategy to encourage student network to generate semantic features specifically within foreground regions. This ensures that the student network focuses on the generation of foreground object features. Besides, we introduce an attention-based module to adaptively fuse the generated features and raw features. SeC module faces particular limitation when dealing with objects containing only a few points, in such case, the network is prone to generating low quality proposals with inaccurate localization. Complementary to SeC module, we introduce the Structure Completion (StC) module, in which a group of structural proposals are obtained by traversing most structures in a structure-guided manner, and thus at least one proposal with ground truth similar structure can be guaranteed. Extensive experiments on the KITTI and nuScenes benchmarks demonstrate the effectiveness of our method, especially for the hard setting objects with fewer points.
引用
收藏
页码:17134 / 17146
页数:13
相关论文
共 50 条
  • [31] 3D Object Completion via Class-Conditional Generative Adversarial Network
    Chen, Yu-Chieh
    Tan, Daniel Stanley
    Cheng, Wen-Huang
    Hua, Kai-Lung
    MULTIMEDIA MODELING, MMM 2019, PT II, 2019, 11296 : 54 - 66
  • [32] Semantic Enabled 3D Object Retrieval
    Zhou, Jiang
    Ma, Xinyu
    MICRO NANO DEVICES, STRUCTURE AND COMPUTING SYSTEMS, 2011, 159 : 128 - 131
  • [33] Semantic Shape and Trajectory Reconstruction for Monocular Cooperative 3D Object Detection
    Cserni, Marton
    Rovid, Andras
    IEEE ACCESS, 2024, 12 : 167153 - 167167
  • [34] Visual-Inertial-Semantic Scene Representation for 3D Object Detection
    Dong, Jingming
    Fei, Xiaohan
    Soatto, Stefano
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 3567 - 3577
  • [35] Fast Two-Stage 3D Object Detection with Semantic Guidance
    Huang Mang
    Hui Bin
    Liu Zhaoji
    Jin Tianming
    LASER & OPTOELECTRONICS PROGRESS, 2024, 61 (12)
  • [36] 3D Object Detection Based on Strong Semantic Key Point Sampling
    Che, Yunlong
    Yuan, Liang
    Sun, Lihui
    Computer Engineering and Applications, 60 (09): : 254 - 260
  • [37] Boosting Lidar 3D Object Detection with Point Cloud Semantic Segmentation
    Zhang, Xuchong
    Min, Chong
    Jia, Yijie
    Chen, Liming
    Zhang, Jingmin
    Sun, Hongbin
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 7614 - 7621
  • [38] Improving Point Cloud Semantic Segmentation by Learning 3D Object Detection
    Unal, Ozan
    Van Gool, Luc
    Dai, Dengxin
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 2949 - 2958
  • [39] An object detection algorithm combining semantic and geometric information of the 3D cloud
    Huang, Zhe
    Wang, Yongcai
    Wen, Jie
    Wang, Peng
    Cai, Xudong
    ADVANCED ENGINEERING INFORMATICS, 2023, 56
  • [40] SEFormer: Structure Embedding Transformer for 3D Object Detection
    Feng, Xiaoyu
    Du, Heming
    Fan, Hehe
    Duan, Yueqi
    Liu, Yongpan
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 1, 2023, : 632 - 640