Paint and Distill: Boosting 3D Object Detection with Semantic Passing Network

被引:7
|
作者
Ju, Bo [1 ]
Zou, Zhikang [1 ]
Ye, Xiaoqing [1 ]
Jiang, Minyue [1 ]
Tan, Xiao [1 ]
Ding, Errui [1 ]
Wang, Jingdong [1 ]
机构
[1] Baidu Inc, Beijing, Peoples R China
关键词
3D Object Detection; Knowledge Distillation; Semantic Passing;
D O I
10.1145/3503161.3547891
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
3D object detection task from lidar or camera sensors is essential for autonomous driving. Pioneer attempts at multi-modality fusion complement the sparse lidar point clouds with rich semantic texture information from images at the cost of extra network designs and overhead. In this work, we propose a novel semantic passing framework, named SPNet, to boost the performance of existing lidar-based 3D detection models with the guidance of rich context painting, with no extra computation cost during inference. Our key design is to first exploit the potential instructive semantic knowledge within the ground-truth labels by training a semantic-painted teacher model and then guide the pure-lidar network to learn the semantic-painted representation via knowledge passing modules at different granularities: class-wise passing, pixel-wise passing and instance-wise passing. Experimental results show that the proposed SPNet can seamlessly cooperate with most existing 3D detection frameworks with 1 similar to 5% AP gain and even achieve new state-of-the-art 3D detection performance on the KITTI test benchmark. Code is available at: https://github.com/jb892/SPNet.
引用
收藏
页码:5639 / 5648
页数:10
相关论文
共 50 条
  • [31] Geometry Uncertainty Projection Network for Monocular 3D Object Detection
    Lu, Yan
    Ma, Xinzhu
    Yang, Lei
    Zhang, Tianzhu
    Liu, Yating
    Chu, Qi
    Yan, Junjie
    Ouyang, Wanli
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 3091 - 3101
  • [32] Image attention transformer network for indoor 3D object detection
    Ren, Keyan
    Yan, Tong
    Hu, Zhaoxin
    Han, Honggui
    Zhang, Yunlu
    SCIENCE CHINA-TECHNOLOGICAL SCIENCES, 2024, 67 (07) : 2176 - 2190
  • [33] Image attention transformer network for indoor 3D object detection
    REN KeYan
    YAN Tong
    HU ZhaoXin
    HAN HongGui
    ZHANG YunLu
    Science China(Technological Sciences), 2024, (07) : 2176 - 2190
  • [34] ARPNET: attention region proposal network for 3D object detection
    Yangyang Ye
    Chi Zhang
    Xiaoli Hao
    Science China Information Sciences, 2019, 62
  • [35] 3D OBJECT DETECTION NETWORK COMBINED WITH POINT CLOUD COMPLETION
    Zhou, Jing
    Yu, Chao
    Zhang, Junchi
    Hu, Yiyu
    JOURNAL OF NONLINEAR AND CONVEX ANALYSIS, 2024, 25 (05) : 789 - 809
  • [36] 3D object detection network based on symmetric shape generation
    Tu X.
    Zheng S.
    Yu S.
    Li W.
    Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument, 2023, 44 (06): : 252 - 263
  • [37] SGFNet: Segmentation Guided Fusion Network for 3D Object Detection
    Wang, Yunlong
    Jiang, Kun
    Wen, Tuopu
    Jiao, Xinyu
    Wijaya, Benny
    Miao, Jinyu
    Shi, Yining
    Fu, Zheng
    Yang, Mengmeng
    Yang, Diange
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (12) : 8239 - 8246
  • [38] A Hierarchical Graph Network for 3D Object Detection on Point Clouds
    Chen, Jintai
    Lei, Biwen
    Song, Qingyu
    Ying, Haochao
    Chen, Danny Z.
    Wu, Jian
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 389 - 398
  • [39] Image attention transformer network for indoor 3D object detection
    REN KeYan
    YAN Tong
    HU ZhaoXin
    HAN HongGui
    ZHANG YunLu
    Science China(Technological Sciences), 2024, 67 (07) : 2176 - 2190
  • [40] Multi-Scale PointPillars 3D Object Detection Network
    Ya, Hang
    Luo, Guiming
    PROCEEDINGS OF THE 2019 IEEE 18TH INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS & COGNITIVE COMPUTING (ICCI*CC 2019), 2019, : 174 - 179