CaKDP: Category-aware Knowledge Distillation and Pruning Framework for Lightweight 3D Object Detection

Authors
Zhang, Haonan [1 ,2 ]
Liu, Longjun [1 ,2 ]
Huang, Yuqi [1 ,2 ]
Yang, Zhao [1 ,2 ]
Lei, Xinyu [1 ,2 ]
Wen, Bihan [3 ]
Affiliations
[1] Xi An Jiao Tong Univ, Natl Engn Res Ctr Visual Informat & Applicat, Natl Key Lab Human Machine Hybrid Augmented Intel, Xian, Peoples R China
[2] Xi An Jiao Tong Univ, Inst Artificial Intelligence & Robot, Xian, Peoples R China
[3] Nanyang Technol Univ, Singapore, Singapore
DOI
10.1109/CVPR52733.2024.01452
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Knowledge distillation (KD) has great potential to accelerate deep neural networks (DNNs) for LiDAR-based 3D detection. However, in most prevailing approaches, suboptimal teacher models and insufficient investigation of student architectures limit the performance gains. To address these issues, we propose a simple yet effective Category-aware Knowledge Distillation and Pruning (CaKDP) framework for compressing 3D detectors. First, CaKDP transfers the knowledge of a two-stage detector to a one-stage student, mitigating the impact of inadequate teacher models. To bridge the gap between these heterogeneous detectors, we investigate their differences and then introduce a student-motivated category-aware KD that aligns the category predictions of each distillation pair. Second, we propose a category-aware pruning scheme to obtain a customizable architecture for the compact student model. The method evaluates the importance of each filter by calculating the category prediction gap before and after removing it, and retains the important filters. Finally, to further improve student performance, a modified IoU-aware refinement module with negligible computation is used to remove redundant false positive predictions. Experiments demonstrate that CaKDP yields a compact detector with high performance. For example, on WOD, CaKDP accelerates CenterPoint by half while boosting L2 mAPH by 1.61%. The code is available at https://github.com/zhnxjtu/CaKDP.
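The pruning criterion described in the abstract (score each filter by how much the category prediction changes when that filter is removed, then keep the highest-scoring filters) can be illustrated with a toy sketch. This is not the authors' implementation: the tiny ReLU-plus-linear-head model, the L1 distance used as the "prediction gap", and the names `predict`, `filter_importance`, and `select_filters` are all assumptions made for illustration. The actual method operates on 3D detector backbones and is in the linked repository.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def predict(x, filters, head, mask):
    """Category probabilities of a toy model (assumed, not the paper's detector).

    Each filter is a weight vector followed by ReLU; a masked-out filter
    contributes a zero activation. `head` holds one weight row per category.
    """
    hidden = [
        max(0.0, sum(w * v for w, v in zip(f, x))) if keep else 0.0
        for f, keep in zip(filters, mask)
    ]
    logits = [sum(h * c for h, c in zip(hidden, row)) for row in head]
    return softmax(logits)


def filter_importance(samples, filters, head):
    """Mean L1 gap between category predictions with and without each filter."""
    full = [True] * len(filters)
    scores = []
    for i in range(len(filters)):
        mask = full.copy()
        mask[i] = False  # remove filter i only
        gap = 0.0
        for x in samples:
            p = predict(x, filters, head, full)
            q = predict(x, filters, head, mask)
            gap += sum(abs(a - b) for a, b in zip(p, q))
        scores.append(gap / len(samples))
    return scores


def select_filters(scores, keep):
    """Indices of the `keep` highest-scoring filters, in ascending order."""
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return sorted(ranked[:keep])
```

In this sketch, a filter whose removal leaves the class probabilities unchanged gets a score of zero and is the first candidate for pruning. The number of filters to keep is a free parameter, which mirrors the "customizable architecture" mentioned in the abstract only loosely.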
Pages: 15331-15341
Page count: 11