DEPTH REMOVAL DISTILLATION FOR RGB-D SEMANTIC SEGMENTATION

被引:7
|
作者
Fang, Tiyu [1 ]
Liang, Zhen [1 ]
Shao, Xiuli [2 ]
Dong, Zihao [1 ]
Li, Jinping [1 ]
机构
[1] Univ Jinan, Sch Informat Sci & Engn, Jinan 250022, Peoples R China
[2] Nankai Univ, Coll Comp Sci, Tianjin 300350, Peoples R China
关键词
RGB-D semantic segmentation; convolutional neural networks; knowledge distillation;
D O I
10.1109/ICASSP43922.2022.9747767
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
RGB-D semantic segmentation is attracting wide attention due to its better performance than conventional RGB methods. However, most of RGB-D semantic segmentation methods need to acquire the real depth information for segmenting RGB images effectively. Therefore, it is extremely challenging to take full advantage of RGB-D semantic segmentation methods for segmenting RGB images without the depth input. To address this challenge, a general depth removal distillation method is proposed to remove depth dependence from RGB-D semantic segmentation model by knowledge distillation, which can be employed to any CNN-based segmentation network structure. Specifically, a depth-aware convolution is adopted to construct the teacher network for getting sufficient knowledge from RGB-D images. Then according to the structure consistency between depth-aware convolution and general convolution, the teacher network is used to transfer the learned knowledge to the student network with general convolutions by sharing parameters. Next, the student network makes up for the lack of depth in manner of learning by RGB images. Meantime, a Variable Temperature Cross Entropy (VTCE) loss function is proposed to further increase the accuracy of the student model by soft target distillation. Extensive experiments on NYUv2 and SUN RGB-D datasets demonstrate the superiority of our proposed approach.
引用
收藏
页码:2405 / 2409
页数:5
相关论文
共 50 条
  • [31] Multi-scale fusion for RGB-D indoor semantic segmentation
    Shiyi Jiang
    Yang Xu
    Danyang Li
    Runze Fan
    [J]. Scientific Reports, 12 (1)
  • [32] SPNet: An RGB-D Sequence Progressive Network for Road Semantic Segmentation
    Zhou, Zhi
    Zhang, Yuhang
    Hua, Guoguang
    Long, Ruijing
    Tian, Shishun
    Zou, Wenbin
    [J]. 2023 IEEE 25TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, MMSP, 2023,
  • [33] RGB-D indoor semantic segmentation network based on wavelet transform
    Runze Fan
    Yuhong Liu
    Shiyi Jiang
    Rongfen Zhang
    [J]. Evolving Systems, 2023, 14 : 981 - 991
  • [34] RGB-D Dual Modal Information Complementary Semantic Segmentation Network
    Wang, Lichun
    Gu, Nana
    Xin, Jianjia
    Wang, Shaofan
    [J]. Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2023, 35 (10): : 1489 - 1499
  • [35] CANet: Co-attention network for RGB-D semantic segmentation
    Zhou, Hao
    Qi, Lu
    Huang, Hai
    Yang, Xu
    Wan, Zhaoliang
    Wen, Xianglong
    [J]. PATTERN RECOGNITION, 2022, 124
  • [36] Triple fusion and feature pyramid decoder for RGB-D semantic segmentation
    Ge, Bin
    Zhu, Xu
    Tang, Zihan
    Xia, Chenxing
    Lu, Yiming
    Chen, Zhuang
    [J]. MULTIMEDIA SYSTEMS, 2024, 30 (05)
  • [37] RGB-D object detection and semantic segmentation for autonomous manipulation in clutter
    Schwarz, Max
    Milan, Anton
    Periyasamy, Arul Selvam
    Behnke, Sven
    [J]. INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2018, 37 (4-5): : 437 - 451
  • [38] A brief survey on RGB-D semantic segmentation using deep learning*
    Wang, Changshuo
    Wang, Chen
    Li, Weijun
    Wang, Haining
    [J]. DISPLAYS, 2021, 70
  • [39] Semantic Segmentation based Dense RGB-D SLAM in Dynamic Environments
    Zhang, Jianbo
    Liu, Yanjie
    Chen, Junguo
    Ma, Liulong
    Jin, Dong
    Chen, Jiao
    [J]. 2019 3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, AUTOMATION AND CONTROL TECHNOLOGIES (AIACT 2019), 2019, 1267
  • [40] RGB-D indoor semantic segmentation network based on wavelet transform
    Fan, Runze
    Liu, Yuhong
    Jiang, Shiyi
    Zhang, Rongfen
    [J]. EVOLVING SYSTEMS, 2023, 14 (06) : 981 - 991