DEPTH REMOVAL DISTILLATION FOR RGB-D SEMANTIC SEGMENTATION

Cited by: 7

Authors:

Fang, Tiyu [1]

Liang, Zhen [1]

Shao, Xiuli [2]

Dong, Zihao [1]

Li, Jinping [1]

Affiliations:

[1] Univ Jinan, Sch Informat Sci & Engn, Jinan 250022, Peoples R China

[2] Nankai Univ, Coll Comp Sci, Tianjin 300350, Peoples R China

Source:

2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2022

Keywords:

RGB-D semantic segmentation; convolutional neural networks; knowledge distillation

DOI:

10.1109/ICASSP43922.2022.9747767

Chinese Library Classification (CLC) number:

O42 [Acoustics]

Discipline classification codes:

070206; 082403

Abstract:

RGB-D semantic segmentation is attracting wide attention because it outperforms conventional RGB methods. However, most RGB-D semantic segmentation methods require real depth information to segment RGB images effectively, which makes it extremely challenging to take full advantage of these methods when no depth input is available. To address this challenge, a general depth removal distillation method is proposed that removes the depth dependence of an RGB-D semantic segmentation model through knowledge distillation and can be applied to any CNN-based segmentation network structure. Specifically, a depth-aware convolution is adopted to construct the teacher network, which acquires sufficient knowledge from RGB-D images. Then, based on the structural consistency between depth-aware convolution and general convolution, the teacher network transfers the learned knowledge to a student network built from general convolutions by sharing parameters. Next, the student network compensates for the missing depth by learning from RGB images. Meanwhile, a Variable Temperature Cross Entropy (VTCE) loss function is proposed to further increase the accuracy of the student model through soft target distillation. Extensive experiments on the NYUv2 and SUN RGB-D datasets demonstrate the superiority of the proposed approach.
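
The abstract does not give the form of the VTCE loss, so the following is only a minimal sketch of the general idea it builds on: soft target distillation, where the student is trained against the teacher's temperature-softened class distribution. The function names and the linear temperature schedule (`linear_temperature`, `t_start`, `t_end`) are hypothetical illustrations and are not taken from the paper.

```python
import math


def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax over a list of logits."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def soft_target_cross_entropy(student_logits, teacher_logits, temperature):
    """Cross entropy between teacher and student distributions,
    both softened with the same temperature (standard soft target
    distillation, not necessarily the paper's exact VTCE form)."""
    p_teacher = softmax(teacher_logits, temperature)
    p_student = softmax(student_logits, temperature)
    return -sum(t * math.log(s) for t, s in zip(p_teacher, p_student))


def linear_temperature(step, total_steps, t_start=4.0, t_end=1.0):
    """Hypothetical schedule (an assumption): anneal the temperature
    linearly from t_start to t_end over training."""
    frac = step / max(total_steps - 1, 1)
    return t_start + frac * (t_end - t_start)


# Per-pixel usage: one logit vector per pixel, one entry per class.
teacher_pixel = [1.0, 2.0, 3.0]
student_pixel = [0.5, 2.5, 2.0]
t = linear_temperature(step=0, total_steps=10)
loss = soft_target_cross_entropy(student_pixel, teacher_pixel, t)
```

In a segmentation setting this loss would be averaged over all pixels; a higher temperature flattens the teacher distribution so that the relative scores of non-target classes contribute more to the gradient.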

Pages: 2405-2409

Page count: 5
