DGPINet-KD: Deep Guided and Progressive Integration Network with Knowledge Distillation for RGB-D Indoor Scene Analysis

被引:1
|
作者
Zhou W. [1 ]
Jian B. [1 ]
Fang M. [1 ]
Dong X. [1 ]
Liu Y. [4 ]
Jiang Q. [5 ]
机构
[1] Technology, Hangzhou
[2] School of Computer Science and Engineering, Nanyang Technological University, Singapore
[3] School of Information Science and Engineering, Ningbo University, Ningbo
关键词
branch attention; Circuits and systems; Computational modeling; Convolution; depth guidance; Feature extraction; indoor scene analysis; knowledge distillation; Logic gates; RGB-D data; Semantic segmentation; Semantics;
D O I
10.1109/TCSVT.2024.3382354
中图分类号
学科分类号
摘要
Significant advancements in RGB-D semantic segmentation have been made owing to the increasing availability of robust depth information. Most researchers have combined depth with RGB data to capture complementary information in images. Although this approach improves segmentation performance, it requires excessive model parameters. To address this problem, we propose DGPINet-KD, a deep-guided and progressive integration network with knowledge distillation (KD) for RGB-D indoor scene analysis. First, we used branching attention and depth guidance to capture coordinated, precise location information and extract more complete spatial information from the depth map to complement the semantic information for the encoded features. Second, we trained the student network (DGPINet-S) with a well-trained teacher network (DGPINet-T) using a multilevel KD. Third, an integration unit was developed to explore the contextual dependencies of the decoding features and to enhance relational KD. Comprehensive experiments on two challenging indoor benchmark datasets, NYUDv2 and SUN RGB-D, demonstrated that DGPINet-KD achieved improved performance in indoor scene analysis tasks compared with existing methods. Notably, on the NYUDv2 dataset, DGPINet-KD (DGPINet-S with KD) achieves a pixel accuracy gain of 1.7% and a class accuracy gain of 2.3% compared with DGPINet-S. In addition, compared with DGPINet-T, the proposed DGPINet-KD (DGPINet-S with KD) utilizes significantly fewer parameters (29.3M) while maintaining accuracy. The source code is available at https://github.com/XUEXIKUAIL/DGPINet. IEEE
引用
下载
收藏
页码:1 / 1
相关论文
共 36 条
  • [1] PGDENet: Progressive Guided Fusion and Depth Enhancement Network for RGB-D Indoor Scene Parsing
    Zhou, Wujie
    Yang, Enquan
    Lei, Jingsheng
    Wan, Jian
    Yu, Lu
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 3483 - 3494
  • [2] Lightweight Dual Stream Network With Knowledge Distillation for RGB-D Scene Parsing
    Zhang, Yuming
    Zhou, Wujie
    Ran, Xiaoxiao
    Fang, Meixin
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 855 - 859
  • [3] FIMKD: Feature-Implicit Mapping Knowledge Distillation for RGB-D Indoor Scene Semantic Segmentation
    Zhejiang University of Science & Technology, School of Information & Electronic Engineering, Hangzhou
    310023, China
    不详
    308232, Singapore
    不详
    430074, China
    不详
    315211, China
    IEEE. Trans. Artif. Intell., 2024, 12 (6488-6499):
  • [4] Morphology-Guided Network via Knowledge Distillation for RGB-D Mirror Segmentation
    Zhou, Wujie
    Cai, Yuqi
    Qiang, Fangfang
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, : 1 - 10
  • [5] DMFNet: Deep Multi-Modal Fusion Network for RGB-D Indoor Scene Segmentation
    Yuan, Jianzhong
    Zhou, Wujie
    Luo, Ting
    IEEE ACCESS, 2019, 7 : 169350 - 169358
  • [6] An Efficient RGB-D Indoor Scene-Parsing Solution via Lightweight Multiflow Intersection and Knowledge Distillation
    Zhou, Wujie
    Zhang, Yuming
    Yan, Weiqing
    Ye, Lv
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2024, 18 (03) : 336 - 345
  • [7] Efficient RGB-D Semantic Segmentation for Indoor Scene Analysis
    Seichter, Daniel
    Koehler, Mona
    Lewandowski, Benjamin
    Wengefeld, Tim
    Gross, Horst-Michael
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 13525 - 13531
  • [8] FRNet: Feature Reconstruction Network for RGB-D Indoor Scene Parsing
    Zhou, Wujie
    Yang, Enquan
    Lei, Jingsheng
    Yu, Lu
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2022, 16 (04) : 677 - 687
  • [9] RGB-D Gate-guided edge distillation for indoor semantic segmentation
    Wenbin Zou
    Yingqing Peng
    Zhengyu Zhang
    Shishun Tian
    Xia Li
    Multimedia Tools and Applications, 2022, 81 : 35815 - 35830
  • [10] RGB-D Gate-guided edge distillation for indoor semantic segmentation
    Zou, Wenbin
    Peng, Yingqing
    Zhang, Zhengyu
    Tian, Shishun
    Li, Xia
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (25) : 35815 - 35830