RGB-D Gate-guided edge distillation for indoor semantic segmentation

被引:7
|
作者
Zou, Wenbin [1 ,2 ]
Peng, Yingqing [1 ,2 ]
Zhang, Zhengyu [1 ,2 ]
Tian, Shishun [1 ,2 ]
Li, Xia [1 ,2 ]
机构
[1] Shenzhen Univ, Coll Elect & Informat Engn, Shenzhen 518060, Peoples R China
[2] Shenzhen Univ, Guangdong Prov Key Lab Intelligent Informat Proc, Shenzhen Key Lab Adv Machine Learning & Applicat, Shenzhen 518060, Peoples R China
基金
中国国家自然科学基金;
关键词
RGB-D Semantic Segmentation; Edge Distillation; Gate; Deep Learning;
D O I
10.1007/s11042-021-11395-w
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Fusing the RGB and depth information can significantly improve the performance of semantic segmentation since the depth data represents the geometric information. In this paper, we propose a novel Gate-guided Edge Distillation (GED) based approach to effectively generate edge-aware features by fusing the RGB and depth data, assisting the high-level semantic prediction. The proposed GED consists of two modules: gated fusion and edge distillation. The gated fusion module adaptively learns the relationship between RGB and depth data to generate complementary features. To address the adverse effects caused by redundant information of edge-aware features, edge distillation module enhances the semantic features of the same object while preserving the discrimination of the semantic features belonging to different objects. Besides, by using distilled edge-aware features as detailed guidance, the proposed edge-guided fusion module effectively fuses with semantic features. In addition, the complementary features are leveraged in multi-level feature fusion module to further enhance detailed information. Extensive experiments on the widely used SUN-RGBD and NYU-Dv2 datasets demonstrate that the proposed approach with ResNet-50 achieves state-of-the-art performance.
引用
收藏
页码:35815 / 35830
页数:16
相关论文
共 50 条
  • [11] RGB-D indoor semantic segmentation network based on wavelet transform
    Fan, Runze
    Liu, Yuhong
    Jiang, Shiyi
    Zhang, Rongfen
    [J]. EVOLVING SYSTEMS, 2023, 14 (06) : 981 - 991
  • [12] AGWNet: Attention-guided adaptive shuffle channel gate warped feature network for indoor scene RGB-D semantic segmentation
    Xiong, Bing
    Peng, Yue
    Zhu, JingKe
    Gu, Jia
    Chen, Zhen
    Qin, Wenjian
    [J]. Displays, 2024, 83
  • [13] SEMANTICS-GUIDED MULTI-LEVEL RGB-D FEATURE FUSION FOR INDOOR SEMANTIC SEGMENTATION
    Li, Yabei
    Zhang, Junge
    Cheng, Yanhua
    Huang, Kaiqi
    Tan, Tieniu
    [J]. 2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 1262 - 1266
  • [14] RGB-D SEMANTIC SEGMENTATION: A REVIEW
    Hu, Yaosi
    Chen, Zhenzhong
    Lin, Weiyao
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW 2018), 2018,
  • [15] RGBxD: Learning depth-weighted RGB patches for RGB-D indoor semantic segmentation
    Cao, Jinming
    Leng, Hanchao
    Cohen-Or, Daniel
    Lischinski, Dani
    Chen, Ying
    Tu, Changhe
    Li, Yangyan
    [J]. NEUROCOMPUTING, 2021, 462 : 568 - 580
  • [16] RGB-D joint modelling with scene geometric information for indoor semantic segmentation
    Liu, Hong
    Wu, Wenshan
    Wang, Xiangdong
    Qian, Yueliang
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (17) : 22475 - 22488
  • [17] RGB-D Semantic Segmentation for Indoor Modeling Using Deep Learning: A Review
    Rached, Ishraq
    Hajji, Rafika
    Landes, Tania
    [J]. RECENT ADVANCES IN 3D GEOINFORMATION SCIENCE, 3D GEOINFO 2023, 2024, : 587 - 604
  • [18] RGB-D joint modelling with scene geometric information for indoor semantic segmentation
    Hong Liu
    Wenshan Wu
    Xiangdong Wang
    Yueliang Qian
    [J]. Multimedia Tools and Applications, 2018, 77 : 22475 - 22488
  • [19] RAFNet: RGB-D attention feature fusion network for indoor semantic segmentation
    Yan, Xingchao
    Hou, Sujuan
    Karim, Awudu
    Jia, Weikuan
    [J]. DISPLAYS, 2021, 70
  • [20] Review on Indoor RGB-D Semantic Segmentation with Deep Convolutional Neural Networks
    Barchid, Sami
    Mennesson, Jose
    Djeraba, Chaabane
    [J]. 2021 INTERNATIONAL CONFERENCE ON CONTENT-BASED MULTIMEDIA INDEXING (CBMI), 2021, : 199 - 202