EFDCNet: Encoding fusion and decoding correction network for RGB-D indoor semantic segmentation

被引:1
|
作者
Chen, Jianlin [1 ,2 ]
Li, Gongyang [1 ,2 ]
Zhang, Zhijiang [1 ,2 ]
Zeng, Dan [1 ,2 ]
机构
[1] Shanghai Univ, Shanghai Inst Adv Commun & Data Sci, Shanghai 200444, Peoples R China
[2] Shanghai Univ, Sch Commun & Informat Engn, Shanghai 200444, Peoples R China
关键词
RGB-D indoor semantic segmentation; Encoding fusion; Decoding correction;
D O I
10.1016/j.imavis.2023.104892
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semantic segmentation is a crucial task in vision measurement systems that involves understanding and segmenting different objects and regions within an image. Over the years, numerous RGB-D semantic segmentation methods have been developed, leveraging the encoder -decoder architecture to achieve outstanding performance. However, existing methods have two main problems that constrain further performance improvement. Firstly, in the encoding stage, existing methods have a weak ability to fuse cross -modal information, and low -quality depth maps can easily lead to poor feature representation. Secondly, in the decoding stage, the upsampling of highlevel semantic information may cause the loss of contextual information, and low-level features from the encoder may bring noises to the decoder through skip connections. To solve these issues, we propose a novel Encoding Fusion and Decoding Correction Network (EFDCNet) for RGB-D indoor semantic segmentation. First, in the encoding stage of EFDCNet, we focus on extracting valuable information from low -quality depth maps, and employ a channel -wise filter to select informative depth features. Additionally, we establish the global dependencies between RGB and depth features via the self -attention mechanism to enhance the cross -modal feature interactions, extracting discriminant and powerful features. Then, in the decoding stage of EFDCNet, we use the highest -level information as semantic guidance to compensate for the upsampling information and filter out noise from the low-level encoder features propagated through the skip connections to the decoder. Extensive experiments conducted on two widely -used RGB-D indoor semantic segmentation datasets demonstrate that the proposed EFDCNet surpasses the performance of relevant state-of-the-art methods. The code is available at https://github.com/ Mark9010/EFDCNet
引用
收藏
页数:11
相关论文
共 50 条
  • [1] RAFNet: RGB-D attention feature fusion network for indoor semantic segmentation
    Yan, Xingchao
    Hou, Sujuan
    Karim, Awudu
    Jia, Weikuan
    [J]. DISPLAYS, 2021, 70
  • [2] Multi-scale fusion for RGB-D indoor semantic segmentation
    Jiang, Shiyi
    Xu, Yang
    Li, Danyang
    Fan, Runze
    [J]. SCIENTIFIC REPORTS, 2022, 12 (01):
  • [3] Multi-scale fusion for RGB-D indoor semantic segmentation
    Shiyi Jiang
    Yang Xu
    Danyang Li
    Runze Fan
    [J]. Scientific Reports, 12 (1)
  • [4] A Fusion Network for Semantic Segmentation Using RGB-D Data
    Yuan, Jiahui
    Zhang, Kun
    Xia, Yifan
    Qi, Lin
    Dong, Junyu
    [J]. NINTH INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2017), 2018, 10615
  • [5] RGB-D indoor semantic segmentation network based on wavelet transform
    Runze Fan
    Yuhong Liu
    Shiyi Jiang
    Rongfen Zhang
    [J]. Evolving Systems, 2023, 14 : 981 - 991
  • [6] RGB-D indoor semantic segmentation network based on wavelet transform
    Fan, Runze
    Liu, Yuhong
    Jiang, Shiyi
    Zhang, Rongfen
    [J]. EVOLVING SYSTEMS, 2023, 14 (06) : 981 - 991
  • [7] Attention-based fusion network for RGB-D semantic segmentation
    Zhong, Li
    Guo, Chi
    Zhan, Jiao
    Deng, JingYi
    [J]. NEUROCOMPUTING, 2024, 608
  • [8] AMCFNet: Asymmetric multiscale and crossmodal fusion network for RGB-D semantic segmentation in indoor service robots
    Zhou, Wujie
    Yue, Yuchun
    Fang, Meixin
    Mao, Shanshan
    Yang, Rongwang
    Yu, Lu
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 97
  • [9] Attention-Aware and Semantic-Aware Network for RGB-D Indoor Semantic Segmentation
    Duan, Li-Juan
    Sun, Qi-Chao
    Qiao, Yuan-Hua
    Chen, Jun-Cheng
    Cui, Guo-Qin
    [J]. Jisuanji Xuebao/Chinese Journal of Computers, 2021, 44 (02): : 275 - 291
  • [10] CMPFFNet: Cross-Modal and Progressive Feature Fusion Network for RGB-D Indoor Scene Semantic Segmentation
    Zhou, Wujie
    Xiao, Yuxiang
    Yan, Weiqing
    Yu, Lu
    [J]. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2023, : 1 - 11