DCANet: Differential convolution attention network for RGB-D semantic segmentation

被引:0
|
作者
Bai, Lizhi [1 ]
Yang, Jun [1 ]
Tian, Chunqi [1 ]
Sun, Yaoru [1 ]
Mao, Maoyu [1 ]
Xu, Yanjun [1 ]
Xu, Weirong [1 ]
机构
[1] Tongji Univ, Dept Comp Sci & Technol, Shanghai 201804, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Semantic segmentation; RGB-D; Differential convolution; Attention; SALIENCY;
D O I
10.1016/j.patcog.2025.111379
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Combining RGB images and their corresponding depth maps in semantic segmentation has proven to be effective in recent years. However, existing RGB-D modal fusion methods either lack non-linear feature fusion abilities or treat both modal images equally, disregarding the intrinsic distribution gap and information loss. In this study, we have observed that depth maps are well-suited for providing fine-grained patterns of objects due to their local depth continuity, while RGB images effectively offer a global view. Based on this observation, we propose a novel module called the pixel Differential Convolution Attention (DCA) module, which takes into account geometric information and local-range correlations for depth data. Additionally, we extend the DCA module to create the Ensemble Differential Convolution Attention (EDCA), which propagates long-range contextual dependencies and seamlessly incorporates spatial distribution for RGB data. The DCA and EDCA modules dynamically adjust convolutional weights based on pixel differences, enabling self-adaptation in the local and long-range contexts, respectively. We construct a two-branch network, named the Differential Convolutional Network (DCANet), using the DCA and EDCA modules to fuse the local and global information from the two-modal data. Asa result, the individual advantages of RGB and depth data are emphasized. Experimental results demonstrate that our DCANet achieves anew state-of-the-art performance for RGB-D semantic segmentation on two challenging benchmark datasets: NYUv2 and SUN-RGBD.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] 2.5D CONVOLUTION FOR RGB-D SEMANTIC SEGMENTATION
    Xing, Yajie
    Wang, Jingbo
    Chen, Xiaokang
    Zeng, Gang
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 1410 - 1414
  • [2] Attention-based fusion network for RGB-D semantic segmentation
    Zhong, Li
    Guo, Chi
    Zhan, Jiao
    Deng, JingYi
    NEUROCOMPUTING, 2024, 608
  • [3] CANet: Co-attention network for RGB-D semantic segmentation
    Zhou, Hao
    Qi, Lu
    Huang, Hai
    Yang, Xu
    Wan, Zhaoliang
    Wen, Xianglong
    PATTERN RECOGNITION, 2022, 124
  • [4] Cross-modal attention fusion network for RGB-D semantic segmentation
    Zhao, Qiankun
    Wan, Yingcai
    Xu, Jiqian
    Fang, Lijin
    NEUROCOMPUTING, 2023, 548
  • [5] CDMANet: central difference mutual attention network for RGB-D semantic segmentation
    Ge, Mengjiao
    Su, Wen
    Gao, Jinfeng
    Jia, Guoqiang
    JOURNAL OF SUPERCOMPUTING, 2025, 81 (01):
  • [6] RAFNet: RGB-D attention feature fusion network for indoor semantic segmentation
    Yan, Xingchao
    Hou, Sujuan
    Karim, Awudu
    Jia, Weikuan
    DISPLAYS, 2021, 70
  • [7] Shape-Aware Convolution with Convolutional Kernel Attention for RGB-D Image Semantic Segmentation
    Zhou, Kun
    Zhang, Zejun
    Tang, Xu
    Xu, Wen
    Xie, Jianxiao
    Tang, Changbing
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2025, E108A (02) : 140 - 148
  • [8] Attention-Aware and Semantic-Aware Network for RGB-D Indoor Semantic Segmentation
    Duan L.-J.
    Sun Q.-C.
    Qiao Y.-H.
    Chen J.-C.
    Cui G.-Q.
    Jisuanji Xuebao/Chinese Journal of Computers, 2021, 44 (02): : 275 - 291
  • [9] Automatic Network Architecture Search for RGB-D Semantic Segmentation
    Wang, Wenna
    Zhuo, Tao
    Zhang, Xiuwei
    Sun, Mingjun
    Yin, Hanlin
    Xing, Yinghui
    Zhang, Yanning
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 3777 - 3786
  • [10] RGB-D SEMANTIC SEGMENTATION: A REVIEW
    Hu, Yaosi
    Chen, Zhenzhong
    Lin, Weiyao
    2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW 2018), 2018,