Multi-scale fusion for RGB-D indoor semantic segmentation

被引:5
|
作者
Jiang, Shiyi [1 ]
Xu, Yang [1 ,2 ]
Li, Danyang [1 ]
Fan, Runze [1 ]
机构
[1] Guizhou Univ, Coll Big Data & Informat Engn, Guiyang 550025, Peoples R China
[2] Guiyang Aluminum Magnesium Design & Res Inst Co L, Guiyang 550009, Peoples R China
来源
SCIENTIFIC REPORTS | 2022年 / 12卷 / 01期
关键词
D O I
10.1038/s41598-022-24836-9
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
In computer vision, convolution and pooling operations tend to lose high-frequency information, and the contour details will also disappear with the deepening of the network, especially in image semantic segmentation. For RGB-D image semantic segmentation, all the effective information of RGB and depth image can not be used effectively, while the form of wavelet transform can retain the low and high frequency information of the original image perfectly. In order to solve the information losing problems, we proposed an RGB-D indoor semantic segmentation network based on multi-scale fusion: designed a wavelet transform fusion module to retain contour details, a nonsubsampled contourlet transform to replace the pooling operation, and a multiple pyramid module to aggregate multi-scale information and context global information. The proposed method can retain the characteristics of multi-scale information with the help of wavelet transform, and make full use of the complementarity of high and low frequency information. As the depth of the convolutional neural network increases without losing the multi-frequency characteristics, the segmentation accuracy of image edge contour details is also improved. We evaluated our proposed efficient method on commonly used indoor datasets NYUv2 and SUNRGB-D, and the results showed that we achieved state-of-the-art performance and real-time inference.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Multi-scale fusion for RGB-D indoor semantic segmentation
    Shiyi Jiang
    Yang Xu
    Danyang Li
    Runze Fan
    Scientific Reports, 12 (1)
  • [2] Transformer fusion for indoor RGB-D semantic segmentation
    Wu, Zongwei
    Zhou, Zhuyun
    Allibert, Guillaume
    Stolz, Christophe
    Demonceaux, Cedric
    Ma, Chao
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 249
  • [3] Feature fusion and context interaction for RGB-D indoor semantic segmentation
    Liu, Heng
    Xie, Wen
    Wang, Shaoxun
    APPLIED SOFT COMPUTING, 2024, 167
  • [4] RDFNet: RGB-D Multi-level Residual Feature Fusion for Indoor Semantic Segmentation
    Park, Seong-Jin
    Hong, Ki-Sang
    Lee, Seungyong
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 4990 - 4999
  • [5] RAFNet: RGB-D attention feature fusion network for indoor semantic segmentation
    Yan, Xingchao
    Hou, Sujuan
    Karim, Awudu
    Jia, Weikuan
    DISPLAYS, 2021, 70
  • [6] SEMANTICS-GUIDED MULTI-LEVEL RGB-D FEATURE FUSION FOR INDOOR SEMANTIC SEGMENTATION
    Li, Yabei
    Zhang, Junge
    Cheng, Yanhua
    Huang, Kaiqi
    Tan, Tieniu
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 1262 - 1266
  • [7] EFDCNet: Encoding fusion and decoding correction network for RGB-D indoor semantic segmentation
    Chen, Jianlin
    Li, Gongyang
    Zhang, Zhijiang
    Zeng, Dan
    IMAGE AND VISION COMPUTING, 2024, 142
  • [8] Temporally Consistent Semantic Segmentation using Spatially Aware Multi-view Semantic Fusion for Indoor RGB-D videos
    Sun, Fengyuan
    Karaoglu, Sezer
    Gevers, Theo
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 4250 - 4259
  • [9] Accurate semantic segmentation of RGB-D images for indoor navigation
    Sharan, Sudeep
    Nauth, Peter
    Dominguez-Jimenez, Juan-Jose
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (06)
  • [10] Efficient RGB-D Semantic Segmentation for Indoor Scene Analysis
    Seichter, Daniel
    Koehler, Mona
    Lewandowski, Benjamin
    Wengefeld, Tim
    Gross, Horst-Michael
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 13525 - 13531