Learning deep cross-scale feature propagation for indoor semantic segmentation

被引:6
|
作者
Huan, Linxi [1 ]
Zheng, Xianwei [1 ]
Tang, Shengjun [2 ]
Gong, Jianya [1 ,3 ]
机构
[1] Wuhan Univ, State Key Lab LIESMARS, Wuhan, Peoples R China
[2] Shenzhen Univ, Sch Architecture & Urban Planning, Shenzhen, Peoples R China
[3] Wuhan Univ, Sch Remote Sensing & Engn, Wuhan, Peoples R China
基金
中国国家自然科学基金;
关键词
Indoor scene parsing; Semantic segmentation; Deep learning; Cross-scale feature propagation; IMAGE; CLASSIFICATION;
D O I
10.1016/j.isprsjprs.2021.03.023
中图分类号
P9 [自然地理学];
学科分类号
0705 ; 070501 ;
摘要
Indoor semantic segmentation is a long-standing vision task that has been recently advanced by convolutional neural networks (CNNs), but this task remains challenging by high occlusion and large scale variation of indoor scenes. Existing CNN-based methods mainly focus on using auxiliary depth data to enrich features extracted from RGB images, hence, they pay less attention to exploiting multi-scale information in exracted features, which is essential for distinguishing objects in highly cluttered indoor scenes. This paper proposes a deep cross-scale feature propagation network (CSNet), to effectively learn and fuse multi-scale features for robust semantic segmentation of indoor scene images. The proposed CSNet is deployed as an encoder-decoder engine. During encoding, the CSNet propagates contextual information across scales and learn discriminative multi-scale features, which are robust to large object scale variation and indoor occlusion. The decoder of CSNet then adaptively integrates the multi-scale encoded features with fusion supervision at all scales to generate target semantic segmentation prediction. Extensive experiments conducted on two challenging benchmarks demonstrate that the CSNet can effectively learn multi-scale representations for robust indoor semantic segmentation, achieving outstanding performance with mIoU scores of 51.5 and 50.8 on NYUDv2 and SUN-RGBD datasets, respectively.
引用
收藏
页码:42 / 53
页数:12
相关论文
共 50 条
  • [41] GSTO: Gated Scale-Transfer Operation for Multi-Scale Feature Learning in Semantic Segmentation
    Wang, Zhuoying
    Wang, Yongtao
    Tang, Zhi
    Li, Yangyan
    Chen, Ying
    Ling, Haibin
    Lin, Weisi
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 7111 - 7118
  • [42] An enhanced underwater fish segmentation method in complex scenes using Swin transformer with cross-scale feature fusion
    Liu, Shue
    Zhao, Siwei
    Wang, Yiying
    Xin, Jiaming
    Li, Dashe
    VISUAL COMPUTER, 2024,
  • [43] Video semantic segmentation via feature propagation with holistic attention
    Wu, Junrong
    Wen, Zongzheng
    Zhao, Sanyuan
    Huang, Kele
    PATTERN RECOGNITION, 2020, 104
  • [44] Special feature on ecosystem engineers: Cross-scale and cross-system perspectives
    Briones, M. J. I.
    FUNCTIONAL ECOLOGY, 2024, 38 (01) : 4 - 7
  • [45] Multichannel Cross-Scale Semantic Coherent Attention Network for Image Inpainting
    Zou C.
    Ye L.
    Mobile Information Systems, 2022, 2022
  • [46] CS-UNet: Cross-scale U-Net with Semantic-position dependencies for retinal vessel segmentation
    Yang, Ying
    Yue, Shengbin
    Quan, Haiyan
    NETWORK-COMPUTATION IN NEURAL SYSTEMS, 2024, 35 (02) : 134 - 153
  • [47] How deep learning is empowering semantic segmentationTraditional and deep learning techniques for semantic segmentation: A comparison
    Uroosa Sehar
    Muhammad Luqman Naseem
    Multimedia Tools and Applications, 2022, 81 : 30519 - 30544
  • [48] DEEP LEARNING FOR SEMANTIC SEGMENTATION OF UAV VIDEOS
    Wang, Yiwen
    Lyn, Ye
    Cao, Yanpeng
    Yang, Michael Ying
    2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 2459 - 2462
  • [49] Deep Dual Learning for Semantic Image Segmentation
    Luo, Ping
    Wang, Guangrun
    Lin, Liang
    Wang, Xiaogang
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2737 - 2745
  • [50] SEMANTIC SEGMENTATION OF TEXT USING DEEP LEARNING
    Lattisi, Tiziano
    Farina, Davide
    Ronchetti, Marco
    COMPUTING AND INFORMATICS, 2022, 41 (01) : 78 - 97