Learning deep cross-scale feature propagation for indoor semantic segmentation

被引:6
|
作者
Huan, Linxi [1 ]
Zheng, Xianwei [1 ]
Tang, Shengjun [2 ]
Gong, Jianya [1 ,3 ]
机构
[1] Wuhan Univ, State Key Lab LIESMARS, Wuhan, Peoples R China
[2] Shenzhen Univ, Sch Architecture & Urban Planning, Shenzhen, Peoples R China
[3] Wuhan Univ, Sch Remote Sensing & Engn, Wuhan, Peoples R China
基金
中国国家自然科学基金;
关键词
Indoor scene parsing; Semantic segmentation; Deep learning; Cross-scale feature propagation; IMAGE; CLASSIFICATION;
D O I
10.1016/j.isprsjprs.2021.03.023
中图分类号
P9 [自然地理学];
学科分类号
0705 ; 070501 ;
摘要
Indoor semantic segmentation is a long-standing vision task that has been recently advanced by convolutional neural networks (CNNs), but this task remains challenging by high occlusion and large scale variation of indoor scenes. Existing CNN-based methods mainly focus on using auxiliary depth data to enrich features extracted from RGB images, hence, they pay less attention to exploiting multi-scale information in exracted features, which is essential for distinguishing objects in highly cluttered indoor scenes. This paper proposes a deep cross-scale feature propagation network (CSNet), to effectively learn and fuse multi-scale features for robust semantic segmentation of indoor scene images. The proposed CSNet is deployed as an encoder-decoder engine. During encoding, the CSNet propagates contextual information across scales and learn discriminative multi-scale features, which are robust to large object scale variation and indoor occlusion. The decoder of CSNet then adaptively integrates the multi-scale encoded features with fusion supervision at all scales to generate target semantic segmentation prediction. Extensive experiments conducted on two challenging benchmarks demonstrate that the CSNet can effectively learn multi-scale representations for robust indoor semantic segmentation, achieving outstanding performance with mIoU scores of 51.5 and 50.8 on NYUDv2 and SUN-RGBD datasets, respectively.
引用
收藏
页码:42 / 53
页数:12
相关论文
共 50 条
  • [21] A cross-scale mixed attention network for smoke segmentation
    Yuan, Feiniu
    Shi, Yu
    Zhang, Lin
    Fang, Yuming
    DIGITAL SIGNAL PROCESSING, 2023, 134
  • [22] Learning Contextual Information for Indoor Semantic Segmentation
    Wang, Jianhua
    Zheng, Chuanxia
    Chen, Weihai
    Wu, Xingming
    PROCEEDINGS OF THE 2016 IEEE 11TH CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA), 2016, : 1639 - 1644
  • [23] CFTNet: Cross-Scale Feature Transfer for Lane Detection
    Zhang, Dawen
    Lu, Tao
    Wang, Jiaming
    Chang, Jun
    2023 THE 6TH INTERNATIONAL CONFERENCE ON ROBOT SYSTEMS AND APPLICATIONS, ICRSA 2023, 2023, : 169 - 175
  • [24] CCANet: Cross-Modality Comprehensive Feature Aggregation Network for Indoor Scene Semantic Segmentation
    Zhang, Zihao
    Yang, Yale
    Hou, Huifang
    Meng, Fanman
    Zhang, Fan
    Xie, Kangzhan
    Zhuang, Chunsheng
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2025, 17 (02) : 366 - 378
  • [25] Cross-scale feature fusion connection for a YOLO detector
    Ruan, Zhongling
    Wang, Hao
    Cao, Jianzhong
    Zhang, Hongbo
    IET COMPUTER VISION, 2022, 16 (02) : 99 - 110
  • [26] How deep learning is empowering semantic segmentation Traditional and deep learning techniques for semantic segmentation: A comparison
    Sehar, Uroosa
    Naseem, Muhammad Luqman
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (21) : 30519 - 30544
  • [27] RGB-D Semantic Segmentation for Indoor Modeling Using Deep Learning: A Review
    Rached, Ishraq
    Hajji, Rafika
    Landes, Tania
    RECENT ADVANCES IN 3D GEOINFORMATION SCIENCE, 3D GEOINFO 2023, 2024, : 587 - 604
  • [28] Indoor/Outdoor Semantic Segmentation Using Deep Learning for Visually Impaired Wheelchair Users
    Mohamed, Elhassan
    Sirlantzis, Konstantinos
    Howells, Gareth
    IEEE ACCESS, 2021, 9 : 147914 - 147932
  • [29] Deep-Learning based Global and Semantic Feature Fusion for Indoor Scene Classification
    Pereira, Ricardo
    Goncalves, Nuno
    Garrote, Luis
    Barros, Tiago
    Lopes, Ana
    Nunes, Urbano J.
    2020 IEEE INTERNATIONAL CONFERENCE ON AUTONOMOUS ROBOT SYSTEMS AND COMPETITIONS (ICARSC 2020), 2020, : 67 - 73
  • [30] C3Net: Cross-Modal Feature Recalibrated, Cross-Scale Semantic Aggregated and Compact Network for Semantic Segmentation of Multi-Modal High-Resolution Aerial Images
    Cao, Zhiying
    Diao, Wenhui
    Sun, Xian
    Lyu, Xiaode
    Yan, Menglong
    Fu, Kun
    REMOTE SENSING, 2021, 13 (03)