Cascaded Feature Network for Semantic Segmentation of RGB-D Images

Cited by: 81
Authors
Lin, Di [1]
Chen, Guangyong [2]
Cohen-Or, Daniel [1, 3]
Heng, Pheng-Ann [2]
Huang, Hui [1, 4]
Affiliations
[1] Shenzhen Univ, Shenzhen, Peoples R China
[2] Chinese Univ Hong Kong, Hong Kong, Hong Kong, Peoples R China
[3] Tel Aviv Univ, Tel Aviv, Israel
[4] SIAT, Shenzhen, Peoples R China
DOI
10.1109/ICCV.2017.147
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Fully convolutional networks (FCNs) have been successfully applied to semantic segmentation of scenes represented as RGB images. Images augmented with a depth channel provide a better understanding of the geometric information of the scene in the image. The question is how best to exploit this additional information to improve segmentation performance. In this paper, we present a neural network with multiple branches for segmenting RGB-D images. Our approach is to use the available depth to split the image into layers with common visual characteristics of objects/scenes, or a common "scene-resolution". We introduce a context-aware receptive field (CaRF), which provides better control over the relevant contextual information of the learned features. Equipped with CaRF, each branch of the network semantically segments regions of similar scene-resolution, leading to a more focused domain that is easier to learn. Furthermore, our network is cascaded, with features from one branch augmenting the features of the adjacent branch. We show that such cascading of features enriches the contextual information of each branch and enhances the overall performance. The accuracy that our network achieves outperforms the state-of-the-art methods on two public datasets.
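The depth-based layering mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not say how the layers are chosen, so the equal-width depth bins, the helper name `split_by_depth`, and the parameter `num_layers` below are all assumptions made for illustration, not the authors' method.

```python
def split_by_depth(depth, num_layers=3):
    """Assign each pixel of a 2D depth map to a layer index.

    Hypothetical helper: equal-width depth bins are an assumption,
    since the abstract does not specify how the layers are chosen.
    """
    flat = [d for row in depth for d in row]
    lo, hi = min(flat), max(flat)
    width = (hi - lo) / num_layers
    if width == 0:
        # Flat depth map: every pixel falls into layer 0.
        return [[0 for _ in row] for row in depth]
    # Clamp so the farthest pixel lands in the last layer, not one past it.
    return [[min(int((d - lo) / width), num_layers - 1) for d in row]
            for row in depth]


# Toy 3x3 depth map: depth grows from the top row to the bottom row.
depth = [[0.5, 1.0, 1.5],
         [2.0, 2.5, 3.0],
         [3.5, 4.0, 4.5]]
layers = split_by_depth(depth, num_layers=3)

# One boolean mask per layer; in a multi-branch network each mask would
# select the pixels that a given branch is responsible for.
masks = [[[idx == k for idx in row] for row in layers] for k in range(3)]
```

In this sketch each pixel belongs to exactly one layer, so the masks partition the image; a real system might instead use overlapping or learned layer boundaries.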
Pages: 1320-1328
Page count: 9
Related papers
50 in total
  • [1] Two-Stage Cascaded Decoder for Semantic Segmentation of RGB-D Images
    Yue, Yuchun
    Zhou, Wujie
    Lei, Jingsheng
    Yu, Lu
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 1115 - 1119
  • [2] Zig-Zag Network for Semantic Segmentation of RGB-D Images
    Lin, Di
    Huang, Hui
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (10) : 2642 - 2655
  • [3] SCN: Switchable Context Network for Semantic Segmentation of RGB-D Images
    Lin, Di
    Zhang, Ruimao
    Ji, Yuanfeng
    Li, Ping
    Huang, Hui
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (03) : 1120 - 1131
  • [4] RAFNet: RGB-D attention feature fusion network for indoor semantic segmentation
    Yan, Xingchao
    Hou, Sujuan
    Karim, Awudu
    Jia, Weikuan
    [J]. DISPLAYS, 2021, 70
  • [5] FGMNet: Feature grouping mechanism network for RGB-D indoor scene semantic segmentation
    Zhang, Yuming
    Zhou, Wujie
    Ye, Lv
    Yu, Lu
    Luo, Ting
    [J]. DIGITAL SIGNAL PROCESSING, 2024, 149
  • [6] Accurate semantic segmentation of RGB-D images for indoor navigation
    Sharan, Sudeep
    Nauth, Peter
    Dominguez-Jimenez, Juan-Jose
    [J]. JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (06)
  • [7] Automatic Network Architecture Search for RGB-D Semantic Segmentation
    Wang, Wenna
    Zhuo, Tao
    Zhang, Xiuwei
    Sun, Mingjun
    Yin, Hanlin
    Xing, Yinghui
    Zhang, Yanning
    [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023 : 3777 - 3786
  • [8] RGB-D SEMANTIC SEGMENTATION: A REVIEW
    Hu, Yaosi
    Chen, Zhenzhong
    Lin, Weiyao
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW 2018), 2018
  • [9] Pixel Difference Convolutional Network for RGB-D Semantic Segmentation
    Yang, Jun
    Bai, Lizhi
    Sun, Yaoru
    Tian, Chunqi
    Mao, Maoyu
    Wang, Guorun
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (03) : 1481 - 1492
  • [10] A Fusion Network for Semantic Segmentation Using RGB-D Data
    Yuan, Jiahui
    Zhang, Kun
    Xia, Yifan
    Qi, Lin
    Dong, Junyu
    [J]. NINTH INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2017), 2018, 10615