Multi-Scale Convolutional Features Network for Semantic Segmentation in Indoor Scenes

被引:2
|
作者
Wang, Yanran [1 ]
Chen, Qingliang [1 ,2 ]
Chen, Shilang [3 ]
Wu, Junjun [3 ]
机构
[1] Jinan Univ, Dept Comp Sci, Guangzhou 510632, Peoples R China
[2] Guangzhou Xuanyuan Res Inst Co Ltd, Guangzhou 510006, Peoples R China
[3] Foshan Univ, Sch Mechatron Engn, Foshan 528000, Peoples R China
来源
IEEE ACCESS | 2020年 / 8卷
基金
中国国家自然科学基金;
关键词
Semantics; Feature extraction; Image segmentation; Convolution; Image edge detection; Service robots; Image resolution; Semantic segmentation; convolutional neural networks (CNN); hidden convolutional features; dilated convolution; indoor service robots;
D O I
10.1109/ACCESS.2020.2993570
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Semantic segmentation is one of the most fundamental techniques for visual intelligence, which plays a vital role for indoor service robotic tasks such as scene understanding, autonomous navigation and dexterous manipulation. However, semantic segmentation of indoor environments poses great challenges for existing segmentation techniques due to the complex overlaps, heavy occlusions and cluttered scenes with objects of different shapes and scales, which may lead to the loss of edge information and insufficient segmentation accuracy. And most of the semantic segmentation networks are very complex and cannot be applied to mobile robot platforms. Thus, it is of significant importance for ensuring as few network parameters as possible while improving the detection of meaningful edges in indoor scenes. In this paper, we present a novel indoor scene semantic segmentation method that can refine the segmentation edges and achieve a balance between accuracy and model complexity for indoor service robots. Our approach systematically incorporates dilated convolution and rich convolutional features from the intermediate layers of Convolutional Neural Networks (CNN), which is based on two motivations: (1) The middle hidden layer of CNN contains a lot of potentially useful information for better edge detection which is, however, no longer present in latter layers in traditional structures. (2) The dilated convolution can change the size of receptive field and obtain multi-scale feature information without losing the resolution and introducing any additional parameters. Thus we propose a new end-to-end Multi-Scale Convolutional Features (MSCF) network to integrate the dilated convolution and rich convolutional features extracted from the intermediate layers of traditional CNN. Finally, the resulting approach is extensively evaluated on the prestigious indoor image datasets of SUN RGB-D and NYUDv2, and shows promising improvements over state-of-the-art baselines, both qualitatively and quantitatively.
引用
收藏
页码:89575 / 89583
页数:9
相关论文
共 50 条
  • [21] Multi-scale sequential network for semantic text segmentation and localization
    Villamizar, Michael
    Canevet, Olivier
    Odobez, Jean-Marc
    PATTERN RECOGNITION LETTERS, 2020, 129 : 63 - 69
  • [22] DNS: A multi-scale deconvolution semantic segmentation network for joint detection and segmentation
    Feng, Ning
    Dong, Le
    Zhang, Qianni
    Zhang, Ning
    Wu, Xi
    Chen, Jianwen
    2018 INTERNATIONAL JOINT CONFERENCE ON METALLURGICAL AND MATERIALS ENGINEERING (JCMME 2018), 2019, 277
  • [23] Semantic Road Segmentation via Multi-scale Ensembles of Learned Features
    Alvarez, Jose M.
    LeCun, Yann
    Gevers, Theo
    Lopez, Antonio M.
    COMPUTER VISION - ECCV 2012, PT II, 2012, 7584 : 586 - 595
  • [24] Multi-Scale Depthwise Separable Convolution for Semantic Segmentation in Street-Road Scenes
    Dai, Yingpeng
    Li, Chenglin
    Su, Xiaohang
    Liu, Hongxian
    Li, Jiehao
    REMOTE SENSING, 2023, 15 (10)
  • [25] Semantic segmentation of autonomous driving scenes based on multi-scale adaptive attention mechanism
    Liu, Danping
    Zhang, Dong
    Wang, Lei
    Wang, Jun
    FRONTIERS IN NEUROSCIENCE, 2023, 17
  • [26] Semantic image segmentation using fully convolutional neural networks with multi-scale images and multi-scale dilated convolutions
    Duc My Vo
    Lee, Sang-Woong
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (14) : 18689 - 18707
  • [27] Semantic image segmentation using fully convolutional neural networks with multi-scale images and multi-scale dilated convolutions
    Duc My Vo
    Sang-Woong Lee
    Multimedia Tools and Applications, 2018, 77 : 18689 - 18707
  • [28] Semantic Segmentation Network Based on Adaptive Attention and Deep Fusion Utilizing a Multi-Scale Dilated Convolutional Pyramid
    Zhao, Shan
    Wang, Zihao
    Huo, Zhanqiang
    Zhang, Fukai
    SENSORS, 2024, 24 (16)
  • [29] Multi-Scale Feature Aggregation Network for Semantic Segmentation of Land Cover
    Shen, Xu
    Weng, Liguo
    Xia, Min
    Lin, Haifeng
    REMOTE SENSING, 2022, 14 (23)
  • [30] Efficient Parallel Multi-Scale Detail and Semantic Encoding Network for Lightweight Semantic Segmentation
    Liu, Xiao
    Shi, Xiuya
    Chen, Lufei
    Qing, Linbo
    Ren, Chao
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 2544 - 2552