Multi-Scale Convolutional Features Network for Semantic Segmentation in Indoor Scenes

被引:2
|
作者
Wang, Yanran [1 ]
Chen, Qingliang [1 ,2 ]
Chen, Shilang [3 ]
Wu, Junjun [3 ]
机构
[1] Jinan Univ, Dept Comp Sci, Guangzhou 510632, Peoples R China
[2] Guangzhou Xuanyuan Res Inst Co Ltd, Guangzhou 510006, Peoples R China
[3] Foshan Univ, Sch Mechatron Engn, Foshan 528000, Peoples R China
来源
IEEE ACCESS | 2020年 / 8卷
基金
中国国家自然科学基金;
关键词
Semantics; Feature extraction; Image segmentation; Convolution; Image edge detection; Service robots; Image resolution; Semantic segmentation; convolutional neural networks (CNN); hidden convolutional features; dilated convolution; indoor service robots;
D O I
10.1109/ACCESS.2020.2993570
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Semantic segmentation is one of the most fundamental techniques for visual intelligence, which plays a vital role for indoor service robotic tasks such as scene understanding, autonomous navigation and dexterous manipulation. However, semantic segmentation of indoor environments poses great challenges for existing segmentation techniques due to the complex overlaps, heavy occlusions and cluttered scenes with objects of different shapes and scales, which may lead to the loss of edge information and insufficient segmentation accuracy. And most of the semantic segmentation networks are very complex and cannot be applied to mobile robot platforms. Thus, it is of significant importance for ensuring as few network parameters as possible while improving the detection of meaningful edges in indoor scenes. In this paper, we present a novel indoor scene semantic segmentation method that can refine the segmentation edges and achieve a balance between accuracy and model complexity for indoor service robots. Our approach systematically incorporates dilated convolution and rich convolutional features from the intermediate layers of Convolutional Neural Networks (CNN), which is based on two motivations: (1) The middle hidden layer of CNN contains a lot of potentially useful information for better edge detection which is, however, no longer present in latter layers in traditional structures. (2) The dilated convolution can change the size of receptive field and obtain multi-scale feature information without losing the resolution and introducing any additional parameters. Thus we propose a new end-to-end Multi-Scale Convolutional Features (MSCF) network to integrate the dilated convolution and rich convolutional features extracted from the intermediate layers of traditional CNN. Finally, the resulting approach is extensively evaluated on the prestigious indoor image datasets of SUN RGB-D and NYUDv2, and shows promising improvements over state-of-the-art baselines, both qualitatively and quantitatively.
引用
收藏
页码:89575 / 89583
页数:9
相关论文
共 50 条
  • [1] Multi-scale Convolutional Neural Network for SAR Image Semantic Segmentation
    Duan, Yiping
    Tao, Xiaoming
    Han, Chaoyi
    Qin, Xiaowei
    Lu, Jianhua
    [J]. 2018 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2018,
  • [2] MANet: Multi-Scale Aware-Relation Network for Semantic Segmentation in Aerial Scenes
    He, Pei
    Jiao, Licheng
    Shang, Ronghua
    Wang, Shuang
    Liu, Xu
    Quan, Dou
    Yang, Kun
    Zhao, Dong
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [3] MFSNet: Enhancing Semantic Segmentation of Urban Scenes with a Multi-Scale Feature Shuffle Network
    Qian, Xiaohong
    Shu, Chente
    Jin, Wuyin
    Yu, Yunxiang
    Yang, Shengying
    [J]. ELECTRONICS, 2024, 13 (01)
  • [4] OUTSIDE: Multi-Scale Semantic Segmentation of Universal Outdoor Scenes
    Gerhardt, Christoph
    Weidner, Florian
    Broll, Wolfgang
    [J]. IEEE MMSP 2021: 2021 IEEE 23RD INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2021,
  • [5] Multi-scale and multi-path cascaded convolutional network for semantic segmentation of colorectal polyps
    Manan, Malik Abdul
    Feng, Jinchao
    Yaqub, Muhammad
    Ahmed, Shahzad
    Imran, Syed Muhammad Ali
    Chuhan, Imran Shabir
    Khan, Haroon Ahmed
    [J]. ALEXANDRIA ENGINEERING JOURNAL, 2024, 105 : 341 - 359
  • [6] Butterfly network: a convolutional neural network with a new architecture for multi-scale semantic segmentation of pedestrians
    Alavianmehr, M. A.
    Helfroush, M. S.
    Danyali, H.
    Tashk, A.
    [J]. JOURNAL OF REAL-TIME IMAGE PROCESSING, 2023, 20 (01)
  • [7] Butterfly network: a convolutional neural network with a new architecture for multi-scale semantic segmentation of pedestrians
    M. A. Alavianmehr
    M. S. Helfroush
    H. Danyali
    A. Tashk
    [J]. Journal of Real-Time Image Processing, 2023, 20
  • [8] LiteMSNet: a lightweight semantic segmentation network with multi-scale feature extraction for urban streetscape scenes
    Li, Lirong
    Ding, Jiang
    Cui, Hao
    Chen, Zhiqiang
    Liao, Guisheng
    [J]. VISUAL COMPUTER, 2024,
  • [9] Dense Multi-Scale Convolutional Network for Plant Segmentation
    Tran, Thi Hoang Yen
    Phan, Tran Dang Khoa
    [J]. IEEE ACCESS, 2023, 11 : 82640 - 82651
  • [10] Multi-Level and Multi-Scale Feature Aggregation Network for Semantic Segmentation in Vehicle-Mounted Scenes
    Liao, Yong
    Liu, Qiong
    [J]. SENSORS, 2021, 21 (09)