Semantic segmentation is one of the most fundamental techniques for visual intelligence and plays a vital role in indoor service-robot tasks such as scene understanding, autonomous navigation, and dexterous manipulation. However, indoor environments pose great challenges for existing segmentation techniques: complex overlaps, heavy occlusions, and cluttered scenes containing objects of varied shapes and scales lead to the loss of edge information and insufficient segmentation accuracy. Moreover, most semantic segmentation networks are too complex to deploy on mobile robot platforms. It is therefore important to keep the number of network parameters as small as possible while improving the detection of meaningful edges in indoor scenes. In this paper, we present a novel indoor scene semantic segmentation method that refines segmentation edges and strikes a balance between accuracy and model complexity for indoor service robots. Our approach systematically combines dilated convolution with the rich convolutional features from the intermediate layers of Convolutional Neural Networks (CNNs), based on two motivations: (1) the intermediate hidden layers of a CNN contain abundant information that is potentially useful for edge detection but is no longer present in the later layers of traditional architectures; (2) dilated convolution can enlarge the receptive field and capture multi-scale feature information without reducing resolution or introducing additional parameters. We therefore propose a new end-to-end Multi-Scale Convolutional Features (MSCF) network that integrates dilated convolution with the rich convolutional features extracted from the intermediate layers of a traditional CNN. Finally, the resulting approach is extensively evaluated on the widely used indoor datasets SUN RGB-D and NYUDv2, and shows promising improvements over state-of-the-art baselines, both qualitatively and quantitatively.
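To make the second motivation concrete, the following minimal PyTorch sketch (illustrative only, not the MSCF implementation; the 64-channel feature map and layer sizes are assumptions) shows that increasing the dilation rate of a 3x3 convolution enlarges its effective receptive field while the output resolution and parameter count stay fixed.

```python
# Minimal sketch of the dilated-convolution property cited in the abstract:
# same kernel size, same parameter count, same output resolution,
# but a larger receptive field as the dilation rate grows.
# (Illustrative code, not the paper's MSCF network.)
import torch
import torch.nn as nn

# Hypothetical intermediate CNN feature map: batch 1, 64 channels, 32x32.
x = torch.randn(1, 64, 32, 32)

for d in (1, 2, 4):
    # padding=d keeps the spatial size of a 3x3 conv with dilation d unchanged.
    conv = nn.Conv2d(64, 64, kernel_size=3, dilation=d, padding=d)
    y = conv(x)
    n_params = sum(p.numel() for p in conv.parameters())
    rf = 2 * d + 1  # a single 3x3 kernel with dilation d spans (2d+1)x(2d+1) pixels
    print(f"dilation={d}: output={tuple(y.shape)}, params={n_params}, receptive_field={rf}x{rf}")
```

Running this prints identical output shapes (1, 64, 32, 32) and identical parameter counts for all three dilation rates, while the receptive field grows from 3x3 to 9x9, which is the property that lets a network aggregate multi-scale context at no extra cost.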