Semantic segmentation with Recurrent Neural Networks on RGB-D videos

被引:0
|
作者
Gao, Chuan [1 ]
Wang, Weihong [1 ]
Chen, Mingxi [1 ]
机构
[1] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing, Peoples R China
关键词
Image semantic segmentation; RNN; SLAM;
D O I
10.1109/cac48633.2019.8996634
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Fully Convolutional Network (FCN) based on convolutional neural network is the mainstream method for image semantic segmentation, but its performance on RGB-D video is not good. In this paper, we propose a new semantic segmentation method for RGB-D video using a Recurrent Neural Network architecture. Our novel method combines visual simultaneous localization and mapping (SLAM) technology to transfer semantic information across frames, so the data association between each frame of RGB-D video can be captured. We also show that the proposed approach allows us to improve over existing method by a comparison study on a RGBD scene dataset.
引用
收藏
页码:1203 / 1207
页数:5
相关论文
共 50 条
  • [1] Multimodal Neural Networks: RGB-D for Semantic Segmentation and Object Detection
    Schneider, Lukas
    Jasch, Manuel
    Froehlich, Bjoern
    Weber, Thomas
    Franke, Uwe
    Pollefeys, Marc
    Raetsch, Matthias
    [J]. IMAGE ANALYSIS, SCIA 2017, PT I, 2017, 10269 : 98 - 109
  • [2] Review on Indoor RGB-D Semantic Segmentation with Deep Convolutional Neural Networks
    Barchid, Sami
    Mennesson, Jose
    Djeraba, Chaabane
    [J]. 2021 INTERNATIONAL CONFERENCE ON CONTENT-BASED MULTIMEDIA INDEXING (CBMI), 2021, : 199 - 202
  • [3] Regularized Fully Convolutional Networks for RGB-D Semantic Segmentation
    Su, Wen
    Wang, Zengfu
    [J]. 2016 30TH ANNIVERSARY OF VISUAL COMMUNICATION AND IMAGE PROCESSING (VCIP), 2016,
  • [4] RGB-D SEMANTIC SEGMENTATION: A REVIEW
    Hu, Yaosi
    Chen, Zhenzhong
    Lin, Weiyao
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW 2018), 2018,
  • [5] Recurrent Convolutional Neural Networks for Object-Class Segmentation of RGB-D Video
    Pavel, Mircea Serban
    Schulz, Hannes
    Behnke, Sven
    [J]. 2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
  • [6] Object class segmentation of RGB-D video using recurrent convolutional neural networks
    Pavel, Mircea Serban
    Schulz, Hannes
    Behnke, Sven
    [J]. NEURAL NETWORKS, 2017, 88 : 105 - 113
  • [8] RGB-D Scene Labeling with Multimodal Recurrent Neural Networks
    Fan, Heng
    Mei, Xue
    Prokhorov, Danil
    Ling, Haibin
    [J]. 2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, : 203 - 211
  • [9] Joining geometric and RGB features for RGB-D semantic segmentation
    Zhang, Shaopeng
    Zhong, Min
    Zeng, Gang
    Gan, Rui
    [J]. 2019 INTERNATIONAL CONFERENCE ON IMAGE AND VIDEO PROCESSING, AND ARTIFICIAL INTELLIGENCE, 2019, 11321
  • [10] Fast Detection of Tomato Sucker Using Semantic Segmentation Neural Networks Based on RGB-D Images
    Giang, Truong Thi Huong
    Khai, Tran Quoc
    Im, Dae-Young
    Ryoo, Young-Jae
    [J]. SENSORS, 2022, 22 (14)