Multi-level graph convolutional recurrent neural network for semantic image segmentation

被引:5
|
作者
Jiang, Dingchao [1 ]
Qu, Hua [1 ]
Zhao, Jihong [2 ]
Zhao, Jianlong [3 ]
Liang, Wei [4 ]
机构
[1] Xi An Jiao Tong Univ, Sch Elect & Informat Engn, Xian, Peoples R China
[2] Xian Univ Posts & Telecommun, Sch Telecommun & Informat Engn, Xian, Peoples R China
[3] Xi An Jiao Tong Univ, Sch Software Engn, Xian, Peoples R China
[4] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha, Peoples R China
关键词
Deep learning; Semantic image segmentation; Graph convolutional recurrent neural network; Multi-level features;
D O I
10.1007/s11235-021-00769-y
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
With the advent of the Internet of Things (IoT) era, many devices have surfaced that capture and generate various visual data. To recognize and extract a meaningful pattern from these visual data, powerful methods are required for different IoT applications. Fortunately, deep convolutional neural networks (CNNs) significantly improve the performance of almost all tasks in computer vision, including semantic image segmentation. However, the feature extraction of CNNs may cause the loss of contextual and spatial information. Moreover, the standard convolutional and pooling layers adopted by most CNN architectures lead to a fixed receptive field, which makes it challenging to deal with multi-scale objects in the image. To remedy these issues of CNNs for semantic image segmentation, this paper proposes a multi-level graph convolutional recurrent neural network (MGCRNN) to combine CNNs and graph neural networks (GNNs) for fusing multi-level features. By applying graph convolutional recurrent neural network (GCRNN), the proposed model acquires a global view of the image and aggregates multi-level contextual and structural information. The experiments verify the ability of GCRNN to obtain a flexible receptive field and learn structure features without losing spatial information. Results of these experiments conducted on the Pascal VOC 2012 and Cityscapes datasets show that the proposed model outperforms baseline approaches and can be competitive with state-of-the-art methods
引用
收藏
页码:563 / 576
页数:14
相关论文
共 50 条
  • [1] Multi-level graph convolutional recurrent neural network for semantic image segmentation
    Dingchao Jiang
    Hua Qu
    Jihong Zhao
    Jianlong Zhao
    Wei Liang
    Telecommunication Systems, 2021, 77 : 563 - 576
  • [2] Multi-level Graph Memory Network Cluster Convolutional Recurrent Network for traffic forecasting
    Sun, Le
    Dai, Wenzhang
    Muhammad, Ghulam
    INFORMATION FUSION, 2024, 105
  • [3] A Multi-level Deep Convolutional Neural Network for Image Emotion Classification
    Wang W.
    Li L.
    Huang J.
    Luo J.
    Xu X.
    Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science), 2019, 47 (06): : 39 - 50
  • [4] Multi-scale Convolutional Neural Network for SAR Image Semantic Segmentation
    Duan, Yiping
    Tao, Xiaoming
    Han, Chaoyi
    Qin, Xiaowei
    Lu, Jianhua
    2018 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2018,
  • [5] A regularized convolutional neural network for semantic image segmentation
    Jia, Fan
    Liu, Jun
    Tai, Xue-Cheng
    ANALYSIS AND APPLICATIONS, 2021, 19 (01) : 147 - 165
  • [6] Lightweight image semantic segmentation based on multi-level feature cascaded network
    Zhou D.-W.
    Tian J.-Y.
    Ma L.-Y.
    Sun X.-X.
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2020, 54 (08): : 1516 - 1524
  • [8] Multi-level Graph Label Propagation for Image Segmentation
    Belizario, Ivar Vargas
    Neto, Joao Batista
    2020 33RD SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI 2020), 2020, : 195 - 202
  • [9] Multi-level disentanglement graph neural network
    Wu, Lirong
    Lin, Haitao
    Xia, Jun
    Tan, Cheng
    Li, Stan Z.
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (11): : 9087 - 9101
  • [10] Multi-level disentanglement graph neural network
    Lirong Wu
    Haitao Lin
    Jun Xia
    Cheng Tan
    Stan Z. Li
    Neural Computing and Applications, 2022, 34 : 9087 - 9101