Multi-level graph convolutional recurrent neural network for semantic image segmentation

被引:5
|
作者
Jiang, Dingchao [1 ]
Qu, Hua [1 ]
Zhao, Jihong [2 ]
Zhao, Jianlong [3 ]
Liang, Wei [4 ]
机构
[1] Xi An Jiao Tong Univ, Sch Elect & Informat Engn, Xian, Peoples R China
[2] Xian Univ Posts & Telecommun, Sch Telecommun & Informat Engn, Xian, Peoples R China
[3] Xi An Jiao Tong Univ, Sch Software Engn, Xian, Peoples R China
[4] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha, Peoples R China
关键词
Deep learning; Semantic image segmentation; Graph convolutional recurrent neural network; Multi-level features;
D O I
10.1007/s11235-021-00769-y
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
With the advent of the Internet of Things (IoT) era, many devices have surfaced that capture and generate various visual data. To recognize and extract a meaningful pattern from these visual data, powerful methods are required for different IoT applications. Fortunately, deep convolutional neural networks (CNNs) significantly improve the performance of almost all tasks in computer vision, including semantic image segmentation. However, the feature extraction of CNNs may cause the loss of contextual and spatial information. Moreover, the standard convolutional and pooling layers adopted by most CNN architectures lead to a fixed receptive field, which makes it challenging to deal with multi-scale objects in the image. To remedy these issues of CNNs for semantic image segmentation, this paper proposes a multi-level graph convolutional recurrent neural network (MGCRNN) to combine CNNs and graph neural networks (GNNs) for fusing multi-level features. By applying graph convolutional recurrent neural network (GCRNN), the proposed model acquires a global view of the image and aggregates multi-level contextual and structural information. The experiments verify the ability of GCRNN to obtain a flexible receptive field and learn structure features without losing spatial information. Results of these experiments conducted on the Pascal VOC 2012 and Cityscapes datasets show that the proposed model outperforms baseline approaches and can be competitive with state-of-the-art methods
引用
收藏
页码:563 / 576
页数:14
相关论文
共 50 条
  • [21] Superpixel Based Graph Convolutional Neural Network for SAR Image Segmentation
    Turkmenli, Ilter
    Aptoula, Erchan
    Kayabol, Koray
    IMAGE AND SIGNAL PROCESSING FOR REMOTE SENSING XXVII, 2021, 11862
  • [22] An Interactive Image Segmentation Method Based on Multi-Level Semantic Fusion
    Zou, Ruirui
    Wang, Qinghui
    Wen, Falin
    Chen, Yang
    Liu, Jiale
    Du, Shaoyi
    Yuan, Chengzhi
    SENSORS, 2023, 23 (14)
  • [23] A New Multi-Channel Deep Convolutional Neural Network for Semantic Segmentation of Remote Sensing Image
    Liu, Wenjie
    Zhang, Yongjun
    Fan, Haisheng
    Zou, Yongjie
    Cui, Zhongwei
    IEEE ACCESS, 2020, 8 : 131814 - 131825
  • [24] Multi-level graph learning network for hyperspectral image classification
    Wan, Sheng
    Pan, Shirui
    Zhong, Shengwei
    Yang, Jie
    Yang, Jian
    Zhan, Yibing
    Gong, Chen
    PATTERN RECOGNITION, 2022, 129
  • [25] Multi-level graph neural network for text sentiment analysis
    Liao, Wenxiong
    Zeng, Bi
    Liu, Jianqi
    Wei, Pengfei
    Cheng, Xiaochun
    Zhang, Weiwen
    COMPUTERS & ELECTRICAL ENGINEERING, 2021, 92
  • [26] Multi-level spatial attention network for image data segmentation
    Guo, Jun
    Jiang, Zhixiong
    Jiang, Dingchao
    INTERNATIONAL JOURNAL OF EMBEDDED SYSTEMS, 2021, 14 (03) : 289 - 299
  • [27] Multi-level dilated residual network for biomedical image segmentation
    Naga Raju Gudhe
    Hamid Behravan
    Mazen Sudah
    Hidemi Okuma
    Ritva Vanninen
    Veli-Matti Kosma
    Arto Mannermaa
    Scientific Reports, 11
  • [28] Multi-level Feature Attention Network for medical image segmentation
    Zhang, Yaning
    Yin, Jianjian
    Gu, Yanhui
    Chen, Yi
    Expert Systems with Applications, 2025, 263
  • [29] Multi-level dilated residual network for biomedical image segmentation
    Gudhe, Naga Raju
    Behravan, Hamid
    Sudah, Mazen
    Okuma, Hidemi
    Vanninen, Ritva
    Kosma, Veli-Matti
    Mannermaa, Arto
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [30] Multi-Level Generative Chaotic Recurrent Network for Image Inpainting
    Chen, Cong
    Abbott, Amos
    Stilwell, Daniel
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 3625 - 3634