A Strip Dilated Convolutional Network for Semantic Segmentation

Cited by: 3
Authors
Zhou, Yan [1]
Zheng, Xihong [1]
Ouyang, Wanli [2]
Li, Baopu [3]
Affiliations
[1] Xiangtan Univ, Sch Automat & Elect Informat, Xiangtan 411105, Peoples R China
[2] Univ Sydney, Sch Elect & Informat, Camperdown, NSW 2006, Australia
[3] Baidu Res USA, Sunnyvale, CA 94089 USA
Funding
National Natural Science Foundation of China
Keywords
Semantic segmentation; Multi-scale contexts; Encoder-decoder; Multi-scale strip pooling module; Strip dilated convolution module; Attention
DOI
10.1007/s11063-022-11048-5
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Segmentation scenes frequently contain a large number of strip-shaped objects, for which conventional square convolutions may capture redundant information. Building on our previously proposed SA-FFNet (Zhou et al. in Neurocomputing 453:50-59, 2021), we study the effect of strip sub-region information extraction on semantic segmentation and propose a new network. The method is conducive to extracting the multi-scale strip objects that often appear in segmentation scenes, and it uses strip dilated convolution to further extract contextual dependencies in other directions. First, we propose a multi-scale strip pooling module that enables the backbone network to obtain multi-scale contexts effectively. Then, we introduce a strip dilated convolution module, which supplements the vertical contexts of the strip pooling by using strip dilated convolution. Finally, we construct a novel network that integrates the two proposed modules. The method explicitly takes the horizontal and vertical contexts of multi-scale strip objects into consideration, so that scene understanding can benefit from long-range dependencies. Experimental results on the widely used PASCAL VOC 2012 and Cityscapes scene analysis benchmark datasets show that the method outperforms existing methods such as OCRNet, DeepLabV3+, and SPNet, both qualitatively and quantitatively.
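The two operations named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `strip_pool` and `strip_dilated_conv` are hypothetical, the kernel weights are fixed rather than learned, only a single scale and a single channel are handled, and zero padding is assumed at the borders.

```python
def strip_pool(feature, axis):
    """Average a 2D feature map along one axis, giving a 1D strip descriptor.

    axis=1 averages each row over its width (one value per row);
    axis=0 averages each column over its height (one value per column).
    """
    height, width = len(feature), len(feature[0])
    if axis == 1:
        return [sum(row) / width for row in feature]
    return [sum(feature[r][c] for r in range(height)) / height
            for c in range(width)]


def strip_dilated_conv(feature, kernel, dilation, vertical=True):
    """Apply a k x 1 (vertical) or 1 x k (horizontal) dilated kernel.

    Taps that fall outside the map contribute zero (zero padding),
    so the output has the same shape as the input.
    """
    height, width = len(feature), len(feature[0])
    centre = len(kernel) // 2
    out = [[0.0] * width for _ in range(height)]
    for r in range(height):
        for c in range(width):
            total = 0.0
            for i, weight in enumerate(kernel):
                # Dilation spreads the kernel taps apart along the strip.
                offset = (i - centre) * dilation
                rr, cc = (r + offset, c) if vertical else (r, c + offset)
                if 0 <= rr < height and 0 <= cc < width:
                    total += weight * feature[rr][cc]
            out[r][c] = total
    return out


feature = [[1, 2, 3],
           [4, 5, 6],
           [7, 8, 9]]
row_strip = strip_pool(feature, axis=1)
col_strip = strip_pool(feature, axis=0)
vertical_context = strip_dilated_conv(feature, [1, 1, 1], dilation=2)
```

In this toy example the row and column strips summarise horizontal and vertical context respectively, while the vertical dilated kernel lets each position see rows two steps away without enlarging the kernel.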
Pages: 4439-4459
Number of pages: 21