Semantic segmentation with cross convolution and multi-layer feature refinement

被引:1
|
作者
Ma, Yingdong [1 ]
Jing, Nan [1 ]
机构
[1] Inner Mongolia Univ, Coll Comp Sci, 235,West Daxue Rd, Hohhot, Peoples R China
关键词
Semantic segmentation; Cross convolution; Multi-scale context; Feature fusion; NETWORK;
D O I
10.1016/j.jvcir.2023.103971
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multi-level features and contextual information have been proven effective in semantic segmentation. As a common solution, dilated convolution with multiple dilation rates is widely adopted by various computer vision tasks. However, since a large number of pixels are not involved in convolutional calculation, dilated convolution suffers from the information loss problem. Another important aspect for image segmentation is that most feature fusion methods combine different level features directly, which ignores the semantic gap between shallow layer features and deep layer features. In this work, we propose the multi-scale cross convolution to alleviate the information loss problem. Cross convolution conducts convolutional operations in horizontal and vertical di-rection with different kernels. By combining cross convolution with dilated convolutions using different convolution kernels, more pixels are engaged in convolutional operation to capture multi-scale features. To address the issue of semantic gap between multi-layer features, a feature fusion scheme is developed in which a dual attention mechanism is applied to conduct feature refinement in both spatial and channel dimensions. Comprehensive experiments are conducted to evaluate the proposed method on Cityscapes and ADE20K datasets. Experimental results demonstrate that the cross convolution and feature fusion method improve segmentation performance significantly and achieve competitive performance over state-of-the-art approaches.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Multi-layer Adaptive Feature Fusion for Semantic Segmentation
    Yizhen Chen
    Haifeng Hu
    [J]. Neural Processing Letters, 2020, 51 : 1081 - 1092
  • [2] Multi-layer Adaptive Feature Fusion for Semantic Segmentation
    Chen, Yizhen
    Hu, Haifeng
    [J]. Neural Processing Letters, 2020, 51 (02): : 1081 - 1092
  • [3] Multi-layer Adaptive Feature Fusion for Semantic Segmentation
    Chen, Yizhen
    Hu, Haifeng
    [J]. NEURAL PROCESSING LETTERS, 2020, 51 (02) : 1081 - 1092
  • [4] MLAttack: Fooling Semantic Segmentation Networks by Multi-layer Attacks
    Gupta, Puneet
    Rahtu, Esa
    [J]. PATTERN RECOGNITION, DAGM GCPR 2019, 2019, 11824 : 401 - 413
  • [5] Semantic Segmentation of Polarimetric Synthetic Aperture Radar Images Based on Multi-Layer Deep Feature Fusion
    Hu T.
    Li W.
    Qin X.
    [J]. Zhongguo Jiguang/Chinese Journal of Lasers, 2019, 46 (02):
  • [6] Semantic Segmentation of Polarimetric Synthetic Aperture Radar Images Based on Multi-Layer Deep Feature Fusion
    Hu Tao
    Li Weihua
    Qin Xianxiang
    [J]. CHINESE JOURNAL OF LASERS-ZHONGGUO JIGUANG, 2019, 46 (02):
  • [7] Multi-layer Feature Fusion Network with Atrous Convolution for Pedestrian Detection
    Li, You
    Zhang, Qingxuan
    Zhang, Yulei
    [J]. 2019 3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, AUTOMATION AND CONTROL TECHNOLOGIES (AIACT 2019), 2019, 1267
  • [8] Gated Multi-Layer Fusion for Real-Time Semantic Segmentation
    Zhang C.
    Cheng Q.
    Li Z.
    Wang Z.
    [J]. Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2020, 32 (09): : 1442 - 1449
  • [9] Fast semantic segmentation network with attention gate and multi-layer fusion
    Yanping Tang
    Canlong Zhang
    Qinghe Cheng
    Zhixin Li
    Luyang Qian
    [J]. Multimedia Tools and Applications, 2022, 81 : 21547 - 21562
  • [10] Fast semantic segmentation network with attention gate and multi-layer fusion
    Tang, Yanping
    Zhang, Canlong
    Cheng, Qinghe
    Li, Zhixin
    Qian, Luyang
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (15) : 21547 - 21562