Semantic segmentation with cross convolution and multi-layer feature refinement

Cited by: 1

Authors:

Ma, Yingdong [1]

Jing, Nan [1]

Affiliations:

[1] Inner Mongolia Univ, Coll Comp Sci, 235 West Daxue Rd, Hohhot, Peoples R China

Source:

JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION | 2023 / Vol. 97

Keywords:

Semantic segmentation; Cross convolution; Multi-scale context; Feature fusion; NETWORK;

DOI:

10.1016/j.jvcir.2023.103971

Chinese Library Classification:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

Multi-level features and contextual information have been proven effective in semantic segmentation. As a common solution, dilated convolution with multiple dilation rates is widely adopted in various computer vision tasks. However, since a large number of pixels are not involved in the convolutional calculation, dilated convolution suffers from the information loss problem. Another important aspect of image segmentation is that most feature fusion methods combine features from different levels directly, which ignores the semantic gap between shallow-layer features and deep-layer features. In this work, we propose the multi-scale cross convolution to alleviate the information loss problem. Cross convolution conducts convolutional operations in the horizontal and vertical directions with different kernels. By combining cross convolution with dilated convolutions using different convolution kernels, more pixels are engaged in the convolutional operation to capture multi-scale features. To address the issue of the semantic gap between multi-layer features, a feature fusion scheme is developed in which a dual attention mechanism is applied to conduct feature refinement in both spatial and channel dimensions. Comprehensive experiments are conducted to evaluate the proposed method on the Cityscapes and ADE20K datasets. Experimental results demonstrate that the cross convolution and feature fusion method improve segmentation performance significantly and achieve competitive performance compared with state-of-the-art approaches.
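The two ideas named in the abstract can be illustrated with a small pure-Python sketch. It is not the authors' implementation. The abstract does not say how the horizontal and vertical responses are combined, so summation is assumed here. The function names (`conv1d_rows`, `conv1d_cols`, `cross_conv`, `dilated_coverage`) and the zero-padding choice are also hypothetical.

```python
def conv1d_rows(image, kernel):
    """Horizontal pass: slide a 1 x k kernel along each row.

    Uses zero padding and no kernel flip (cross-correlation),
    the convention of most deep learning frameworks.
    """
    pad = len(kernel) // 2
    height, width = len(image), len(image[0])
    out = [[0.0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            acc = 0.0
            for i, weight in enumerate(kernel):
                xx = x + i - pad
                if 0 <= xx < width:  # outside the image counts as zero
                    acc += weight * image[y][xx]
            out[y][x] = acc
    return out


def conv1d_cols(image, kernel):
    """Vertical pass: a k x 1 kernel, computed by transposing twice."""
    transposed = [list(col) for col in zip(*image)]
    result = conv1d_rows(transposed, kernel)
    return [list(row) for row in zip(*result)]


def cross_conv(image, h_kernel, v_kernel):
    """Combine the two passes; summation is an assumption of this sketch."""
    h = conv1d_rows(image, h_kernel)
    v = conv1d_cols(image, v_kernel)
    return [[a + b for a, b in zip(h_row, v_row)] for h_row, v_row in zip(h, v)]


def dilated_coverage(rate, k=3):
    """Fraction of the receptive field that a k x k dilated kernel samples."""
    span = (k - 1) * rate + 1  # side length of the receptive field
    return (k * k) / (span * span)
```

With a dilation rate of 2, `dilated_coverage(2)` gives 9/25 = 0.36, so roughly two thirds of the pixels inside the receptive field take no part in the calculation. This is the information loss the abstract refers to; the row and column passes in `cross_conv` touch every pixel along the two axes through the center.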


Pages: 11

50 items in total

[1] Multi-layer Adaptive Feature Fusion for Semantic Segmentation
Yizhen Chen
Haifeng Hu
[J]. Neural Processing Letters, 2020, 51 : 1081 - 1092
[2] Multi-layer Adaptive Feature Fusion for Semantic Segmentation
Chen, Yizhen
Hu, Haifeng
[J]. Neural Processing Letters, 2020, 51 (02) : 1081 - 1092
[3] Multi-layer Adaptive Feature Fusion for Semantic Segmentation
Chen, Yizhen
Hu, Haifeng
[J]. NEURAL PROCESSING LETTERS, 2020, 51 (02) : 1081 - 1092
[4] MLAttack: Fooling Semantic Segmentation Networks by Multi-layer Attacks
Gupta, Puneet
Rahtu, Esa
[J]. PATTERN RECOGNITION, DAGM GCPR 2019, 2019, 11824 : 401 - 413
[5] Semantic Segmentation of Polarimetric Synthetic Aperture Radar Images Based on Multi-Layer Deep Feature Fusion
Hu T.
Li W.
Qin X.
[J]. Zhongguo Jiguang/Chinese Journal of Lasers, 2019, 46 (02):
[6] Semantic Segmentation of Polarimetric Synthetic Aperture Radar Images Based on Multi-Layer Deep Feature Fusion
Hu Tao
Li Weihua
Qin Xianxiang
[J]. CHINESE JOURNAL OF LASERS-ZHONGGUO JIGUANG, 2019, 46 (02):
[7] Multi-layer Feature Fusion Network with Atrous Convolution for Pedestrian Detection
Li, You
Zhang, Qingxuan
Zhang, Yulei
[J]. 2019 3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, AUTOMATION AND CONTROL TECHNOLOGIES (AIACT 2019), 2019, 1267
[8] Gated Multi-Layer Fusion for Real-Time Semantic Segmentation
Zhang C.
Cheng Q.
Li Z.
Wang Z.
[J]. Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2020, 32 (09) : 1442 - 1449
[9] Fast semantic segmentation network with attention gate and multi-layer fusion
Yanping Tang
Canlong Zhang
Qinghe Cheng
Zhixin Li
Luyang Qian
[J]. Multimedia Tools and Applications, 2022, 81 : 21547 - 21562
[10] Fast semantic segmentation network with attention gate and multi-layer fusion
Tang, Yanping
Zhang, Canlong
Cheng, Qinghe
Li, Zhixin
Qian, Luyang
[J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (15) : 21547 - 21562
