Multi-Direction Convolution for Semantic Segmentation

Citations: 0
Authors
Li, Dehui [1]
Cao, Zhiguo [1]
Xian, Ke [1]
Qi, Xinyuan [1]
Zhang, Chao [1]
Lu, Hao [2]
Affiliations
[1] Huazhong Univ Sci & Technol, Sch Artificial Intelligence & Automat, Natl Key Lab Sci & Technol Multispectral Informat, Wuhan 430074, Peoples R China
[2] Univ Adelaide, Sch Comp Sci, Adelaide, SA, Australia
Funding
National Natural Science Foundation of China
DOI
10.1109/ICPR48806.2021.9413174
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Context is known to be one of the crucial factors affecting the performance of semantic segmentation. However, state-of-the-art segmentation models built upon fully convolutional networks are inherently weak at encoding contextual information because they stack local operations such as convolution and pooling. Failing to capture context leads to inferior segmentation performance. Although many context modules have been proposed to relieve this problem, they still operate in a local manner or use the same contextual information at different positions (due to upsampling). In this paper, we introduce the idea of Multi-Direction Convolution (MDC), a novel operator capable of encoding rich contextual information. This operator is inspired by the observation that the standard convolution only slides along the spatial dimensions (x, y directions) while the channel dimension (z direction) is fixed, which results in slow growth of the receptive field (RF). If the channel-fixed convolution is considered one-direction, MDC is multi-direction in the sense that it slides along both spatial and channel dimensions, i.e., it slides along x, y when z is fixed, along x, z when y is fixed, and along y, z when x is fixed. In this way, MDC is able to encode rich contextual information with a fast increase of the RF. Compared to existing context modules, the encoded context is position-sensitive because no upsampling is required. MDC is also efficient and easy to implement: it can be implemented with a few standard convolution layers with permutation. We show through extensive experiments that MDC effectively and selectively enlarges the RF and outperforms existing contextual modules on two standard benchmarks, Cityscapes and PASCAL VOC2012.
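The "standard convolution plus permutation" idea in the abstract can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation. The helper names (`conv2d_same`, `permute`, `slide`, `multi_direction_conv`), the single shared kernel, the zero padding, and the fusion of the three branches by element-wise summation are all assumptions made for illustration; the abstract does not say how the branches are parameterized or combined.

```python
def conv2d_same(plane, kernel):
    """Zero-padded 'same' 2D cross-correlation of one 2D slice (odd-sized kernel)."""
    h, w = len(plane), len(plane[0])
    kh, kw = len(kernel), len(kernel[0])
    ph, pw = kh // 2, kw // 2
    out = [[0.0] * w for _ in range(h)]
    for i in range(h):
        for j in range(w):
            acc = 0.0
            for a in range(kh):
                for b in range(kw):
                    ii, jj = i + a - ph, j + b - pw
                    if 0 <= ii < h and 0 <= jj < w:
                        acc += plane[ii][jj] * kernel[a][b]
            out[i][j] = acc
    return out


def permute(t, order):
    """Reorder the axes of a 3D nested list: output axis n is input axis order[n]."""
    dims = (len(t), len(t[0]), len(t[0][0]))
    d0, d1, d2 = (dims[k] for k in order)
    out = [[[0.0] * d2 for _ in range(d1)] for _ in range(d0)]
    for i in range(dims[0]):
        for j in range(dims[1]):
            for k in range(dims[2]):
                idx = (i, j, k)
                out[idx[order[0]]][idx[order[1]]][idx[order[2]]] = t[i][j][k]
    return out


def slide(t, kernel, order):
    """Permute t, filter every 2D slice along the new first axis, then undo the permutation."""
    permuted = permute(t, order)
    filtered = [conv2d_same(plane, kernel) for plane in permuted]
    inverse = tuple(order.index(k) for k in range(3))
    return permute(filtered, inverse)


def multi_direction_conv(t, kernel):
    """Sum of three sliding directions over a tensor laid out as (z, y, x).

    ASSUMPTION: branches share one kernel and are fused by summation.
    """
    branches = [
        slide(t, kernel, (0, 1, 2)),  # z fixed: slide along y and x (standard case)
        slide(t, kernel, (1, 0, 2)),  # y fixed: slide along z and x
        slide(t, kernel, (2, 0, 1)),  # x fixed: slide along z and y
    ]
    nz, ny, nx = len(t), len(t[0]), len(t[0][0])
    return [[[sum(b[z][y][x] for b in branches) for x in range(nx)]
             for y in range(ny)] for z in range(nz)]


# Usage: a unit impulse at the centre of a 3x3x3 tensor, filtered with a 3x3 box kernel.
impulse = [[[0.0] * 3 for _ in range(3)] for _ in range(3)]
impulse[1][1][1] = 1.0
box = [[1.0] * 3 for _ in range(3)]
response = multi_direction_conv(impulse, box)
print(response[0][1][1])  # non-zero: the impulse has spread along the channel (z) axis
```

In this toy setting a single pass already spreads the impulse to neighbouring channels, whereas the channel-fixed branch alone would leave every other channel at zero. That is the receptive-field effect the abstract describes, shown here only qualitatively.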
Pages: 519-525
Number of pages: 7