Multi-Direction Convolution for Semantic Segmentation

被引:0
|
作者
Li, Dehui [1 ]
Cao, Zhiguo [1 ]
Xian, Ke [1 ]
Qi, Xinyuan [1 ]
Zhang, Chao [1 ]
Lu, Hao [2 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Artificial Intelligence & Automat, Natl Key Lab Sci & Technol Multispectral Informat, Wuhan 430074, Peoples R China
[2] Univ Adelaide, Sch Comp Sci, Adelaide, SA, Australia
基金
中国国家自然科学基金;
关键词
D O I
10.1109/ICPR48806.2021.9413174
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Context is known to be one of crucial factors effecting the performance improvement of semantic segmentation. However, state-of-the-art segmentation models built upon fully convolutional networks are inherently weak in encoding contextual information because of stacked local operations such as convolution and pooling. Failing to capture context leads to inferior segmentation performance. Despite many context modules have been proposed to relieve this problem, they still operate in a local manner or use the same contextual information in different positions (due to upsampling). In this paper. we introduce the idea of Multi-Direction Convolution (MDC)-a novel operator capable of encoding rich contextual information. This operator is inspired by an observation that the standard convolution only slides along the spatial dimension (x, y direction) where the channel dimension (z direction) is fixed, which renders slow growth of the receptive field (RF). If considering the channel-fixed convolution to be one-direction, MDC is multi-direction in the sense that MDC slides along both spatial and channel dimensions, i.e., it slides along x, y when z is fixed, along x, z when y is fixed, and along y, z when x is fixed. In this way, MDC is able to encode rich contextual information with the fast increase of the RE Compared to existing context modules, the encoded context is position-sensitive because no upsampling is required. MDC is also efficient and easy to implement. It can be implemented with few standard convolution layers with permutation. We show through extensive experiments that MDC effectively and selectively enlarges the RF and outperforms existing contextual modules on two standard benchmarks, including Cityscapes and PASCAL VOC2012.
引用
收藏
页码:519 / 525
页数:7
相关论文
共 50 条
  • [1] Improving Brain Tumor Segmentation with Dilated Pseudo-3D Convolution and Multi-direction Fusion
    Liu, Sun'ao
    Xu, Hai
    Liu, Yizhi
    Xie, Hongtao
    MULTIMEDIA MODELING (MMM 2020), PT I, 2020, 11961 : 727 - 738
  • [2] A multi-direction GVF snake for the segmentation of skin cancer images
    Tang, Jinshan
    PATTERN RECOGNITION, 2009, 42 (06) : 1172 - 1179
  • [3] Defect detection of cheese yarn based on multi-scale multi-direction template convolution
    Cai Y.
    Zhou X.
    Song M.
    Mou X.
    Fangzhi Xuebao/Journal of Textile Research, 2019, 40 (04): : 152 - 157
  • [4] Improving Brain Tumor Segmentation with Multi-direction Fusion and Fine Class Prediction
    Liu, Sun'ao
    Guo, Xiaonan
    BRAINLESION: GLIOMA, MULTIPLE SCLEROSIS, STROKE AND TRAUMATIC BRAIN INJURIES (BRAINLES 2019), PT I, 2020, 11992 : 349 - 358
  • [5] Local Threshold Segmentation Method Based on Multi-Direction Grayscale Wave for Image
    Wu Zhengping
    Ma Zhanwen
    Yan Hua
    Zhang Zhaomeng
    Yin Fan
    LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (06)
  • [6] Local Binary Convolution Based Prior Knowledge of Multi-Direction Features for Finger Vein Verification
    Zhang, Huijie
    Lu, Ling
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2023, E106D (05) : 1089 - 1093
  • [7] Multi-direction edge detection operator
    Xu, Pengfei
    Miao, Qiguang
    Liu, Tian'ge
    Chen, Xiaojiang
    2015 11TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY (CIS), 2015, : 187 - 190
  • [8] Multi-direction remote sensing ship detection based on center point and semantic information
    Li R.
    Zou H.
    Cao X.
    Cheng F.
    He S.
    Li M.
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2022, 44 (06): : 1772 - 1781
  • [9] Multi-direction empirical mode decomposition
    Liang, Lingfei
    Dong, Yongsheng
    Journal of Computational Information Systems, 2014, 10 (07): : 3003 - 3010
  • [10] DENSE CONVOLUTION FOR SEMANTIC SEGMENTATION
    Han, Chaoyi
    Tao, Xiaoming
    Duan, Yiping
    Lu, Jianhua
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 2222 - 2226