Scale-pyramid dynamic atrous convolution for pixel-level labeling

被引:1
|
作者
Li, Zhiqiang [1 ,2 ,3 ]
Jiang, Jie [1 ,3 ]
Chen, Xi [1 ,2 ,3 ]
Zhang, Min [7 ]
Wang, Yong [4 ]
Li, Qingli [2 ]
Qi, Honggang [5 ]
Liu, Min [1 ,3 ]
Laganiere, Robert [6 ]
机构
[1] East China Normal Univ, Key Lab Geog Informat Sci, Minist Educ, Shanghai 200241, Peoples R China
[2] East China Normal Univ, Shanghai Key Lab Multidimens Informat Proc, Shanghai 200241, Peoples R China
[3] East China Normal Univ, Sch Geog Sci, Shanghai 200241, Peoples R China
[4] Sun Yat Sen Univ, Sch Aeronaut & Astronaut, Shenzhen 518107, Peoples R China
[5] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing 100049, Peoples R China
[6] Univ Ottawa, Sch Elect Engn & Comp Sci, Ottawa, ON K1N 6N5, Canada
[7] Engn Univ PAP, Xian 710086, Peoples R China
基金
中国国家自然科学基金;
关键词
Pixel-level labeling; Deep learning; DCNN; Dynamic convolution; Kernel engineering; NETWORK;
D O I
10.1016/j.eswa.2023.122695
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For achieving better performance, the majority of deep convolutional neural networks have endeavored to increase the model capacity by adding more convolutional layers or increasing the size of the filters. Consequently, the computational cost increases proportionally with the model capacity. This problem can be alleviated by dynamic convolution. In the case of pixel-level labeling, existing pixel-level dynamic convolution methods have a smaller scanning area than ordinary convolution or image-level dynamic convolution and are thus unable to exploit fine contextual information. As a consequence, pixel-level dynamic convolution is more sensitive to large-scale varying objects and confusion categories. In this paper, we propose a scale-pyramid dynamic atrous convolution (SDAConv) and exploit multi-scale pixel-level features in finer granularity, in order to efficiently increase model capacity, exploring contextual information, capture detail information and alleviate large-scale variation problem at the same time. Through kernel engineering (instead of network engineering), SDAConv dynamically arranges atrous filters in the individual convolutional kernels over different semantic areas at dense scales in the spatial dimension. By simply replacing the regular convolution with SDAConv in SOTA architectures, extensive experiments on three public datasets, Cityscapes, PASCAL VOC 2012 and ADE20K benchmarks demonstrate the superior performance of SDAConv on pixel-level labeling tasks.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Research and development of non multi-scale to pixel-level image fusion
    Li, Mingjing
    Dong, Yubing
    Wang, Xiaoli
    RENEWABLE ENERGY AND ENVIRONMENTAL TECHNOLOGY, PTS 1-6, 2014, 448-453 : 3621 - 3624
  • [22] A Pixel-Level Segmentation-Synthesis Framework for Dynamic Texture Video Compression
    Wang, Suhong
    Jia, Chuanmin
    Zhang, Xinfeng
    Wang, Shanshe
    Ma, Siwei
    Gao, Wen
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (10) : 7077 - 7091
  • [23] The use of intermediate graphical constructions in problem solving with dynamic, pixel-level diagrams
    Furnas, G
    Qu, Y
    Shrivastava, S
    Peters, G
    THEORY AND APPLICATION OF DIAGRAMS, PROCEEDINGS, 2000, 1889 : 314 - 329
  • [24] Very Wide Dynamic Range ROIC With Pixel-Level ADC for SWIR FPAs
    Jo, Y. M.
    Woo, D. H.
    Kang, S. G.
    Lee, H. C.
    IEEE SENSORS JOURNAL, 2016, 16 (19) : 7227 - 7233
  • [25] Minimalistic fully convolution networks (MFCN): pixel-level classification for hyperspectral image with few labeled samples
    Xu, Buyun
    Hou, Weijun
    Wei, Yiwei
    Wang, Yiting
    Li, Xihai
    OPTICS EXPRESS, 2022, 30 (10) : 16585 - 16605
  • [26] Multi-scale feature fusion network for pixel-level pavement distress detection
    Zhong, Jingtao
    Zhu, Junqing
    Huyan, Ju
    Ma, Tao
    Zhang, Weiguang
    Automation in Construction, 2022, 141
  • [27] PIXEL-LEVEL ANNOTATION OF SPECIFIC TARGETS FOR LARGE-SCALE REMOTE SENSING IMAGES
    Liu, Guolong
    Hu, Wei
    Zhang, Fan
    IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 6374 - 6377
  • [28] Current Input Pixel-Level ADC with High SNR and Wide Dynamic Range for a Microbolometer
    Lee, Jeongho
    Nam, Ilku
    Woo, DooHyung
    SENSORS, 2021, 21 (07)
  • [29] Multi-scale feature fusion network for pixel-level pavement distress detection
    Zhong, Jingtao
    Zhu, Junqing
    Huyan, Ju
    Ma, Tao
    Zhang, Weiguang
    AUTOMATION IN CONSTRUCTION, 2022, 141
  • [30] Multi-scale feature fusion network for pixel-level pavement distress detection
    Zhong, Jingtao
    Zhu, Junqing
    Huyan, Ju
    Ma, Tao
    Zhang, Weiguang
    AUTOMATION IN CONSTRUCTION, 2022, 141