Scale-pyramid dynamic atrous convolution for pixel-level labeling

被引:1
|
作者
Li, Zhiqiang [1 ,2 ,3 ]
Jiang, Jie [1 ,3 ]
Chen, Xi [1 ,2 ,3 ]
Zhang, Min [7 ]
Wang, Yong [4 ]
Li, Qingli [2 ]
Qi, Honggang [5 ]
Liu, Min [1 ,3 ]
Laganiere, Robert [6 ]
机构
[1] East China Normal Univ, Key Lab Geog Informat Sci, Minist Educ, Shanghai 200241, Peoples R China
[2] East China Normal Univ, Shanghai Key Lab Multidimens Informat Proc, Shanghai 200241, Peoples R China
[3] East China Normal Univ, Sch Geog Sci, Shanghai 200241, Peoples R China
[4] Sun Yat Sen Univ, Sch Aeronaut & Astronaut, Shenzhen 518107, Peoples R China
[5] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing 100049, Peoples R China
[6] Univ Ottawa, Sch Elect Engn & Comp Sci, Ottawa, ON K1N 6N5, Canada
[7] Engn Univ PAP, Xian 710086, Peoples R China
基金
中国国家自然科学基金;
关键词
Pixel-level labeling; Deep learning; DCNN; Dynamic convolution; Kernel engineering; NETWORK;
D O I
10.1016/j.eswa.2023.122695
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For achieving better performance, the majority of deep convolutional neural networks have endeavored to increase the model capacity by adding more convolutional layers or increasing the size of the filters. Consequently, the computational cost increases proportionally with the model capacity. This problem can be alleviated by dynamic convolution. In the case of pixel-level labeling, existing pixel-level dynamic convolution methods have a smaller scanning area than ordinary convolution or image-level dynamic convolution and are thus unable to exploit fine contextual information. As a consequence, pixel-level dynamic convolution is more sensitive to large-scale varying objects and confusion categories. In this paper, we propose a scale-pyramid dynamic atrous convolution (SDAConv) and exploit multi-scale pixel-level features in finer granularity, in order to efficiently increase model capacity, exploring contextual information, capture detail information and alleviate large-scale variation problem at the same time. Through kernel engineering (instead of network engineering), SDAConv dynamically arranges atrous filters in the individual convolutional kernels over different semantic areas at dense scales in the spatial dimension. By simply replacing the regular convolution with SDAConv in SOTA architectures, extensive experiments on three public datasets, Cityscapes, PASCAL VOC 2012 and ADE20K benchmarks demonstrate the superior performance of SDAConv on pixel-level labeling tasks.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Binarization method based on pixel-level dynamic thresholds for change detection in image sequences
    Cheng, Hsu-Yung
    Wu, Quen-Zong
    Fan, Kuo-Chin
    Jeng, Bor-Shenn
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2006, 22 (03) : 545 - 557
  • [32] Automatic pixel-level detection method for concrete crack with channel-spatial attention convolution neural network
    Li, Yuanyuan
    Yu, Meng
    Wu, Decheng
    Li, Rui
    Xu, Kefei
    Cheng, Longqi
    STRUCTURAL HEALTH MONITORING-AN INTERNATIONAL JOURNAL, 2023, 22 (02): : 1460 - 1477
  • [33] A spatiotemporal convolution recurrent neural network for pixel-level peripapillary atrophy prediction using sequential fundus images
    Li, Mengxuan
    Zhang, Weihang
    Zhao, He
    Xu, Yubin
    Xu, Jie
    Li, Huiqi
    APPLIED SOFT COMPUTING, 2024, 155
  • [34] HDCB-Net: A Neural Network With the Hybrid Dilated Convolution for Pixel-Level Crack Detection on Concrete Bridges
    Jiang, Wenbo
    Liu, Min
    Peng, Yunuo
    Wu, Lehui
    Wang, Yaonan
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (08) : 5485 - 5494
  • [35] Pixel-Level Grasp Detection based on EfficientNet and Multi-scale Feature Fusion Network
    Gao, Junli
    Luo, Yinming
    Huang, Xianxin
    2024 IEEE INTERNATIONAL CONFERENCE ON CYBERNETICS AND INTELLIGENT SYSTEMS, CIS AND IEEE INTERNATIONAL CONFERENCE ON ROBOTICS, AUTOMATION AND MECHATRONICS, RAM, CIS-RAM 2024, 2024, : 486 - 491
  • [36] Multi-Scale Flame Situation Detection Based on Pixel-Level Segmentation of Visual Images
    Wang, Xinzhi
    Li, Mengyue
    Liu, Quanyi
    Chang, Yudong
    Zhang, Hui
    APPLIED SCIENCES-BASEL, 2023, 13 (19):
  • [37] Automatic pixel-level crack detection with multi-scale feature fusion for slab tracks
    Ye, Wenlong
    Ren, Juanjuan
    Zhang, Allen A.
    Lu, Chunfang
    COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2023, 38 (18) : 2648 - 2665
  • [38] Automatic Pixel-Level Pavement Crack Detection Using Information of Multi-Scale Neighborhoods
    Ai, Dihao
    Jiang, Guiyuan
    Kei, Lam Siew
    Li, Chengwu
    IEEE ACCESS, 2018, 6 : 24452 - 24463
  • [39] Population spatialization with pixel-level attribute grading by considering scale mismatch issue in regression modeling
    Mei, Yuao
    Gui, Zhipeng
    Wu, Jinghang
    Peng, Dehua
    Li, Rui
    Wu, Huayi
    Wei, Zhengyang
    GEO-SPATIAL INFORMATION SCIENCE, 2022, 25 (03) : 365 - 382
  • [40] A high dynamic range CMOS image sensor with a novel pixel-level logarithmic counter memory
    Freedman, Saul D.
    Boussaid, Farid
    2015 2ND INTERNATIONAL CONFERENCE ON KNOWLEDGE-BASED ENGINEERING AND INNOVATION (KBEI), 2015, : 14 - 19