Dense-scale dynamic network with filter-varying atrous convolution for semantic segmentation

被引：0

作者：

Zhiqiang Li

Jie Jiang

Xi Chen

Robert Laganière

Qingli Li

Min Liu

Honggang Qi

Yong Wang

Min Zhang

机构：

[1] East China Normal University,School of Geographic Sciences

[2] East China Normal University,The Key Laboratory of Geographic Information Science, Ministry of Education of China

[3] East China Normal University,Key Laboratory of Spatial

[4] East China Normal University,temporal Big Data Analysis and Application of Natural Resources in Megacities, Ministry of Natural Resources

[5] University of Ottawa,Shanghai Key Laboratory of Multidimensional Information Processing

[6] University of Chinese Academy of Sciences,School of Electrical Engineering and Computer Science

[7] Sun Yat-sen University,School of Computer Science and Technology

[8] Engineering University of PAP,School of Aeronautics and Astronautics

来源：

Applied Intelligence | 2023年 / 53卷

关键词：

Semantic segmentation; Deep learning; Deep convolution neural networks (DCNNs); Dynamic convolution;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Deep convolution neural networks (DCNNs) in deep learning have been widely used in semantic segmentation. However, the filters of most regular convolutions in DCNNs are spatially invariant to local transformations, which reduces localization accuracy and hinders the improvement of semantic segmentation. Dynamic convolution with pixel-level filters can enhance the localization accuracy through its region-awareness, but these are sensitive to objects with large-scale variations in semantic segmentation. To simultaneously address the low localization accuracy and objects with large-scale variations, we propose a filter-varying atrous convolution (FAC) to efficiently enlarge the per-pixel receptive fields pertaining to various objects. FAC mainly consists of a conditional-filter-generating network (CFGN) and a dynamic local filtering operation (DLFO). In the CFGN, a class probability map is used to generate the corresponding filters, making the FAC genuinely dynamic. In the DLFO, by replacing the sliding convolution operation one by one with a one-time dot product operation, the efficiency of the algorithm is greatly improved. Also, a dense scale module (DSM) is constructed to generate denser scales and larger receptive fields for exploring long-range contextual information. Finally, a dense-scale dynamic network (DsDNet) simultaneously enhances the localization accuracy and reduces the effect of large-scale variations of the object, by assigning FAC to different spatial locations at dense scales. In addition, to accelerate network convergence and improve segmentation accuracy, our network employs two pixel-wise cross-entropy loss functions. One is between the Backbone and DSM, and the other is at the network’s end. Extensive experiments on Cityscapes, PASCAL VOC 2012, and ADE20K datasets verify that the performance of our DsDNet is superior to the non-dynamic and multi-scale convolution neural networks.

引用

页码：26810 / 26826

页数：16

共 50 条

[41] Semantic Segmentation of High-Resolution Remote Sensing Images Based on Improved FuseNet Combined with Atrous Convolution
Yang J.
Yu X.
[J]. Wuhan Daxue Xuebao (Xinxi Kexue Ban)/Geomatics and Information Science of Wuhan University, 2022, 47 (07): : 1071 - 1080
[42] Dynamic attention network for semantic segmentation
Wu, Fei
Chen, Feng
Jing, Xiao-Yuan
Hu, Chang-Hui
Ge, Qi
Ji, Yimu
[J]. NEUROCOMPUTING, 2020, 384 (384) : 182 - 191
[43] A Two-Stage Atrous Convolution Neural Network for Brain Tumor Segmentation and Survival Prediction
Miron, Radu
Albert, Ramona
Breaban, Mihaela
[J]. BRAINLESION: GLIOMA, MULTIPLE SCLEROSIS, STROKE AND TRAUMATIC BRAIN INJURIES (BRAINLES 2020), PT II, 2021, 12659 : 290 - 299
[44] MCANet: multi-scale contextual feature fusion network based on Atrous convolution
Ke Li
ZhanDong Liu
[J]. Multimedia Tools and Applications, 2023, 82 : 34679 - 34702
[45] Dynamic Sampling Network for Semantic Segmentation
Fu, Bin
He, Junjun
Zhang, Zhengfu
Qiao, Yu
[J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 10794 - 10801
[46] MCANet: multi-scale contextual feature fusion network based on Atrous convolution
Li, Ke
Liu, ZhanDong
[J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (22) : 34679 - 34702
[47] Fine semantic mapping based on dense segmentation network
Zuo, Guoyu
Zheng, Tao
Liu, Yuelei
Xu, Zichen
Gong, Daoxiong
Yu, Jianjun
[J]. INTELLIGENT SERVICE ROBOTICS, 2021, 14 (01) : 47 - 60
[48] Fine semantic mapping based on dense segmentation network
Guoyu Zuo
Tao Zheng
Yuelei Liu
Zichen Xu
Daoxiong Gong
Jianjun Yu
[J]. Intelligent Service Robotics, 2021, 14 : 47 - 60
[49] Gaussian Dynamic Convolution for Semantic Segmentation in Remote Sensing Images
Feng, Mingzhe
Sun, Xin
Dong, Junyu
Zhao, Haoran
[J]. REMOTE SENSING, 2022, 14 (22)
[50] Strong-Structural Convolution Neural Network for Semantic Segmentation
Ouyang, Yi
[J]. PATTERN RECOGNITION AND IMAGE ANALYSIS, 2019, 29 (04) : 716 - 729

← 1 2 3 4 5 →