Multilevel feature fusion dilated convolutional network for semantic segmentation
Cited by: 8
Authors:
Ku, Tao [1,2,3]
Yang, Qirui [1,2,3,4]
Zhang, Hao [1,2,3]
Affiliations:
[1] Chinese Acad Sci, Shenyang Inst Automat, Shenyang 110016, Peoples R China
[2] Chinese Acad Sci, Inst Robot, Shenyang 110169, Peoples R China
[3] Chinese Acad Sci, Inst Intelligent Mfg, Shenyang 110169, Peoples R China
[4] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
Recently, convolutional neural networks (CNNs) have driven significant progress in computer vision, particularly in the accuracy and speed of semantic segmentation, which has greatly improved robot scene perception. In this article, we propose a multilevel feature fusion dilated convolution network (Refine-DeepLab). By improving the spatial pyramid pooling structure, we propose a multiscale hybrid dilated convolution module that captures rich context information and effectively alleviates the contradiction between the receptive field size and the dilated convolution operation. At the same time, the high-level and low-level semantic information obtained through multilevel, multiscale feature extraction improves the capture of global information and the segmentation of large-scale targets. The encoder-decoder gradually recovers spatial information while capturing high-level semantic information, resulting in sharper object boundaries. Extensive experiments verify the effectiveness of the proposed Refine-DeepLab model. We evaluate our approach thoroughly on the PASCAL VOC 2012 data set, without MS COCO pretraining, and achieve a state-of-the-art result of 81.73% mean intersection-over-union on the validation set.
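For readers unfamiliar with the metric reported in the abstract, the following is a minimal sketch of how mean intersection-over-union is commonly computed from a confusion matrix. It is not the authors' evaluation code: the function name `mean_iou`, its parameters, and the toy labels are illustrative assumptions. The only dataset-specific detail used is that PASCAL VOC marks "void" pixels with label 255, which are skipped.

```python
def mean_iou(pred, target, num_classes, ignore_label=255):
    """Mean IoU over the classes that occur in pred or target (hypothetical helper)."""
    # Confusion matrix: rows index the ground-truth class, columns the prediction.
    conf = [[0] * num_classes for _ in range(num_classes)]
    for p, t in zip(pred, target):
        if t == ignore_label:  # skip "void" pixels (label 255 in PASCAL VOC)
            continue
        conf[t][p] += 1

    ious = []
    for c in range(num_classes):
        tp = conf[c][c]                                         # true positives
        fn = sum(conf[c]) - tp                                  # missed pixels of class c
        fp = sum(conf[r][c] for r in range(num_classes)) - tp   # wrongly labeled as c
        union = tp + fp + fn
        if union > 0:  # leave out classes absent from both maps
            ious.append(tp / union)
    return sum(ious) / len(ious) if ious else 0.0


# Toy example (made-up data): six pixels, three classes, flattened label maps.
target = [0, 0, 1, 1, 2, 255]
pred = [0, 1, 1, 1, 2, 2]
print(mean_iou(pred, target, num_classes=3))
```

In a benchmark setting the confusion matrix is usually accumulated over every image in the split before the per-class ratios are taken, rather than averaged image by image.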
Authors:
Wang, Yinyu [1]
Meng, Fanyun [1]
Wang, Jinhe [1]
Liu, Zhihao [1]
Affiliations:
[1] School of Information and Control Engineering, Qingdao University of Technology, Qingdao, Shandong 266000, China
Authors:
Xu, Lian [1]
Xue, Hao [1]
Bennamoun, Mohammed [1]
Boussaid, Farid [2]
Affiliations:
[1] Department of Computer Science and Software Engineering, The University of Western Australia, 35 Stirling Hwy, Perth, WA 6009, Australia
[2] School of Electrical, Electronics and Computer Engineering, The University of Western Australia, 35 Stirling Hwy, Perth, WA 6009, Australia