Multilevel feature fusion dilated convolutional network for semantic segmentation

被引:8
|
作者
Ku, Tao [1 ,2 ,3 ]
Yang, Qirui [1 ,2 ,3 ,4 ]
Zhang, Hao [1 ,2 ,3 ]
机构
[1] Chinese Acad Sci, Shenyang Inst Automat, Shenyang 110016, Peoples R China
[2] Chinese Acad Sci, Inst Robot, Shenyang 110169, Peoples R China
[3] Chinese Acad Sci, Inst Intelligent Mfg, Shenyang 110169, Peoples R China
[4] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
关键词
Semantic segmentation; convolutional neural network; deep learning; computer vision; robot vision;
D O I
10.1177/17298814211007665
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Recently, convolutional neural network (CNN) has led to significant improvement in the field of computer vision, especially the improvement of the accuracy and speed of semantic segmentation tasks, which greatly improved robot scene perception. In this article, we propose a multilevel feature fusion dilated convolution network (Refine-DeepLab). By improving the space pyramid pooling structure, we propose a multiscale hybrid dilated convolution module, which captures the rich context information and effectively alleviates the contradiction between the receptive field size and the dilated convolution operation. At the same time, the high-level semantic information and low-level semantic information obtained through multi-level and multi-scale feature extraction can effectively improve the capture of global information and improve the performance of large-scale target segmentation. The encoder-decoder gradually recovers spatial information while capturing high-level semantic information, resulting in sharper object boundaries. Extensive experiments verify the effectiveness of our proposed Refine-DeepLab model, evaluate our approaches thoroughly on the PASCAL VOC 2012 data set without MS COCO data set pretraining, and achieve a state-of-art result of 81.73% mean interaction-over-union in the validate set.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] A Strip Dilated Convolutional Network for Semantic Segmentation
    Zhou, Yan
    Zheng, Xihong
    Ouyang, Wanli
    Li, Baopu
    [J]. NEURAL PROCESSING LETTERS, 2023, 55 (04) : 4439 - 4459
  • [2] A Strip Dilated Convolutional Network for Semantic Segmentation
    Yan Zhou
    Xihong Zheng
    Wanli Ouyang
    Baopu Li
    [J]. Neural Processing Letters, 2023, 55 : 4439 - 4459
  • [3] Multilevel feature context semantic fusion network for cloud and cloud shadow segmentation
    Zhang, Enwei
    Hu, Kai
    Xia, Min
    Weng, Liguo
    Lin, Haifeng
    [J]. JOURNAL OF APPLIED REMOTE SENSING, 2022, 16 (04)
  • [4] Improved ASPP and Multilevel Feature Semantic Fusion Segmentation Method
    Wang, Yinyu
    Meng, Fanyun
    Wang, Jinhe
    Liu, Zhihao
    [J]. Computer Engineering and Applications, 2023, 59 (13) : 220 - 228
  • [5] Dilated Convolutional Pixels Affinity Network for Weakly Supervised Semantic Segmentation
    Zhang Zhe
    Wang Bilin
    Yu Zhezhou
    Li Zhiyuan
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2021, 30 (06) : 1120 - 1130
  • [6] Dilated Convolutional Pixels Affinity Network for Weakly Supervised Semantic Segmentation
    ZHANG Zhe
    WANG Bilin
    YU Zhezhou
    LI Zhiyuan
    [J]. Chinese Journal of Electronics, 2021, 30 (06) : 1120 - 1130
  • [7] Multilevel Context Feature Fusion for Semantic Segmentation of ALS Point Cloud
    Zeng, Tao
    Luo, Fulin
    Guo, Tan
    Gong, Xiuwen
    Xue, Jingyun
    Li, Hanshan
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [8] Atrous convolutional feature network for weakly supervised semantic segmentation
    Xu, Lian
    Xue, Hao
    Bennamoun, Mohammed
    Boussaid, Farid
    Sohel, Ferdous
    [J]. Neurocomputing, 2021, 421 : 115 - 126
  • [9] Atrous convolutional feature network for weakly supervised semantic segmentation
    Xu, Lian
    Xue, Hao
    Bennamoun, Mohammed
    Boussaid, Farid
    Sohel, Ferdous
    [J]. NEUROCOMPUTING, 2021, 421 : 115 - 126
  • [10] A New Multilevel Feature Fusion Network for Medical Image Segmentation
    Xiaojing Qiu
    [J]. Sensing and Imaging, 2021, 22