Multi-scale deep encoder-decoder network for salient object detection

Cited by: 13
Authors
Ren, Qinghua [1]
Hu, Renjie [1]
Affiliations
[1] Southeast Univ, Sch Elect Engn, Nanjing, Jiangsu, Peoples R China
Keywords
Salient object detection; Multi-scale; Encoder-decoder; CNN
DOI
10.1016/j.neucom.2018.07.055
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Deep convolutional neural networks (CNNs) have recently made revolutionary improvements in salient object detection. However, most existing CNN-based models fail to precisely separate the whole salient object(s) from a cluttered background because of downsampling effects or patch-level operations. In this paper, we propose a multi-scale deep encoder-decoder network that learns discriminative saliency cues and computes confidence scores in an end-to-end fashion. The encoder network extracts meaningful and informative features from a global view, and the decoder network recovers lost detailed object structure from a local perspective. By taking multiple resized images as inputs, the proposed model incorporates multi-scale features from a shared network and predicts a fine-grained saliency map at the pixel level. To make the whole network easy and efficient to train, the lightweight decoder departs from the conventional symmetric structure. In addition, a two-stage training strategy is designed to improve the robustness and accuracy of the network. Without any post-processing steps, our method significantly reduces computational complexity while densely segmenting foreground objects from an image. Extensive experiments on six challenging datasets demonstrate that the proposed model outperforms other state-of-the-art approaches in terms of various evaluation metrics. © 2018 Elsevier B.V. All rights reserved.
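The multi-scale idea in the abstract (several resized copies of one image passed through a shared network, then combined into a single pixel-level map) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: `shared_network` is a hypothetical toy stand-in for the encoder-decoder, and the scale factors, nearest-neighbour resizing, and fusion by averaging are all assumptions made for illustration.

```python
import numpy as np

def resize_nearest(img, out_h, out_w):
    """Nearest-neighbour resize of a 2-D array (stand-in for real image resizing)."""
    in_h, in_w = img.shape
    rows = np.arange(out_h) * in_h // out_h  # source row for each output row
    cols = np.arange(out_w) * in_w // out_w  # source column for each output column
    return img[rows[:, None], cols[None, :]]

def shared_network(img):
    """Hypothetical placeholder for the shared encoder-decoder.

    Maps an image to per-pixel confidence scores in (0, 1) with a sigmoid
    around the image mean; a trained CNN would take its place.
    """
    return 1.0 / (1.0 + np.exp(-(img - img.mean())))

def multi_scale_saliency(img, scales=(0.5, 1.0, 1.5)):
    """Run the same network on several resized inputs and fuse the results.

    The scale set and the mean fusion are assumptions, not taken from the paper.
    """
    h, w = img.shape
    maps = []
    for s in scales:
        sh, sw = max(1, int(round(h * s))), max(1, int(round(w * s)))
        pred = shared_network(resize_nearest(img, sh, sw))  # same weights at every scale
        maps.append(resize_nearest(pred, h, w))             # back to input resolution
    return np.mean(maps, axis=0)                            # pixel-level fused map
```

Calling `multi_scale_saliency` on a 2-D grayscale array returns a map of the same shape; with a bright square on a dark background, pixels inside the square receive higher scores than those outside.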
Pages: 95-104
Page count: 10
Related papers
  • [1] Multi-Scale Attention and Encoder-Decoder Network for Video Saliency Object Detection
    Hongbo Bi
    Huihui Zhu
    Lina Yang
    Ranwan Wu
    [J]. Pattern Recognition and Image Analysis, 2022, 32 : 340 - 350
  • [2] Multi-Scale Attention and Encoder-Decoder Network for Video Saliency Object Detection
    Bi, Hongbo
    Zhu, Huihui
    Yang, Lina
    Wu, Ranwan
    [J]. PATTERN RECOGNITION AND IMAGE ANALYSIS, 2022, 32 (02) : 340 - 350
  • [3] A Traffic Surveillance Multi-Scale Vehicle Detection Object Method Base on Encoder-Decoder
    Hong, Feng
    Lu, Chang-Hua
    Liu, Chun
    Liu, Ru-Ru
    Wei, Ju
    [J]. IEEE ACCESS, 2020, 8 : 47664 - 47674
  • [4] A Multi-scale Edge Detection Method Based on Encoder-Decoder
    Tian, An-Lin
    Lei, Wei-Min
    Zhang, Peng
    Zhang, Wei
    [J]. Dongbei Daxue Xuebao/Journal of Northeastern University, 2024, 45 (07) : 936 - 943
  • [5] Multi-scale deep neural network for salient object detection
    Xiao, Fen
    Deng, Wenzheng
    Peng, Liangchan
    Cao, Chunhong
    Hu, Kai
    Gao, Xieping
    [J]. IET IMAGE PROCESSING, 2018, 12 (11) : 2036 - 2041
  • [6] SAR IMAGES ENHANCEMENT VIA DEEP MULTI-SCALE ENCODER-DECODER NEURAL NETWORK
    Yang, Xiaqing
    Zhou, Yuanyuan
    Wang, Chen
    Shi, Jun
    [J]. 2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019 : 3368 - 3371
  • [7] Encoder deep interleaved network with multi-scale aggregation for RGB-D salient object detection
    Feng, Guang
    Meng, Jinyu
    Zhang, Lihe
    Lu, Huchuan
    [J]. PATTERN RECOGNITION, 2022, 128
  • [8] Multi-scale Recurrent Encoder-Decoder Network for Dense Temporal Classification
    Choo, Sungkwon
    Seo, Wonkyo
    Jeong, Dong-Ju
    Cho, Nam Ik
    [J]. 2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018 : 103 - 108
  • [9] HYPERSPECTRAL IMAGE CLASSIFICATION VIA MULTI-SCALE ENCODER-DECODER NETWORK
    Ma, Jingjing
    Wu, Linlin
    Tang, Xu
    Zhang, Xiangrong
    Zhu, Cheng
    Ma, Junyong
    Jiao, Licheng
    [J]. IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020 : 1283 - 1286
  • [10] Multi-scale Supervised Attentive Encoder-Decoder Network for Crowd Counting
    Zhang, Anran
    Jiang, Xiaolong
    Zhang, Baochang
    Cao, Xianbin
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2020, 16 (01)