Multi-scale deep encoder-decoder network for salient object detection

被引:13
|
作者
Ren, Qinghua [1 ]
Hu, Renjie [1 ]
机构
[1] Southeast Univ, Sch Elect Engn, Nanjing, Jiangsu, Peoples R China
关键词
Salient object detection; Multi-scale; Encoder-decoder; CNN;
D O I
10.1016/j.neucom.2018.07.055
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep convolutional neural networks (CNNs) have recently made revolutionary improvements in salient object detection. However, most existing CNN-based models fail to precisely separate the whole salient object(s) from a cluttered background due to the downsampling effects or the patch-level operation. In this paper, we propose a multi-scale deep encoder-decoder network which learns discriminative saliency cues and computes confidence scores in an end-to-end fashion. The encoder network extracts meaningful and informative features in a global view, and the decoder network recovers lost detailed object structure in a local perspective. By taking multiple resized images as the inputs, the proposed model incorporates multi-scale features from a shared network and predicts a fine-grained saliency map at the pixel level. To easily and efficiently train the whole network, the light-weighted decoder breaks through the limit of conventional symmetric structure. In addition, a two-stage training strategy is designed to encourage the robustness and accuracy of the network. Without any post-processing steps, our method is capable of significantly reducing the computation complexity while densely segmenting foreground objects from an image. Extensive experiments on six challenging datasets demonstrate that the proposed model outperforms other state-of-the-art approaches in terms of various evaluation metrics. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:95 / 104
页数:10
相关论文
共 50 条
  • [21] TMNet: Triple-modal interaction encoder and multi-scale fusion decoder network for V-D-T salient object detection
    Wan, Bin
    Lv, Chengtao
    Zhou, Xiaofei
    Sun, Yaoqi
    Zhu, Zunjie
    Wang, Hongkui
    Yan, Chenggang
    [J]. PATTERN RECOGNITION, 2024, 147
  • [22] Multi-scale Pyramid Pooling Network for salient object detection
    Dakhia, Abdelhafid
    Wang, Tiantian
    Lu, Huchuan
    [J]. NEUROCOMPUTING, 2019, 333 : 211 - 220
  • [23] Encoder-Decoder with Multi-scale Information Fusion for Semantic Image Segmentation
    Ma, Xinxin
    Liu, Kai
    Ding, Chongyang
    Yan, Lin
    Duan, Meiyu
    [J]. ELEVENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2019), 2020, 11373
  • [24] MEDUSA: Multi-Scale Encoder-Decoder Self-Attention Deep Neural Network Architecture for Medical Image Analysis
    Aboutalebi, Hossein
    Pavlova, Maya
    Gunraj, Hayden
    Shafiee, Mohammad Javad
    Sabri, Ali
    Alaref, Amer
    Wong, Alexander
    [J]. FRONTIERS IN MEDICINE, 2022, 8
  • [25] Attention Guided Encoder-Decoder Network With Multi-Scale Context Aggregation for Land Cover Segmentation
    Wang, Shuyang
    Mu, Xiaodong
    Yang, Dongfang
    He, Hao
    Zhao, Peng
    [J]. IEEE ACCESS, 2020, 8 : 215299 - 215309
  • [26] Semantic Segmentation of Remote Sensing Image Based on Multi-Scale Semantic Encoder-Decoder Network
    Liang Y.
    Yi C.-X.
    Wang G.-Y.
    Hu Y.-H.
    [J]. Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2023, 51 (11): : 3199 - 3214
  • [27] Multi-scale salient object detection network combining an attention mechanism
    Liu, Di
    Guo, Jichang
    Wang, Yudong
    Zhang, Yi
    [J]. Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2022, 49 (04): : 118 - 126
  • [28] Salient Object Detection with Chained Multi-Scale Fully Convolutional Network
    Tang, Youbao
    Wu, Xiangqian
    [J]. PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 618 - 626
  • [29] DMINet: dense multi-scale inference network for salient object detection
    Chenxing Xia
    Yanguang Sun
    Xiuju Gao
    Bin Ge
    Songsong Duan
    [J]. The Visual Computer, 2022, 38 : 3059 - 3072
  • [30] DMINet: dense multi-scale inference network for salient object detection
    Xia, Chenxing
    Sun, Yanguang
    Gao, Xiuju
    Ge, Bin
    Duan, Songsong
    [J]. VISUAL COMPUTER, 2022, 38 (9-10): : 3059 - 3072