Multi-scale deep encoder-decoder network for salient object detection

被引：13

作者：

Ren, Qinghua ^{[1
]}

Hu, Renjie ^{[1
]}

机构：

[1] Southeast Univ, Sch Elect Engn, Nanjing, Jiangsu, Peoples R China

来源：

NEUROCOMPUTING | 2018年 / 316卷

关键词：

Salient object detection; Multi-scale; Encoder-decoder; CNN;

D O I：

10.1016/j.neucom.2018.07.055

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep convolutional neural networks (CNNs) have recently made revolutionary improvements in salient object detection. However, most existing CNN-based models fail to precisely separate the whole salient object(s) from a cluttered background due to the downsampling effects or the patch-level operation. In this paper, we propose a multi-scale deep encoder-decoder network which learns discriminative saliency cues and computes confidence scores in an end-to-end fashion. The encoder network extracts meaningful and informative features in a global view, and the decoder network recovers lost detailed object structure in a local perspective. By taking multiple resized images as the inputs, the proposed model incorporates multi-scale features from a shared network and predicts a fine-grained saliency map at the pixel level. To easily and efficiently train the whole network, the light-weighted decoder breaks through the limit of conventional symmetric structure. In addition, a two-stage training strategy is designed to encourage the robustness and accuracy of the network. Without any post-processing steps, our method is capable of significantly reducing the computation complexity while densely segmenting foreground objects from an image. Extensive experiments on six challenging datasets demonstrate that the proposed model outperforms other state-of-the-art approaches in terms of various evaluation metrics. (C) 2018 Elsevier B.V. All rights reserved.

引用

页码：95 / 104

页数：10

共 50 条

[21] TMNet: Triple-modal interaction encoder and multi-scale fusion decoder network for V-D-T salient object detection
Wan, Bin
Lv, Chengtao
Zhou, Xiaofei
Sun, Yaoqi
Zhu, Zunjie
Wang, Hongkui
Yan, Chenggang
[J]. PATTERN RECOGNITION, 2024, 147
[22] Multi-scale Pyramid Pooling Network for salient object detection
Dakhia, Abdelhafid
Wang, Tiantian
Lu, Huchuan
[J]. NEUROCOMPUTING, 2019, 333 : 211 - 220
[23] Encoder-Decoder with Multi-scale Information Fusion for Semantic Image Segmentation
Ma, Xinxin
Liu, Kai
Ding, Chongyang
Yan, Lin
Duan, Meiyu
[J]. ELEVENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2019), 2020, 11373
[24] MEDUSA: Multi-Scale Encoder-Decoder Self-Attention Deep Neural Network Architecture for Medical Image Analysis
Aboutalebi, Hossein
Pavlova, Maya
Gunraj, Hayden
Shafiee, Mohammad Javad
Sabri, Ali
Alaref, Amer
Wong, Alexander
[J]. FRONTIERS IN MEDICINE, 2022, 8
[25] Attention Guided Encoder-Decoder Network With Multi-Scale Context Aggregation for Land Cover Segmentation
Wang, Shuyang
Mu, Xiaodong
Yang, Dongfang
He, Hao
Zhao, Peng
[J]. IEEE ACCESS, 2020, 8 : 215299 - 215309
[26] Semantic Segmentation of Remote Sensing Image Based on Multi-Scale Semantic Encoder-Decoder Network
Liang Y.
Yi C.-X.
Wang G.-Y.
Hu Y.-H.
[J]. Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2023, 51 (11): : 3199 - 3214
[27] Multi-scale salient object detection network combining an attention mechanism
Liu, Di
Guo, Jichang
Wang, Yudong
Zhang, Yi
[J]. Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2022, 49 (04): : 118 - 126
[28] Salient Object Detection with Chained Multi-Scale Fully Convolutional Network
Tang, Youbao
Wu, Xiangqian
[J]. PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 618 - 626
[29] DMINet: dense multi-scale inference network for salient object detection
Chenxing Xia
Yanguang Sun
Xiuju Gao
Bin Ge
Songsong Duan
[J]. The Visual Computer, 2022, 38 : 3059 - 3072
[30] DMINet: dense multi-scale inference network for salient object detection
Xia, Chenxing
Sun, Yanguang
Gao, Xiuju
Ge, Bin
Duan, Songsong
[J]. VISUAL COMPUTER, 2022, 38 (9-10): : 3059 - 3072

← 1 2 3 4 5 →