Encoder deep interleaved network with multi-scale aggregation for RGB-D salient object detection

被引:25
|
作者
Feng, Guang [1 ]
Meng, Jinyu [1 ]
Zhang, Lihe [1 ]
Lu, Huchuan [1 ]
机构
[1] Dalian Univ Technol, Sch Informat & Commun Engn, Dalian 116023, Peoples R China
关键词
RGB-D salient object detection; Deep interleaved encoder; Cross-modal mutual guidance; Residual multi-scale feature aggregation; Real-time; FUSION; ATTENTION;
D O I
10.1016/j.patcog.2022.108666
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A B S T R A C T Recently, RGB-D salient object detection (SOD) has aroused widespread research interest. Existing RGB-D SOD approaches mainly consider the cross-modal information fusion in the decoder. And their multi modal interaction mainly concentrates on the same level of features between RGB stream and depth stream. They do not deeply explore the coherence of multi-model features at different levels. In this paper, we design a two-stream deep interleaved encoder network to extract RGB and depth information and realize their mixing simultaneously. This network allows us to gradually learn multi-modal representation at different levels from shallow to deep. Moreover, to further fuse multi-modal features in the decoding stage, we propose a cross-modal mutual guidance module and a residual multi-scale aggregation module to implement the global guidance and local refinement of the salient region. Extensive experiments on six benchmark datasets demonstrate that the proposed approach performs favorably against most stateof-the-art methods under different evaluation metrics. During the testing stage, this model can run at a real-time speed of 93 FPS.(c) 2022 Elsevier Ltd. All rights reserved.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Progressive multi-scale fusion network for RGB-D salient object detection
    Ren, Guangyu
    Xie, Yanchun
    Dai, Tianhong
    Stathaki, Tania
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2022, 223
  • [2] Multi-scale iterative refinement network for RGB-D salient object detection
    Liu, Ze-Yu
    Liu, Jian-Wei
    Zuo, Xin
    Hu, Ming-Fei
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 106
  • [3] Multi-scale Residual Interaction for RGB-D Salient Object Detection
    Hu, Mingjun
    Zhang, Xiaoqin
    Zhao, Li
    [J]. COMPUTER VISION - ACCV 2022, PT III, 2023, 13843 : 575 - 590
  • [4] MMNet: Multi-Stage and Multi-Scale Fusion Network for RGB-D Salient Object Detection
    Liao, Guibiao
    Gao, Wei
    Jiang, Qiuping
    Wang, Ronggang
    Li, Ge
    [J]. MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 2436 - 2444
  • [5] Cross-Stage Multi-Scale Interaction Network for RGB-D Salient Object Detection
    Yi, Kang
    Zhu, Jinchao
    Guo, Fu
    Xu, Jing
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 2402 - 2406
  • [6] Dual attention guided multi-scale fusion network for RGB-D salient object detection
    Gao, Huan
    Guo, Jichang
    Wang, Yudong
    Dong, Jianan
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2023, 118
  • [7] MAGNet: Multi-scale Awareness and Global fusion Network for RGB-D salient object detection
    Zhong, Mingyu
    Sun, Jing
    Ren, Peng
    Wang, Fasheng
    Sun, Fuming
    [J]. KNOWLEDGE-BASED SYSTEMS, 2024, 299
  • [8] Multi-scale deep encoder-decoder network for salient object detection
    Ren, Qinghua
    Hu, Renjie
    [J]. NEUROCOMPUTING, 2018, 316 : 95 - 104
  • [9] Rethinking feature aggregation for deep RGB-D salient object detection
    Zhang, Yuan-fang
    Zheng, Jiangbin
    Li, Long
    Liu, Nian
    Jia, Wenjing
    Fan, Xiaochen
    Xu, Chengpei
    He, Xiangjian
    [J]. NEUROCOMPUTING, 2021, 423 : 463 - 473
  • [10] RGB-D Salient Object Detection via Feature Fusion and Multi-scale Enhancement
    Wu, Peiliang
    Duan, Liangliang
    Kong, Lingfu
    [J]. COMPUTER VISION, CCCV 2015, PT II, 2015, 547 : 359 - 368