Perceptual localization and focus refinement network for RGB-D salient object detection

被引:0
|
作者
Han, Jinyu [1 ]
Wang, Mengyin [1 ]
Wu, Weiyi [1 ]
Jia, Xu [2 ]
机构
[1] Dalian Minzu Univ, Sch Informat & Commun Engn, Dalian 116600, Peoples R China
[2] Liaoning Univ Technol, Sch Elect & Informat Engn, Jinzhou 121001, Peoples R China
基金
中国国家自然科学基金;
关键词
Salient object detection; RGB-D; Multi-level; Cross-modal; Fusion network; IMAGE;
D O I
10.1016/j.eswa.2024.125278
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
RGB-D salient object detection task still encounters three challenges: (1) how to effectively integrate superior information from different modalities, (2) how to effectively mine common information of features at different levels, and (3) how to detect salient objects in complex scenes, such as complex backgrounds, low-quality depth maps, small targets, and high foreground-background similarity. To address the above challenges, we propose a novel Perceptual Localization and Focus Refinement Network, termed PLFRNet, based on the mechanism of human visual capture of salient objects in images. The network includes three key components: an encoder, a Perceptual Localization Module (PLM), and a Focus-Refinement Decoder (FRD). Specifically, we first adopt a two-stream asymmetric Pyramid Visual Transformer as the encoder to extract RGB and depth features. Then, we develop the PLM under the guidance of a Perceptual Localization Unit (PLU) delicately designed. This module can mine the common information of features at different levels and integrate the advantageous information from multiple modalities to localize salient objects. Finally, we propose the FRD focusing on detailed information guided by the attention mechanism. Furthermore, it further refines the located objects by gradually interacting with low-level features to achieve salient object detection. Extensive experimental results show that this method achieves state-of-the-art performance compared with 13 RGB-D models on 6 public datasets. The codes are released at https://github.com/hjy0518/PLFRNet/.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Multi-scale iterative refinement network for RGB-D salient object detection
    Liu, Ze-Yu
    Liu, Jian-Wei
    Zuo, Xin
    Hu, Ming-Fei
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 106
  • [2] Depth-aware inverted refinement network for RGB-D salient object detection
    Gao, Lina
    Liu, Bing
    Fu, Ping
    Xu, Mingzhu
    [J]. NEUROCOMPUTING, 2023, 518 : 507 - 522
  • [3] AirSOD: A Lightweight Network for RGB-D Salient Object Detection
    Zeng, Zhihong
    Liu, Haijun
    Chen, Fenglei
    Tan, Xiaoheng
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (03) : 1656 - 1669
  • [4] Multi-modality information refinement fusion network for RGB-D salient object detection
    Bao, Hua
    Fan, Bo
    [J]. VISUAL COMPUTER, 2024, 40 (06): : 4183 - 4199
  • [5] Circular Complement Network for RGB-D Salient Object Detection
    Bai, Zhen
    Liu, Zhi
    Li, Gongyang
    Ye, Linwei
    Wang, Yang
    [J]. NEUROCOMPUTING, 2021, 451 : 95 - 106
  • [6] Bilateral Attention Network for RGB-D Salient Object Detection
    Zhang, Zhao
    Lin, Zheng
    Xu, Jun
    Jin, Wen-Da
    Lu, Shao-Ping
    Fan, Deng-Ping
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 1949 - 1961
  • [7] Dynamic Selective Network for RGB-D Salient Object Detection
    Wen, Hongfa
    Yan, Chenggang
    Zhou, Xiaofei
    Cong, Runmin
    Sun, Yaoqi
    Zheng, Bolun
    Zhang, Jiyong
    Bao, Yongjun
    Ding, Guiguang
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 9179 - 9192
  • [8] DYNAMIC SELECTION NETWORK FOR RGB-D SALIENT OBJECT DETECTION
    Zhou, Jinlin
    Luo, Zhiming
    Li, Shaozi
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 776 - 780
  • [9] Siamese Network for RGB-D Salient Object Detection and Beyond
    Fu, Keren
    Fan, Deng-Ping
    Ji, Ge-Peng
    Zhao, Qijun
    Shen, Jianbing
    Zhu, Ce
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (09) : 5541 - 5559
  • [10] Bifurcation Fusion Network for RGB-D Salient Object Detection
    Zhao, Zhi-Hua
    Chen, Li
    [J]. JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2022, 31 (12)