Feature Calibrating and Fusing Network for RGB-D Salient Object Detection

被引:8
|
作者
Zhang, Qiang [1 ,2 ]
Qin, Qi [1 ,2 ]
Yang, Yang [1 ,2 ]
Jiao, Qiang [1 ,2 ]
Han, Jungong [3 ]
机构
[1] Xidian Univ, Key Lab Elect Equipment Struct Design, Minist Educ, Xian, Peoples R China
[2] Xidian Univ, Ctr Complex Syst, Sch Mechanoelect Engn, Xian, Peoples R China
[3] Univ Sheffield, Pathol Dept, Sheffield, England
关键词
Visualization; Object detection; Image synthesis; Feature extraction; Cognition; Saliency detection; Streaming media; Salient object detection; RGB-D images; two-steps sample selection; calibration-then-fusion; region consistency aware loss;
D O I
10.1109/TCSVT.2023.3296581
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Due to their imaging mechanisms and techniques, some depth images inevitably have low visual qualities or have some inconsistent foregrounds with their corresponding RGB images. Directly using such depth images will deteriorate the performance of RGB-D SOD. In view of this, a novel RGB-D salient object detection model is presented, which follows the principle of calibration-then-fusion to effectively suppress the influence of such two types of depth images on final saliency prediction. Specifically, the proposed model is composed of two stages, i.e., an image generation stage and a saliency reasoning stage. The former generates high-quality and foreground-consistent pseudo depth images via an image generation network. While the latter first calibrates the original depth information with the aid of those newly generated pseudo depth images and then performs cross-modal feature fusion for the final saliency reasoning. Especially, in the first stage, a Two-steps Sample Selection (TSS) strategy is employed to select such reliable depth images from the original RGB-D image pairs as supervision information to optimize the image generation network. Afterwards, in the second stage, a Feature Calibrating and Fusing Network (FCFNet) is proposed to achieve the calibration-then-fusion of cross-modal information for the final saliency prediction, which is achieved by a Depth Feature Calibration (DFC) module, a Shallow-level Feature Injection (SFI) module and a Multi-modal Multi-scale Fusion (MMF) module. Moreover, a loss function, i.e., Region Consistency Aware (RCA) loss, is presented as an auxiliary loss for FCFNet to facilitate the completeness of salient objects together with the reduction of background interference by considering the local regional consistency in the saliency maps. Experiments on six benchmark datasets demonstrate the superiorities of our proposed RGB-D SOD model over some state-of-the-arts.
引用
收藏
页码:1493 / 1507
页数:15
相关论文
共 50 条
  • [41] DVSOD: RGB-D Video Salient Object Detection
    Li, Jingjing
    Ji, Wei
    Wang, Size
    Li, Wenbo
    Cheng, Li
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [42] Advancing in RGB-D Salient Object Detection: A Survey
    Chen, Ai
    Li, Xin
    He, Tianxiang
    Zhou, Junlin
    Chen, Duanbing
    [J]. APPLIED SCIENCES-BASEL, 2024, 14 (17):
  • [43] Adaptive Fusion for RGB-D Salient Object Detection
    Wang, Ningning
    Gong, Xiaojin
    [J]. IEEE ACCESS, 2019, 7 : 55277 - 55284
  • [44] SPSN: Superpixel Prototype Sampling Network for RGB-D Salient Object Detection
    Lee, Minhyeok
    Park, Chaewon
    Cho, Suhwan
    Lee, Sangyoun
    [J]. COMPUTER VISION, ECCV 2022, PT XXIX, 2022, 13689 : 630 - 647
  • [45] AFLNet: Adversarial focal loss network for RGB-D salient object detection
    Zhao, Xiaoli
    Chen, Zheng
    Hwang, Jenq-Neng
    Shang, Xiwu
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2021, 94
  • [46] Heterogeneous Fusion and Integrity Learning Network for RGB-D Salient Object Detection
    Gao, Haorao
    Su, Yiming
    Wang, Fasheng
    Li, Haojie
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (07)
  • [47] Perceptual localization and focus refinement network for RGB-D salient object detection
    Han, Jinyu
    Wang, Mengyin
    Wu, Weiyi
    Jia, Xu
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2025, 259
  • [48] Depth cue enhancement and guidance network for RGB-D salient object detection
    Li, Xiang
    Zhang, Qing
    Yan, Weiqi
    Dai, Meng
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 95
  • [49] JALNet: joint attention learning network for RGB-D salient object detection
    Gao, Xiuju
    Cui, Jianhua
    Meng, Jin
    Shi, Huaizhong
    Duan, Songsong
    Xia, Chenxing
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2024, 27 (01) : 36 - 47
  • [50] Depth-aware lightweight network for RGB-D salient object detection
    Ling, Liuyi
    Wang, Yiwen
    Wang, Chengjun
    Xu, Shanyong
    Huang, Yourui
    [J]. IET IMAGE PROCESSING, 2023, 17 (08) : 2350 - 2361