Feature Calibrating and Fusing Network for RGB-D Salient Object Detection

被引：8

作者：

Zhang, Qiang ^{[1
,2
]}

Qin, Qi ^{[1
,2
]}

Yang, Yang ^{[1
,2
]}

Jiao, Qiang ^{[1
,2
]}

Han, Jungong ^{[3
]}

机构：

[1] Xidian Univ, Key Lab Elect Equipment Struct Design, Minist Educ, Xian, Peoples R China

[2] Xidian Univ, Ctr Complex Syst, Sch Mechanoelect Engn, Xian, Peoples R China

[3] Univ Sheffield, Pathol Dept, Sheffield, England

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2024年 / 34卷 / 03期

关键词：

Visualization; Object detection; Image synthesis; Feature extraction; Cognition; Saliency detection; Streaming media; Salient object detection; RGB-D images; two-steps sample selection; calibration-then-fusion; region consistency aware loss;

D O I：

10.1109/TCSVT.2023.3296581

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Due to their imaging mechanisms and techniques, some depth images inevitably have low visual qualities or have some inconsistent foregrounds with their corresponding RGB images. Directly using such depth images will deteriorate the performance of RGB-D SOD. In view of this, a novel RGB-D salient object detection model is presented, which follows the principle of calibration-then-fusion to effectively suppress the influence of such two types of depth images on final saliency prediction. Specifically, the proposed model is composed of two stages, i.e., an image generation stage and a saliency reasoning stage. The former generates high-quality and foreground-consistent pseudo depth images via an image generation network. While the latter first calibrates the original depth information with the aid of those newly generated pseudo depth images and then performs cross-modal feature fusion for the final saliency reasoning. Especially, in the first stage, a Two-steps Sample Selection (TSS) strategy is employed to select such reliable depth images from the original RGB-D image pairs as supervision information to optimize the image generation network. Afterwards, in the second stage, a Feature Calibrating and Fusing Network (FCFNet) is proposed to achieve the calibration-then-fusion of cross-modal information for the final saliency prediction, which is achieved by a Depth Feature Calibration (DFC) module, a Shallow-level Feature Injection (SFI) module and a Multi-modal Multi-scale Fusion (MMF) module. Moreover, a loss function, i.e., Region Consistency Aware (RCA) loss, is presented as an auxiliary loss for FCFNet to facilitate the completeness of salient objects together with the reduction of background interference by considering the local regional consistency in the saliency maps. Experiments on six benchmark datasets demonstrate the superiorities of our proposed RGB-D SOD model over some state-of-the-arts.

引用

页码：1493 / 1507

页数：15

共 50 条

[1] Bidirectional feature learning network for RGB-D salient object detection
Niu, Ye
Zhou, Sanping
Dong, Yonghao
Wang, Le
Wang, Jinjun
Zheng, Nanning
[J]. PATTERN RECOGNITION, 2024, 150
[2] A deep multimodal feature learning network for RGB-D salient object detection
Liang, Fangfang
Duan, Lijuan
Ma, Wei
Qiao, Yuanhua
Miao, Jun
[J]. COMPUTERS & ELECTRICAL ENGINEERING, 2021, 92
[3] Discriminative feature fusion for RGB-D salient object detection
Chen, Zeyu
Zhu, Mingyu
Chen, Shuhan
Lu, Lu
Tang, Haonan
Hu, Xuelong
Ji, Chunfan
[J]. COMPUTERS & ELECTRICAL ENGINEERING, 2023, 106
[4] AirSOD: A Lightweight Network for RGB-D Salient Object Detection
Zeng, Zhihong
Liu, Haijun
Chen, Fenglei
Tan, Xiaoheng
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (03) : 1656 - 1669
[5] Circular Complement Network for RGB-D Salient Object Detection
Bai, Zhen
Liu, Zhi
Li, Gongyang
Ye, Linwei
Wang, Yang
[J]. NEUROCOMPUTING, 2021, 451 : 95 - 106
[6] Bilateral Attention Network for RGB-D Salient Object Detection
Zhang, Zhao
Lin, Zheng
Xu, Jun
Jin, Wen-Da
Lu, Shao-Ping
Fan, Deng-Ping
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 1949 - 1961
[7] Dynamic Selective Network for RGB-D Salient Object Detection
Wen, Hongfa
Yan, Chenggang
Zhou, Xiaofei
Cong, Runmin
Sun, Yaoqi
Zheng, Bolun
Zhang, Jiyong
Bao, Yongjun
Ding, Guiguang
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 9179 - 9192
[8] DYNAMIC SELECTION NETWORK FOR RGB-D SALIENT OBJECT DETECTION
Zhou, Jinlin
Luo, Zhiming
Li, Shaozi
[J]. 2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 776 - 780
[9] Siamese Network for RGB-D Salient Object Detection and Beyond
Fu, Keren
Fan, Deng-Ping
Ji, Ge-Peng
Zhao, Qijun
Shen, Jianbing
Zhu, Ce
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (09) : 5541 - 5559
[10] Bifurcation Fusion Network for RGB-D Salient Object Detection
Zhao, Zhi-Hua
Chen, Li
[J]. JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2022, 31 (12)

← 1 2 3 4 5 →