Depth Enhanced Cross-Modal Cascaded Network for RGB-D Salient Object Detection

Cited by: 4
Authors
Zhao, Zhengyun [1 ]
Huang, Ziqing [1 ]
Chai, Xiuli [1 ]
Wang, Jun [1 ]
Affiliations
[1] Henan Univ, Sch Artificial Intelligence, Zhengzhou 450046, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
RGB-D salient object detection; Convolutional neural network; Cross-modal fusion; Depth modal enhancement; FUSION; CONSISTENT; IMAGE;
DOI
10.1007/s11063-022-10886-7
CLC Number
TP18 [Theory of Artificial Intelligence];
Discipline Classification Codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The depth modality provides supplementary cues for RGB images, which substantially improves the performance of salient object detection (SOD). However, depth images are disturbed by external factors during acquisition, resulting in low quality. Moreover, because the RGB and depth modalities differ, simply fusing the two cannot fully complement the RGB modality with depth information. To enhance the quality of the depth image and integrate cross-modal information effectively, we propose a depth enhanced cross-modal cascaded network (DCCNet) for RGB-D SOD. The cascaded network comprises a depth cascaded branch, an RGB cascaded branch, and a cross-modal fusion strategy. In the depth cascaded branch, we design a depth preprocessing algorithm to enhance the quality of the depth image, and during depth feature extraction we adopt four cascaded cross-modal guided modules to guide the RGB feature extraction process. In the RGB cascaded branch, we design five cascaded residual adaptive selection modules to output the RGB features extracted at each stage. In the cross-modal fusion strategy, a cross-modal channel-wise refinement fuses the top-level features of the two modal branches. Finally, a multiscale loss is adopted to optimize network training. Experimental results on six common RGB-D SOD datasets show that the proposed DCCNet performs comparably to state-of-the-art RGB-D SOD methods.
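The channel-wise refinement mentioned in the abstract can be illustrated generically. The sketch below is a minimal NumPy toy, not the authors' actual module: the function name, the use of global average pooling with a sigmoid to derive channel weights, and the residual sum with the depth features are all illustrative assumptions about how such a fusion step is commonly built.

```python
import numpy as np

def sigmoid(x):
    # Numerically standard logistic function, maps to (0, 1).
    return 1.0 / (1.0 + np.exp(-x))

def channel_wise_refinement(rgb_feat, depth_feat):
    """Toy channel-wise cross-modal fusion (illustrative, not DCCNet's exact design).

    Both inputs are feature maps of shape (C, H, W). Depth features are
    pooled into one descriptor per channel, squashed to (0, 1), and used
    to re-weight the RGB channels; a residual sum keeps depth information.
    """
    depth_desc = depth_feat.mean(axis=(1, 2))      # global average pool -> (C,)
    weights = sigmoid(depth_desc)                  # per-channel attention in (0, 1)
    refined = rgb_feat * weights[:, None, None]    # broadcast weights over H, W
    return refined + depth_feat                    # residual cross-modal fusion

# Example: fuse two 64-channel 8x8 feature maps.
rgb = np.random.rand(64, 8, 8).astype(np.float32)
depth = np.random.rand(64, 8, 8).astype(np.float32)
fused = channel_wise_refinement(rgb, depth)
print(fused.shape)  # (64, 8, 8)
```

The output keeps the input shape, so a fusion step like this can sit between any two same-sized encoder branches without changing the decoder that follows.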
Pages: 361 - 384
Page count: 24
Related Papers
50 records in total
  • [1] Depth Enhanced Cross-Modal Cascaded Network for RGB-D Salient Object Detection
    Zhengyun Zhao
    Ziqing Huang
    Xiuli Chai
    Jun Wang
    NEURAL PROCESSING LETTERS, 2023, 55: 361 - 384
  • [2] Cross-modal hierarchical interaction network for RGB-D salient object detection
    Bi, Hongbo
    Wu, Ranwan
    Liu, Ziqi
    Zhu, Huihui
    Zhang, Cong
    Xiang, Tian-Zhu
    PATTERN RECOGNITION, 2023, 136
  • [3] Cross-Modal Fusion and Progressive Decoding Network for RGB-D Salient Object Detection
    Hu, Xihang
    Sun, Fuming
    Sun, Jing
    Wang, Fasheng
    Li, Haojie
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (08) : 3067 - 3085
  • [4] Lightweight cross-modal transformer for RGB-D salient object detection
    Huang, Nianchang
    Yang, Yang
    Zhang, Qiang
    Han, Jungong
    Huang, Jin
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 249
  • [5] RGB-D salient object detection with asymmetric cross-modal fusion
    Yu M.
    Xing Z.-H.
    Liu Y.
    Kongzhi yu Juece/Control and Decision, 2023, 38 (09): 2487 - 2495
  • [6] Cross-modal refined adjacent-guided network for RGB-D salient object detection
    Bi H.
    Zhang J.
    Wu R.
    Tong Y.
    Jin W.
    Multimedia Tools Appl, 24: 37453 - 37478
  • [7] Multi-level cross-modal interaction network for RGB-D salient object detection
    Huang, Zhou
    Chen, Huai-Xin
    Zhou, Tao
    Yang, Yun-Zhi
    Liu, Bi-Yuan
    NEUROCOMPUTING, 2021, 452 : 200 - 211
  • [8] Global Guided Cross-Modal Cross-Scale Network for RGB-D Salient Object Detection
    Wang, Shuaihui
    Jiang, Fengyi
    Xu, Boqian
    SENSORS, 2023, 23 (16)
  • [9] Disentangled Cross-Modal Transformer for RGB-D Salient Object Detection and Beyond
    Chen, Hao
    Shen, Feihong
    Ding, Ding
    Deng, Yongjian
    Li, Chao
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 1699 - 1709
  • [10] Joint Cross-Modal and Unimodal Features for RGB-D Salient Object Detection
    Huang, Nianchang
    Liu, Yi
    Zhang, Qiang
    Han, Jungong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 2428 - 2441