Depth cues with rich spatial information have proven beneficial in boosting salient object detection (SOD), and the depth quality directly affects the subsequent SOD performance. However, low-quality depth cues are inevitable due to the limitations of acquisition devices, and they can degrade SOD performance. Besides, existing methods tend to combine RGB images and depth cues through direct fusion or a simple fusion module, which prevents them from effectively exploiting the complex correlations between the two sources. Moreover, few methods design an appropriate module to fully fuse multi-level features, resulting in insufficient cross-level feature interaction. To address these issues, we propose a novel Multi-level Cross-modal Interaction Network (MCI-Net) for RGB-D based SOD. Our MCI-Net includes two key components: 1) a cross-modal feature learning network, which learns high-level features from the RGB images and depth cues so that the correlations between the two sources can be effectively exploited; and 2) a multi-level interactive integration network, which integrates multi-level cross-modal features to boost the SOD performance. Extensive experiments on six benchmark datasets demonstrate the superiority of our MCI-Net over 14 state-of-the-art methods and validate the effectiveness of the different components in our MCI-Net. More importantly, our MCI-Net significantly improves SOD performance while also achieving a higher FPS.
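To make the two-component design concrete, below is a minimal PyTorch sketch of the general pattern the abstract describes: per-level cross-modal fusion of RGB and depth features, followed by top-down integration across levels. This is an illustrative sketch only; the module names (`CrossModalFusion`, `MultiLevelIntegration`), the gated-attention fusion, and all layer choices are our assumptions, not the actual MCI-Net architecture, which the abstract does not specify.

```python
import torch
import torch.nn as nn


class CrossModalFusion(nn.Module):
    """Fuse RGB and depth features at one level (hypothetical sketch).

    A channel gate computed from both modalities down-weights the depth
    stream, reflecting the abstract's concern about low-quality depth.
    """

    def __init__(self, channels):
        super().__init__()
        self.gate = nn.Sequential(
            nn.AdaptiveAvgPool2d(1),
            nn.Conv2d(2 * channels, channels, kernel_size=1),
            nn.Sigmoid(),
        )
        self.merge = nn.Conv2d(2 * channels, channels, kernel_size=3, padding=1)

    def forward(self, rgb_feat, depth_feat):
        cat = torch.cat([rgb_feat, depth_feat], dim=1)
        gated_depth = depth_feat * self.gate(cat)  # suppress unreliable depth
        return self.merge(torch.cat([rgb_feat, gated_depth], dim=1))


class MultiLevelIntegration(nn.Module):
    """Top-down integration of fused features from several levels."""

    def __init__(self, channels):
        super().__init__()
        self.refine = nn.Conv2d(channels, channels, kernel_size=3, padding=1)
        self.up = nn.Upsample(scale_factor=2, mode="bilinear", align_corners=False)

    def forward(self, feats):
        # feats: list ordered deep (low resolution) -> shallow (high resolution)
        out = feats[0]
        for f in feats[1:]:
            out = self.refine(self.up(out) + f)
        return out


# Usage with dummy three-level backbone features (batch 1, 64 channels):
fusion = CrossModalFusion(64)
integrate = MultiLevelIntegration(64)
rgb = [torch.randn(1, 64, s, s) for s in (8, 16, 32)]
dep = [torch.randn(1, 64, s, s) for s in (8, 16, 32)]
fused = [fusion(r, d) for r, d in zip(rgb, dep)]
saliency_feat = integrate(fused)  # (1, 64, 32, 32), before a prediction head
```

In this sketch a single fusion module is shared across levels for brevity; a real network would typically instantiate one per backbone stage and attach a 1x1 convolution head to produce the saliency map.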