Multi-level cross-modal interaction network for RGB-D salient object detection

被引:26
|
作者
Huang, Zhou [1 ]
Chen, Huai-Xin [1 ]
Zhou, Tao [2 ]
Yang, Yun-Zhi [3 ]
Liu, Bi-Yuan [1 ]
机构
[1] Univ Elect Sci & Technol China, Chengdu, Peoples R China
[2] Nanjing Univ Sci & Technol, Nanjing, Peoples R China
[3] CETC Special Mission Aircraft Syst Engn Co Ltd, Chengdu, Peoples R China
关键词
Salient object detection; RGB-D; Cross-modal feature learning; Multi-level interactive integration; FUSION;
D O I
10.1016/j.neucom.2021.04.053
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Depth cues with affluent spatial information have been proven beneficial in boosting salient object detection (SOD), while the depth quality directly affects the subsequent SOD performance. However, it is inevitable to obtain some low-quality depth cues due to thelimitations of its acquisition devices, which can inhibit the SOD performance. Besides, existing methods tend to combine RGB images and depth cues in a direct fusion or a simple fusion module, making them not effectively exploit the complex correlations between the two sources. Moreover, few methods design an appropriate module to fully fuse multi-level features, resulting in cross-level feature interaction insufficient. To address these issues, we propose a novel Multi-level Cross-modal Interaction Network (MCI-Net) for RGB-D based SOD. Our MCI-Net includes two key components: 1) a cross-modal feature learning network, which is used to learn the high-level features for the RGB images and depth cues, effectively enabling the correlations between the two sources to be exploited; and 2) a multi-level interactive integration network, which integrates multilevel cross-modal features to boost the SOD performance. Extensive experiments on six benchmark data sets demonstrate the superiority of our MCI-Net over 14 state-of-the-art methods, and validate the effectiveness of different components in our MCI-Net. More important, our MCI-Net significantly improves the SOD performance as well as has a higher FPS. (c) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页码:200 / 211
页数:12
相关论文
共 50 条
  • [31] Cross-Stage Multi-Scale Interaction Network for RGB-D Salient Object Detection
    Yi, Kang
    Zhu, Jinchao
    Guo, Fu
    Xu, Jing
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 2402 - 2406
  • [32] RGB-D Salient Object Detection Based on Cross-Modal Fusion and Boundary Deformable Convolution Guidance
    Meng L.-B.
    Yuan M.-Y.
    Shi X.-H.
    Zhang L.
    Wu J.-H.
    Cheng F.
    [J]. Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2023, 51 (11): : 3155 - 3166
  • [33] Three-stream RGB-D salient object detection network based on cross-level and cross-modal dual-attention fusion
    Meng, Lingbing
    Yuan, Mengya
    Shi, Xuehan
    Liu, Qingqing
    Cheng, Fei
    Li, Lingli
    [J]. IET IMAGE PROCESSING, 2023, 17 (11) : 3292 - 3308
  • [34] Progressive cross-level fusion network for RGB-D salient object detection
    Li, Jianbao
    Pan, Chen
    Zheng, Yilin
    Zhang, Dongping
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 104
  • [35] Cross-modality Discrepant Interaction Network for RGB-D Salient Object Detection
    Zhang, Chen
    Cong, Runmin
    Lin, Qinwei
    Ma, Lin
    Li, Feng
    Zhao, Yao
    Kwong, Sam
    [J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 2094 - 2102
  • [36] BMFNet: Bifurcated multi-modal fusion network for RGB-D salient object detection
    Sun, Chenwang
    Zhang, Qing
    Zhuang, Chenyu
    Zhang, Mingqian
    [J]. IMAGE AND VISION COMPUTING, 2024, 147
  • [37] Cross-Modal Attentional Context Learning for RGB-D Object Detection
    Li, Guanbin
    Gan, Yukang
    Wu, Hejun
    Xiao, Nong
    Lin, Liang
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (04) : 1591 - 1601
  • [38] Cross-Modal Adaptation for RGB-D Detection
    Hoffman, Judy
    Gupta, Saurabh
    Leong, Jian
    Guadarrama, Sergio
    Darrell, Trevor
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2016, : 5032 - 5039
  • [39] Hierarchical Alternate Interaction Network for RGB-D Salient Object Detection
    Li, Gongyang
    Liu, Zhi
    Chen, Minyu
    Bai, Zhen
    Lin, Weisi
    Ling, Haibin
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 3528 - 3542
  • [40] Hierarchical Alternate Interaction Network for RGB-D Salient Object Detection
    Li, Gongyang
    Liu, Zhi
    Chen, Minyu
    Bai, Zhen
    Lin, Weisi
    Ling, Haibin
    [J]. IEEE Transactions on Image Processing, 2021, 30 : 3528 - 3542