Attention-aware Cross-modal Cross-level Fusion Network for RGB-D Salient Object Detection

Cited by: 0
Authors
Chen, Hao [1]
Li, You-Fu [1, 2]
Su, Dan [1]
Affiliations
[1] City Univ Hong Kong, Dept Mech Engn, Kowloon, Hong Kong, Peoples R China
[2] City Univ Hong Kong, Shenzhen Res Inst, Hong Kong, Peoples R China
Keywords
MODEL;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Convolutional neural networks have achieved wide success in RGB saliency detection. Recently, the advent of RGB-D sensors such as Kinect has provided additional geometric saliency cues. However, the key challenge for RGB-D salient object detection, namely how to fuse RGB and depth information sufficiently, remains under-studied. Previous works mainly follow a two-stream architecture and combine RGB and depth features or decisions at an early or late stage, and the multi-modal fusion is performed by directly concatenating the features from the two modalities without selection. In this work, we address this problem by proposing a novel network built on a key insight: a selection module is significantly helpful for a more informative and sufficient cross-modal cross-level combination. To this end, we introduce a top-down RGB-D fusion network that integrates an attention-aware cross-modal cross-level fusion block at each level to select discriminative features from each level and each modality. Extensive experiments on public datasets show that the proposed network is able to solve the key problems in RGB-D fusion and achieves state-of-the-art performance on RGB-D salient object detection.
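The abstract contrasts plain concatenation of RGB and depth features with a fusion step that first selects among them. It does not say how the selection is computed, so the sketch below is only an illustration under assumptions: it uses a squeeze-and-excitation style channel attention (global average pooling followed by two small linear layers and a sigmoid), and the function name `channel_attention_fuse`, the weight matrices `w1` and `w2`, and all shapes are hypothetical rather than taken from the paper.

```python
import numpy as np


def sigmoid(x):
    """Element-wise logistic function."""
    return 1.0 / (1.0 + np.exp(-x))


def channel_attention_fuse(rgb_feat, depth_feat, w1, w2):
    """Concatenate two feature maps and reweight the channels.

    Assumed (hypothetical) shapes:
      rgb_feat, depth_feat: (C, H, W)
      w1: (hidden, 2C), w2: (2C, hidden)
    Returns the reweighted features (2C, H, W) and the channel weights (2C,).
    """
    # Plain concatenation: the "fusion without selection" baseline.
    fused = np.concatenate([rgb_feat, depth_feat], axis=0)
    # Squeeze: one descriptor per channel via global average pooling.
    squeezed = fused.mean(axis=(1, 2))
    # Excite: a small bottleneck (ReLU, then sigmoid) gives weights in (0, 1).
    hidden = np.maximum(w1 @ squeezed, 0.0)
    weights = sigmoid(w2 @ hidden)
    # Selection: scale each channel of either modality by its weight.
    return fused * weights[:, None, None], weights


rng = np.random.default_rng(0)
C, H, W = 4, 8, 8
rgb = rng.standard_normal((C, H, W))
depth = rng.standard_normal((C, H, W))
w1 = rng.standard_normal((2, 2 * C)) * 0.1   # random stand-ins for learned weights
w2 = rng.standard_normal((2 * C, 2)) * 0.1
selected, weights = channel_attention_fuse(rgb, depth, w1, w2)
print(selected.shape)  # (8, 8, 8)
```

In a network of the kind the abstract describes, a block like this would presumably be applied at every level of a top-down decoder, with the weights learned end to end; here they are random placeholders so the example runs on its own.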
Pages: 6821 - 6826
Number of pages: 6
Related papers
50 in total
  • [1] RGB-D Salient Object Detection Based on Cross-Modal and Cross-Level Feature Fusion
    Peng, Yanbin
    Zhai, Zhinian
    Feng, Mingkun
    [J]. IEEE ACCESS, 2024, 12 : 45134 - 45146
  • [2] RGB-D Salient Object Detection Based on Cross-Modal and Cross-Level Feature Fusion
    Peng, Yanbin
    Zhai, Zhinian
    Feng, Mingkun
    [J]. IEEE Access, 2024, 12 : 45134 - 45146
  • [3] Three-stream RGB-D salient object detection network based on cross-level and cross-modal dual-attention fusion
    Meng, Lingbing
    Yuan, Mengya
    Shi, Xuehan
    Liu, Qingqing
    Cheng, Fei
    Li, Lingli
    [J]. IET IMAGE PROCESSING, 2023, 17 (11) : 3292 - 3308
  • [4] Progressive cross-level fusion network for RGB-D salient object detection
    Li, Jianbao
    Pan, Chen
    Zheng, Yilin
    Zhang, Dongping
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 104
  • [5] Discriminative Cross-Modal Transfer Learning and Densely Cross-Level Feedback Fusion for RGB-D Salient Object Detection
    Chen, Hao
    Li, Youfu
    Su, Dan
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (11) : 4808 - 4820
  • [6] Cross-Modal and Cross-Level Attention Interaction Network for Salient Object Detection
    Wang, Fasheng
    Su, Yiming
    Wang, Ruimin
    Sun, Jing
    Sun, Fuming
    Li, Haojie
    [J]. IEEE Transactions on Artificial Intelligence, 2024, 5 (06) : 2907 - 2920
  • [7] RGB-D salient object detection with asymmetric cross-modal fusion
    Yu, Ming
    Xing, Zhang-Hao
    Liu, Yi
    [J]. Kongzhi yu Juece/Control and Decision, 2023, 38 (09) : 2487 - 2495
  • [8] Cross-Modal Fusion and Progressive Decoding Network for RGB-D Salient Object Detection
    Hu, Xihang
    Sun, Fuming
    Sun, Jing
    Wang, Fasheng
    Li, Haojie
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (08) : 3067 - 3085
  • [9] Cross-modal hierarchical interaction network for RGB-D salient object detection
    Bi, Hongbo
    Wu, Ranwan
    Liu, Ziqi
    Zhu, Huihui
    Zhang, Cong
    Xiang, Tian-Zhu
    [J]. PATTERN RECOGNITION, 2023, 136
  • [10] Multi-level cross-modal interaction network for RGB-D salient object detection
    Huang, Zhou
    Chen, Huai-Xin
    Zhou, Tao
    Yang, Yun-Zhi
    Liu, Bi-Yuan
    [J]. NEUROCOMPUTING, 2021, 452 : 200 - 211