Attentive Cross-Modal Fusion Network for RGB-D Saliency Detection

Cited by: 21
Authors
Liu, Di [1]
Zhang, Kao [1]
Chen, Zhenzhong [1]
Affiliations
[1] Wuhan Univ, Sch Remote Sensing & Informat Engn, Wuhan 430079, Peoples R China
Funding
National Natural Science Foundation of China; National Key Research and Development Program of China
Keywords
Object detection; Saliency detection; Feature extraction; Fuses; Visualization; Computational modeling; Semantics; Cross-modal attention; residual attention; fusion refinement network; RGB-D salient object detection; OBJECT DETECTION; MODEL; DISPARITY; FIXATION;
DOI
10.1109/TMM.2020.2991523
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
In this paper, an attentive cross-modal fusion (ACMF) network is proposed for RGB-D salient object detection. The proposed method selectively fuses features in a cross-modal manner and uses a fusion refinement module to fuse output features from different resolutions. Our attentive cross-modal fusion network is built based on residual attention. In each level of ResNet output, both the RGB and depth features are turned into an identity map and a weighted attention map. The identity map is reweighted by the attention map of the paired modality. Moreover, the lower level features with higher resolution are adopted to refine the boundary of detected targets. The entire architecture can be trained end-to-end. The proposed ACMF is compared with state-of-the-art methods on eight recent datasets. The results demonstrate that our model can achieve advanced performance on RGB-D salient object detection.
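The cross-modal reweighting described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the exact formulation, so the code assumes the common residual-attention form `out = identity * (1 + sigmoid(attention_logit))`, treats each feature map as a flat list of floats, and merges the two reweighted streams by element-wise sum. The names `cross_modal_reweight` and `fuse_level` are hypothetical.

```python
import math


def sigmoid(x):
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)


def cross_modal_reweight(identity, paired_logits):
    """Reweight one modality's identity map with the paired modality's attention.

    Assumption (not stated in the abstract): residual attention of the form
    out = identity * (1 + sigmoid(logit)), so an attention weight near zero
    leaves the identity features unchanged instead of suppressing them.
    """
    if len(identity) != len(paired_logits):
        raise ValueError("feature maps must have the same length")
    return [f * (1.0 + sigmoid(a)) for f, a in zip(identity, paired_logits)]


def fuse_level(rgb_identity, rgb_logits, depth_identity, depth_logits):
    """Fuse RGB and depth features at one backbone level.

    RGB features are reweighted by the depth attention and vice versa.
    Assumption: the two reweighted streams are merged by element-wise sum.
    """
    rgb_out = cross_modal_reweight(rgb_identity, depth_logits)
    depth_out = cross_modal_reweight(depth_identity, rgb_logits)
    return [r + d for r, d in zip(rgb_out, depth_out)]


if __name__ == "__main__":
    fused = fuse_level([1.0, 2.0], [0.0, 0.0], [0.5, 0.5], [0.0, 0.0])
    print(fused)
```

Under this assumed form, the attention map of the paired modality can only amplify features (by a factor between 1 and 2), which is the usual motivation for the residual term: a poor attention map degrades gracefully to the identity features.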
Pages: 967-981
Number of pages: 15