Bidirectional Attentional Interaction Networks for RGB-D salient object detection

Cited by: 3
Authors
Wei, Weiyi [1]
Xu, Mengyu [1]
Wang, Jian [1]
Luo, Xuzhe [1]
Affiliations
[1] Northwest Normal Univ, Lanzhou 730070, Gansu, Peoples R China
Keywords
RGB-D salient object detection; Cross-modality feature; Bidirectional interaction; Guidance aggregation
DOI
10.1016/j.imavis.2023.104792
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
To address insufficient cross-modality feature interaction and ineffective use of cross-modality data in RGB-D salient object detection (SOD), we propose a Bidirectional Attentional Interaction Network (BAINet) for RGB-D SOD. BAINet uses an encoder-decoder structure and performs bidirectional interaction between cross-modality features through a dual-branch progressive fusion approach. First, because the RGB and depth information streams complement each other, the bidirectional attention interaction module achieves bidirectional interaction between cross-modality features by capturing complementary cues from the two modalities. To enhance the expressiveness of the fused RGB-D features, the global feature perception module enlarges the receptive field and thereby endows the features with rich multi-scale contextual semantic information. In addition, exploring the correlation of cross-level features is vital for accurate saliency inference. Specifically, we introduce a cross-level guidance aggregation module that captures inter-layer dependencies and completes the integration of cross-level features, which effectively suppresses shallow cross-modality features and refines the saliency map during decoding. To improve the model training speed, a hybrid loss function is employed to train the multi-branch saliency inference maps simultaneously. Extensive experiments on five publicly available datasets show that the proposed model outperforms 18 state-of-the-art methods.
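The abstract does not give the internals of the bidirectional attention interaction module, so the following is only a toy sketch of the general idea of bidirectional cross-modality gating, not BAINet itself. Everything in it is an assumption made for illustration: the function names (`channel_gate`, `bidirectional_interaction`), the choice of a sigmoid over a global average as the attention signal, the residual re-weighting, and the additive fusion. Features are plain Python lists so the example runs without a deep learning framework.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def channel_gate(feature):
    """Hypothetical attention signal: global-average-pool each channel,
    then squash the result into (0, 1)."""
    return [sigmoid(sum(channel) / len(channel)) for channel in feature]


def bidirectional_interaction(rgb, depth):
    """Re-weight each modality with a gate computed from the other one.

    rgb, depth: lists of channels; each channel is a flat list of floats.
    Returns (rgb_enhanced, depth_enhanced, fused).
    """
    gate_from_depth = channel_gate(depth)  # depth guides RGB
    gate_from_rgb = channel_gate(rgb)      # RGB guides depth

    # Residual form (an assumption): keep the original value and add
    # the cross-gated copy of it.
    rgb_enh = [[v + v * g for v in ch] for ch, g in zip(rgb, gate_from_depth)]
    depth_enh = [[v + v * g for v in ch] for ch, g in zip(depth, gate_from_rgb)]

    # Simple element-wise sum as a stand-in for the real fusion step.
    fused = [[a + b for a, b in zip(r, d)] for r, d in zip(rgb_enh, depth_enh)]
    return rgb_enh, depth_enh, fused


# Two channels, two spatial positions per channel.
rgb = [[1.0, 2.0], [0.0, 0.0]]
depth = [[0.0, 0.0], [3.0, 1.0]]
rgb_enh, depth_enh, fused = bidirectional_interaction(rgb, depth)
```

The point of the sketch is the symmetry: the gate applied to the RGB branch is derived from depth, and the gate applied to the depth branch is derived from RGB, so information flows in both directions before fusion. A real model would use learned convolutional attention on tensors rather than a fixed pooling rule.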
Pages: 11