RGB depth salient object detection via cross-modal attention and boundary feature guidance

Authors
Meng, Lingbing [1]
Yuan, Mengya [1]
Shi, Xuehan [1]
Zhang, Le [1]
Liu, Qingqing [1]
Ping, Dai [1]
Wu, Jinhua [1]
Cheng, Fei [1, 2]
Affiliations
[1] Anhui Inst Informat Engn, Sch Comp & Software Engn, Wuhu, Peoples R China
[2] Hangzhou Dianzi Univ, Sch Management, Hangzhou, Peoples R China
Keywords
computer vision; image processing; refinement network; context; information
DOI
10.1049/cvi2.12244
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
RGB depth (RGB-D) salient object detection (SOD) is a meaningful and challenging task. Convolutional neural networks have achieved good detection performance on simple scenes, but they cannot effectively handle scenes in which salient objects have complex contours or are similar in colour to the background. A novel end-to-end framework is proposed for RGB-D SOD, which comprises four main components: the cross-modal attention feature enhancement (CMAFE) module, the multi-level contextual feature interaction (MLCFI) module, the boundary feature extraction (BFE) module, and the multi-level boundary attention guidance (MLBAG) module. The CMAFE module retains the more effective salient features by employing a dual-attention mechanism to filter noise from the two modalities. In the MLCFI module, a shuffle operation is applied to high-level and low-level channels to promote cross-channel information communication and to extract rich semantic information. The BFE module converts salient features into boundary features to generate boundary maps. The MLBAG module produces saliency maps by aggregating multi-level boundary saliency maps to guide cross-modal features in the decoding stage. Extensive experiments are conducted on six public benchmark datasets, and the results demonstrate that the proposed model significantly outperforms 23 state-of-the-art RGB-D SOD models with regard to multiple evaluation metrics.
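The abstract does not specify how the MLCFI shuffle operation is defined. As a rough illustration only, the sketch below assumes a ShuffleNet-style channel shuffle applied to concatenated high-level and low-level channels; the function name `channel_shuffle`, the `groups` parameter, and the toy channel labels are hypothetical and are not taken from the paper.

```python
from typing import List, Sequence, TypeVar

T = TypeVar("T")


def channel_shuffle(channels: Sequence[T], groups: int) -> List[T]:
    """Interleave channels across groups (assumed ShuffleNet-style shuffle).

    Conceptually: view the channels as a (groups, per_group) grid,
    transpose it, and flatten it back to a single list.
    """
    n = len(channels)
    if groups <= 0 or n % groups != 0:
        raise ValueError("channel count must be divisible by groups")
    per_group = n // groups
    # Output position i * groups + g takes input channel g * per_group + i.
    return [channels[g * per_group + i]
            for i in range(per_group)
            for g in range(groups)]


# Hypothetical example: three high-level and three low-level channels,
# concatenated and then shuffled so the two levels are interleaved.
high = ["h0", "h1", "h2"]
low = ["l0", "l1", "l2"]
mixed = channel_shuffle(high + low, groups=2)
print(mixed)  # ['h0', 'l0', 'h1', 'l1', 'h2', 'l2']
```

After the shuffle, every pair of adjacent channels contains one high-level and one low-level channel, so a subsequent grouped or local operation would see information from both levels.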
Pages: 273-288
Number of pages: 16