Synergizing triple attention with depth quality for RGB-D salient object detection

被引:1
|
作者
Song, Peipei [1 ,2 ]
Li, Wenyu [3 ]
Zhong, Peiyan [3 ]
Zhang, Jing [1 ]
Konuisz, Piotr [1 ,2 ]
Duan, Feng [3 ]
Barnes, Nick [1 ]
机构
[1] Australian Natl Univ, Canberra, ACT 2601, Australia
[2] CSIRO, Data61, Canberra, ACT 2601, Australia
[3] Nankai Univ, Coll Artificial Intelligence, Tianjin 300350, Peoples R China
基金
中国国家自然科学基金;
关键词
Salient object detection; Low quality depth; Triple attention; Multi-modal fusion; RGB-D data; FUSION;
D O I
10.1016/j.neucom.2024.127672
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Salient object refers to the conspicuous objects or regions within an image that stand out prominently from its surroundings. Depth maps are commonly utilized as supplementary inputs for salient object detection, referred to as RGB-D SOD. Due to the diverse acquisition sensors, such as infrared detectors and stereo cameras, the quality of acquired depth maps varies considerably. The low-quality depth introduces noise that seriously reduces detection accuracy. To tackle this problem, a triple attention architecture based on a 3D convolutional neural network tailored for quality-aware salient object detection is proposed in this paper, which capitalizes on the strengths across modality, channel, and spatial dimensions. The modality attention learns the quality factors based on the overall modal features. The channel attention highlights features in the dimension of channels, and the patch-level spatial attention establishes long-range dependencies. Thus, the quality factors, channel differences, and spatial contrast are combined to achieve global and local fusion. To enable the evaluations on low-quality depth maps, an assessment criterion is further introduced to categorize the RGB-D datasets. Experimental results of state -of -the -art methods on different quality levels demonstrate the proposed method's effectiveness, especially for the low-quality depth.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Bilateral Attention Network for RGB-D Salient Object Detection
    Zhang, Zhao
    Lin, Zheng
    Xu, Jun
    Jin, Wen-Da
    Lu, Shao-Ping
    Fan, Deng-Ping
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 1949 - 1961
  • [2] Triple-Complementary Network for RGB-D Salient Object Detection
    Huang, Rui
    Xing, Yan
    Zou, Yaobin
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2020, 27 : 775 - 779
  • [3] Delving into Calibrated Depth for Accurate RGB-D Salient Object Detection
    Li, Jingjing
    Ji, Wei
    Zhang, Miao
    Piao, Yongri
    Lu, Huchuan
    Cheng, Li
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 131 (04) : 855 - 876
  • [4] CDNet: Complementary Depth Network for RGB-D Salient Object Detection
    Jin, Wen-Da
    Xu, Jun
    Han, Qi
    Zhang, Yi
    Cheng, Ming-Ming
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 3376 - 3390
  • [5] Delving into Calibrated Depth for Accurate RGB-D Salient Object Detection
    Jingjing Li
    Wei Ji
    Miao Zhang
    Yongri Piao
    Huchuan Lu
    Li Cheng
    [J]. International Journal of Computer Vision, 2023, 131 : 855 - 876
  • [6] Hybrid-Attention Network for RGB-D Salient Object Detection
    Chen, Yuzhen
    Zhou, Wujie
    [J]. APPLIED SCIENCES-BASEL, 2020, 10 (17):
  • [7] Depth Quality-Inspired Feature Manipulation for Efficient RGB-D Salient Object Detection
    Zhang, Wenbo
    Ji, Ge-Peng
    Wang, Zhuo
    Fu, Keren
    Zhao, Qijun
    [J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 731 - 740
  • [8] RGB-D salient object detection: A survey
    Tao Zhou
    Deng-Ping Fan
    Ming-Ming Cheng
    Jianbing Shen
    Ling Shao
    [J]. Computational Visual Media, 2021, 7 : 37 - 69
  • [9] RGB-D salient object detection: A survey
    Tao Zhou
    Deng-Ping Fan
    Ming-Ming Cheng
    Jianbing Shen
    Ling Shao
    [J]. Computational Visual Media, 2021, 7 (01) : 37 - 69
  • [10] RGB-D salient object detection: A survey
    Zhou, Tao
    Fan, Deng-Ping
    Cheng, Ming-Ming
    Shen, Jianbing
    Shao, Ling
    [J]. COMPUTATIONAL VISUAL MEDIA, 2021, 7 (01) : 37 - 69