Specificity-preserving RGB-D saliency detection

Authors
Tao Zhou
Deng-Ping Fan
Geng Chen
Yi Zhou
Huazhu Fu
Affiliations
[1] Nanjing University of Science and Technology, School of Computer Science and Engineering
[2] Ministry of Education, Key Laboratory of System Control and Information Processing
[3] ETH Zürich, Computer Vision Lab
[4] Northwestern Polytechnical University, School of Computer Science and Engineering
[5] Southeast University, School of Computer Science and Engineering
[6] Inception Institute of Artificial Intelligence
Source
Computational Visual Media, 2023, 9 (02)
Keywords
salient object detection (SOD); RGB-D; cross-enhanced integration module (CIM); multi-modal feature aggregation (MFA)
DOI
Not available
Abstract
Salient object detection (SOD) in RGB and depth images has attracted increasing research interest. Existing RGB-D SOD models usually adopt fusion strategies to learn a shared representation from RGB and depth modalities, while few methods explicitly consider how to preserve modality-specific characteristics. In this study, we propose a novel framework, the specificity-preserving network (SPNet), which improves SOD performance by exploring both the shared information and modality-specific properties. Specifically, we use two modality-specific networks and a shared learning network to generate individual and shared saliency prediction maps. To effectively fuse cross-modal features in the shared learning network, we propose a cross-enhanced integration module (CIM) and propagate the fused feature to the next layer to integrate cross-level information. Moreover, to capture rich complementary multi-modal information to boost SOD performance, we use a multi-modal feature aggregation (MFA) module to integrate the modality-specific features from each individual decoder into the shared decoder. By using skip connections between encoder and decoder layers, hierarchical features can be fully combined. Extensive experiments demonstrate that our SPNet outperforms cutting-edge approaches on six popular RGB-D SOD and three camouflaged object detection benchmarks. The project is publicly available at https://github.com/taozh2017/SPNet.
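The data flow described in the abstract (two modality-specific branches plus a shared branch that fuses both modalities and passes the fused feature to the next level) can be illustrated with a small NumPy sketch. This is only a toy illustration under stated assumptions, not the SPNet implementation: the function names `cross_enhanced_fusion` and `predict`, the sigmoid gating, and the product/maximum combination are hypothetical choices made here for clarity, and random arrays stand in for encoder features. The actual CIM and MFA modules are defined in the paper and the linked repository.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def cross_enhanced_fusion(f_rgb, f_depth, f_prev=None):
    """Toy stand-in for a cross-modal fusion step (hypothetical, not the paper's CIM)."""
    # Each modality is re-weighted by a gate computed from the other one;
    # the residual term keeps the modality-specific content.
    rgb_enh = f_rgb + f_rgb * sigmoid(f_depth)
    depth_enh = f_depth + f_depth * sigmoid(f_rgb)
    # Product emphasizes locations where both modalities agree;
    # maximum keeps the stronger response from either modality.
    fused = np.concatenate(
        [rgb_enh * depth_enh, np.maximum(rgb_enh, depth_enh)], axis=0
    )
    # Optionally add the fused feature propagated from the previous level.
    if f_prev is not None:
        fused = fused + f_prev
    return fused


def predict(feature):
    """Collapse channels into a single-channel map with values in (0, 1)."""
    return sigmoid(feature.mean(axis=0))


# Random (channels, height, width) arrays stand in for encoder features.
rng = np.random.default_rng(0)
f_rgb = rng.standard_normal((4, 8, 8))
f_depth = rng.standard_normal((4, 8, 8))

shared = cross_enhanced_fusion(f_rgb, f_depth)

# Individual and shared saliency maps, mirroring the three outputs
# mentioned in the abstract.
maps = {
    "rgb": predict(f_rgb),
    "depth": predict(f_depth),
    "shared": predict(shared),
}
```

In this sketch the shared feature has twice as many channels as each modality-specific feature because the product and maximum terms are concatenated; a real network would typically follow this with a learned convolution to reduce the channel count.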
Pages: 297-317
Number of pages: 20
Related papers
  • [1] Specificity-preserving RGB-D Saliency Detection
    Zhou, Tao
    Fu, Huazhu
    Chen, Geng
    Zhou, Yi
    Fan, Deng-Ping
    Shao, Ling
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 4661 - 4671
  • [2] Specificity-preserving RGB-D saliency detection
    Zhou, Tao
    Fan, Deng-Ping
    Chen, Geng
    Zhou, Yi
    Fu, Huazhu
    COMPUTATIONAL VISUAL MEDIA, 2023, 9 (02) : 297 - 317
  • [3] Robust RGB-D Fusion for Saliency Detection
    Wu, Zongwei
    Gobichettipalayam, Shriarulmozhivarman
    Tamadazte, Brahim
    Allibert, Guillaume
    Paudel, Danda Pani
    Demonceaux, Cedric
    2022 INTERNATIONAL CONFERENCE ON 3D VISION, 3DV, 2022, : 403 - 413
  • [4] Uncertainty Inspired RGB-D Saliency Detection
    Zhang, Jing
    Fan, Deng-Ping
    Dai, Yuchao
    Anwar, Saeed
    Saleh, Fatemeh
    Aliakbarian, Sadegh
    Barnes, Nick
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (09) : 5761 - 5779
  • [5] Boosting RGB-D Saliency Detection by Leveraging Unlabeled RGB Images
    Wang, Xiaoqiang
    Zhu, Lei
    Tang, Siliang
    Fu, Huazhu
    Li, Ping
    Wu, Fei
    Yang, Yi
    Zhuang, Yueting
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 1107 - 1119
  • [6] Select, Supplement and Focus for RGB-D Saliency Detection
    Zhang, Miao
    Ren, Weisong
    Piao, Yongri
    Rong, Zhengkun
    Lu, Huchuan
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 3469 - 3478
  • [7] Deep RGB-D Saliency Detection Without Depth
    Zhang, Yuan-fang
    Zheng, Jiangbin
    Jia, Wenjing
    Huang, Wenfeng
    Li, Long
    Liu, Nian
    Li, Fei
    He, Xiangjian
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 755 - 767
  • [8] Exploiting Global Priors for RGB-D Saliency Detection
    Ren, Jianqiang
    Gong, Xiaojin
    Yu, Lu
    Zhou, Wenhui
    Yang, Michael Ying
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2015
  • [9] RGB-D Saliency Detection under Bayesian Framework
    Wang, Song-Tao
    Zhou, Zhen
    Qu, Han-Bing
    Li, Bin
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 1881 - 1886
  • [10] Saliency Prototype for RGB-D and RGB-T Salient Object Detection
    Zhang, Zihao
    Wang, Jie
    Han, Yahong
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 3696 - 3705