Point-aware Interaction and CNN-induced Refinement Network for RGB-D Salient Object Detection

被引:9
|
作者
Cong, Runmin [1 ,2 ,5 ]
Liu, Hongyu [1 ,6 ,7 ]
Zhang, Chen [1 ,6 ,7 ]
Zhang, Wei [2 ,5 ]
Zheng, Feng [3 ]
Song, Ran [2 ,5 ]
Kwong, Sam [4 ]
机构
[1] Beijing Jiaotong Univ, Beijing, Peoples R China
[2] Shandong Univ, Jinan, Shandong, Peoples R China
[3] Southern Univ Sci & Technol, Shenzhen, Guangdong, Peoples R China
[4] City Univ Hong Kong, Hong Kong, Peoples R China
[5] Minist Educ, Key Lab Machine Intelligence & Syst Control, Jinan, Shandong, Peoples R China
[6] Beijing Jiaotong Univ, Inst Informat Sci, Beijing, Peoples R China
[7] Beijing Key Lab Adv Informat Sci & Network Techno, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
salient object detection; RGB-D images; CNNs-assisted Transformer architecture; point-aware interaction; FUSION;
D O I
10.1145/3581783.3611982
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
By integrating complementary information from RGB image and depth map, the ability of salient object detection (SOD) for complex and challenging scenes can be improved. In recent years, the important role of Convolutional Neural Networks (CNNs) in feature extraction and cross-modality interaction has been fully explored, but it is still insufficient in modeling global long-range dependencies of self-modality and cross-modality. To this end, we introduce CNNs-assisted Transformer architecture and propose a novel RGB-D SOD network with Point-aware Interaction and CNN-induced Refinement (PICR-Net). On the one hand, considering the prior correlation between RGB modality and depth modality, an attention-triggered cross-modality point-aware interaction (CmPI) module is designed to explore the feature interaction of different modalities with positional constraints. On the other hand, in order to alleviate the block effect and detail destruction problems brought by the Transformer naturally, we design a CNN-induced refinement (CNNR) unit for content refinement and supplementation. Extensive experiments on five RGB-D SOD datasets show that the proposed network achieves competitive results in both quantitative and qualitative comparisons. Our code is publicly available at: https://github.com/rmcong/PICR-Net_ACMMM23.
引用
收藏
页码:406 / 416
页数:11
相关论文
共 50 条
  • [41] CDNet: Complementary Depth Network for RGB-D Salient Object Detection
    Jin, Wen-Da
    Xu, Jun
    Han, Qi
    Zhang, Yi
    Cheng, Ming-Ming
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 3376 - 3390
  • [42] Salient object detection for RGB-D images by generative adversarial network
    Liu, Zhengyi
    Tang, Jiting
    Xiang, Qian
    Zhao, Peng
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (35-36) : 25403 - 25425
  • [43] Scale Adaptive Fusion Network for RGB-D Salient Object Detection
    Kong, Yuqiu
    Zheng, Yushuo
    Yao, Cuili
    Liu, Yang
    Wang, He
    [J]. COMPUTER VISION - ACCV 2022, PT III, 2023, 13843 : 608 - 625
  • [44] An adaptive guidance fusion network for RGB-D salient object detection
    Haodong Sun
    Yu Wang
    Xinpeng Ma
    [J]. Signal, Image and Video Processing, 2024, 18 : 1683 - 1693
  • [45] Salient object detection for RGB-D images by generative adversarial network
    Zhengyi Liu
    Jiting Tang
    Qian Xiang
    Peng Zhao
    [J]. Multimedia Tools and Applications, 2020, 79 : 25403 - 25425
  • [46] Three-Stage Bidirectional Interaction Network for Efficient RGB-D Salient Object Detection
    Wang, Yang
    Zhang, Yanqing
    [J]. COMPUTER VISION - ACCV 2022, PT V, 2023, 13845 : 215 - 233
  • [47] DPANet: Depth Potentiality-Aware Gated Attention Network for RGB-D Salient Object Detection
    Chen, Zuyao
    Cong, Runmin
    Xu, Qianqian
    Huang, Qingming
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 7012 - 7024
  • [48] CNN-Based RGB-D Salient Object Detection: Learn, Select, and Fuse
    Hao Chen
    Youfu Li
    Yongjian Deng
    Guosheng Lin
    [J]. International Journal of Computer Vision, 2021, 129 : 2076 - 2096
  • [49] CNN-Based RGB-D Salient Object Detection: Learn, Select, and Fuse
    Chen, Hao
    Li, Youfu
    Deng, Yongjian
    Lin, Guosheng
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (07) : 2076 - 2096
  • [50] DVSOD: RGB-D Video Salient Object Detection
    Li, Jingjing
    Ji, Wei
    Wang, Size
    Li, Wenbo
    Cheng, Li
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,