Point-aware Interaction and CNN-induced Refinement Network for RGB-D Salient Object Detection

被引:9
|
作者
Cong, Runmin [1 ,2 ,5 ]
Liu, Hongyu [1 ,6 ,7 ]
Zhang, Chen [1 ,6 ,7 ]
Zhang, Wei [2 ,5 ]
Zheng, Feng [3 ]
Song, Ran [2 ,5 ]
Kwong, Sam [4 ]
机构
[1] Beijing Jiaotong Univ, Beijing, Peoples R China
[2] Shandong Univ, Jinan, Shandong, Peoples R China
[3] Southern Univ Sci & Technol, Shenzhen, Guangdong, Peoples R China
[4] City Univ Hong Kong, Hong Kong, Peoples R China
[5] Minist Educ, Key Lab Machine Intelligence & Syst Control, Jinan, Shandong, Peoples R China
[6] Beijing Jiaotong Univ, Inst Informat Sci, Beijing, Peoples R China
[7] Beijing Key Lab Adv Informat Sci & Network Techno, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
salient object detection; RGB-D images; CNNs-assisted Transformer architecture; point-aware interaction; FUSION;
D O I
10.1145/3581783.3611982
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
By integrating complementary information from RGB image and depth map, the ability of salient object detection (SOD) for complex and challenging scenes can be improved. In recent years, the important role of Convolutional Neural Networks (CNNs) in feature extraction and cross-modality interaction has been fully explored, but it is still insufficient in modeling global long-range dependencies of self-modality and cross-modality. To this end, we introduce CNNs-assisted Transformer architecture and propose a novel RGB-D SOD network with Point-aware Interaction and CNN-induced Refinement (PICR-Net). On the one hand, considering the prior correlation between RGB modality and depth modality, an attention-triggered cross-modality point-aware interaction (CmPI) module is designed to explore the feature interaction of different modalities with positional constraints. On the other hand, in order to alleviate the block effect and detail destruction problems brought by the Transformer naturally, we design a CNN-induced refinement (CNNR) unit for content refinement and supplementation. Extensive experiments on five RGB-D SOD datasets show that the proposed network achieves competitive results in both quantitative and qualitative comparisons. Our code is publicly available at: https://github.com/rmcong/PICR-Net_ACMMM23.
引用
收藏
页码:406 / 416
页数:11
相关论文
共 50 条
  • [1] Depth-aware inverted refinement network for RGB-D salient object detection
    Gao, Lina
    Liu, Bing
    Fu, Ping
    Xu, Mingzhu
    [J]. NEUROCOMPUTING, 2023, 518 : 507 - 522
  • [2] Context-aware network for RGB-D salient object detection
    Liang, Fangfang
    Duan, Lijuan
    Ma, Wei
    Qiao, Yuanhua
    Miao, Jun
    Ye, Qixiang
    [J]. PATTERN RECOGNITION, 2021, 111
  • [3] Perceptual localization and focus refinement network for RGB-D salient object detection
    Han, Jinyu
    Wang, Mengyin
    Wu, Weiyi
    Jia, Xu
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2025, 259
  • [4] Hierarchical Alternate Interaction Network for RGB-D Salient Object Detection
    Li, Gongyang
    Liu, Zhi
    Chen, Minyu
    Bai, Zhen
    Lin, Weisi
    Ling, Haibin
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 3528 - 3542
  • [5] Hierarchical Alternate Interaction Network for RGB-D Salient Object Detection
    Li, Gongyang
    Liu, Zhi
    Chen, Minyu
    Bai, Zhen
    Lin, Weisi
    Ling, Haibin
    [J]. IEEE Transactions on Image Processing, 2021, 30 : 3528 - 3542
  • [6] Depth-aware lightweight network for RGB-D salient object detection
    Ling, Liuyi
    Wang, Yiwen
    Wang, Chengjun
    Xu, Shanyong
    Huang, Yourui
    [J]. IET IMAGE PROCESSING, 2023, 17 (08) : 2350 - 2361
  • [7] Multi-scale iterative refinement network for RGB-D salient object detection
    Liu, Ze-Yu
    Liu, Jian-Wei
    Zuo, Xin
    Hu, Ming-Fei
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 106
  • [8] Depth-Induced Gap-Reducing Network for RGB-D Salient Object Detection: An Interaction, Guidance and Refinement Approach
    Cheng, Xiaolong
    Zheng, Xuan
    Pei, Jialun
    Tang, He
    Lyu, Zehua
    Chen, Chuanbo
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 4253 - 4266
  • [9] Progressively Complementarity-aware Fusion Network for RGB-D Salient Object Detection
    Chen, Hao
    Li, Youfu
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 3051 - 3060
  • [10] RGB-D Salient Object Detection by a CNN With Multiple Layers Fusion
    Huang, Rui
    Xing, Yan
    Wang, ZeZheng
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2019, 26 (04) : 552 - 556