Depth Enhanced Cross-Modal Cascaded Network for RGB-D Salient Object Detection

Cited by: 4
Authors
Zhao, Zhengyun [1 ]
Huang, Ziqing [1 ]
Chai, Xiuli [1 ]
Wang, Jun [1 ]
Affiliations
[1] Henan Univ, Sch Artificial Intelligence, Zhengzhou 450046, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
RGB-D salient object detection; Convolutional neural network; Cross-modal fusion; Depth modal enhancement; FUSION; CONSISTENT; IMAGE;
DOI
10.1007/s11063-022-10886-7
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The depth modality can provide supplementary features for RGB images, which substantially improves the performance of salient object detection (SOD). However, depth images are disturbed by external factors during acquisition, which results in low-quality depth maps. Moreover, the RGB and depth modalities differ, so simply fusing the two cannot fully integrate the depth information into the RGB modality. To enhance the quality of the depth image and integrate the cross-modal information effectively, we propose a depth enhanced cross-modal cascaded network (DCCNet) for RGB-D SOD. The cascaded network comprises a depth cascaded branch, an RGB cascaded branch, and a cross-modal fusion strategy. In the depth cascaded branch, we design a depth preprocessing algorithm to enhance the quality of the depth image, and during depth feature extraction we adopt four cascaded cross-modal guided modules to guide the RGB feature extraction process. In the RGB cascaded branch, we design five cascaded residual adaptive selection modules to output the RGB image features extracted at each stage. In the cross-modal fusion strategy, a cross-modal channel-wise refinement is adopted to fuse the top-level features of the two modality branches. Finally, a multiscale loss is adopted to optimize network training. Experimental results on six common RGB-D SOD datasets show that the performance of the proposed DCCNet is comparable to that of state-of-the-art RGB-D SOD methods.
Pages: 361-384
Number of pages: 24
Related papers
50 items in total
  • [31] A cascaded refined rgb-d salient object detection network based on the attention mechanism
    Zong, Guanyu
    Wei, Longsheng
    Guo, Siyuan
    Wang, Yongtao
    [J]. APPLIED INTELLIGENCE, 2023, 53 (11) : 13527 - 13548
  • [32] A cascaded refined rgb-d salient object detection network based on the attention mechanism
    Guanyu Zong
    Longsheng Wei
    Siyuan Guo
    Yongtao Wang
    [J]. Applied Intelligence, 2023, 53 : 13527 - 13548
  • [33] Asymmetric cross-modal activation network for RGB-T salient object detection
    Xu, Chang
    Li, Qingwu
    Zhou, Qingkai
    Jiang, Xiongbiao
    Yu, Dabing
    Zhou, Yaqin
    [J]. KNOWLEDGE-BASED SYSTEMS, 2022, 258
  • [34] ECW-EGNet: Exploring Cross-Modal Weighting and Edge-Guided Decoder Network for RGB-D Salient Object Detection
    Xia, Chenxing
    Yang, Feng
    Duan, Songsong
    Gao, Xiuju
    Ge, Bin
    Li, Kuan-Ching
    Fang, Xianjin
    Zhang, Yan
    Yang, Ke
    [J]. COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2024, 21 (03)
  • [35] RGB depth salient object detection via cross-modal attention and boundary feature guidance
    Meng, Lingbing
    Yuan, Mengya
    Shi, Xuehan
    Zhang, Le
    Liu, Qingqing
    Ping, Dai
    Wu, Jinhua
    Cheng, Fei
    [J]. IET COMPUTER VISION, 2024, 18 (02) : 273 - 288
  • [36] Multi-modal fusion network with multi-scale multi-path and cross-modal interactions for RGB-D salient object detection
    Chen, Hao
    Li, Youfu
    Su, Dan
    [J]. PATTERN RECOGNITION, 2019, 86 : 376 - 385
  • [37] Discriminative Cross-Modal Transfer Learning and Densely Cross-Level Feedback Fusion for RGB-D Salient Object Detection
    Chen, Hao
    Li, Youfu
    Su, Dan
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (11) : 4808 - 4820
  • [38] Modal-Adaptive Gated Recoding Network for RGB-D Salient Object Detection
    Zhu, Jinchao
    Zhang, Xiaoyu
    Fang, Xian
    Dong, Feng
    Qiu, Yu
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 359 - 363
  • [39] Absolute and Relative Depth-Induced Network for RGB-D Salient Object Detection
    Kong, Yuqiu
    Wang, He
    Kong, Lingwei
    Liu, Yang
    Yao, Cuili
    Yin, Baocai
    [J]. SENSORS, 2023, 23 (07)
  • [40] Three-stream RGB-D salient object detection network based on cross-level and cross-modal dual-attention fusion
    Meng, Lingbing
    Yuan, Mengya
    Shi, Xuehan
    Liu, Qingqing
    Cheng, Fei
    Li, Lingli
    [J]. IET IMAGE PROCESSING, 2023, 17 (11) : 3292 - 3308