A cascaded refined RGB-D salient object detection network based on the attention mechanism

Cited by: 0
Authors
Guanyu Zong
Longsheng Wei
Siyuan Guo
Yongtao Wang
Affiliations
[1] School of Automation, China University of Geosciences
[2] Hubei Key Laboratory of Advanced Control and Intelligent Automation for Complex Systems
[3] Key Laboratory of Geological Survey and Evaluation of Ministry of Education
Source
Applied Intelligence | 2023 / Vol. 53 (11)
Keywords
Cascaded refined network; Adaptive channel transformation ratio; Contextual feature aggregation; Hybrid loss function
DOI
Not available
Abstract
RGB-D salient object detection simulates human attention behavior and attempts to locate the most visually prominent object(s) in a set of RGB and depth images. Existing works often follow a deterministic decoding network, and few methods explicitly consider how to establish connections between features at different levels. To this end, we propose a cascaded refined RGB-D salient object detection network based on the attention mechanism (CRNet), whose primary contribution is a cascaded refined upsampling network layout. Specifically, we introduce an adaptive channel transformation ratio α in the micro modification module of convolutional block attention (MM), which adjusts the feature channel conversion ratio according to the level of the original input depth feature to maximize the integration of contextual information during the feature extraction phase. For multi-modal feature interaction, we propose a contextual feature aggregation module (ACF) consisting of separable convolution, dilated convolution, and adaptive average pooling, which extends the receptive fields of the multi-modal fused features, reduces redundant information, and decreases background noise interference. Furthermore, we propose a cascaded refined upsampling network, a precise refining process that includes personal refinement, team expansion, and sequential execution operations, most of which are performed in a new sequential refinement module based on the attention mechanism (SRM-Wm). CRNet is trained under the supervision of a new hybrid loss function.
The experimental results show that our model is structurally simple yet very effective: it outperforms 19 state-of-the-art methods on six public datasets across four metrics (∼1.6% improvement in F-measure over the top-ranked model, BBSNet-TIP2021). The code and results of our method are available at https://github.com/guanyuzong/CR-Net.
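The abstract reports its headline gain in F-measure but does not say which variant (maximum, mean, or adaptive-threshold) or which weighting is used. As a rough illustration only, the sketch below computes a fixed-threshold F-measure for a single saliency map. The function name `f_measure`, the threshold of 0.5, and the weighting β² = 0.3 (a common convention in salient object detection) are assumptions, not details taken from the paper or its repository.

```python
def f_measure(pred, gt, threshold=0.5, beta_sq=0.3):
    """Fixed-threshold F-measure of a saliency map against a binary mask.

    pred: 2D list of saliency scores in [0, 1].
    gt:   2D list of 0/1 ground-truth labels with the same shape.
    threshold and beta_sq are assumed values, not taken from the paper.
    """
    tp = fp = fn = 0
    for pred_row, gt_row in zip(pred, gt):
        for p, g in zip(pred_row, gt_row):
            predicted = p >= threshold  # binarize the saliency score
            if predicted and g:
                tp += 1
            elif predicted:
                fp += 1
            elif g:
                fn += 1
    if tp == 0:
        # No true positives: precision and recall are zero or undefined.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    # Weighted harmonic mean; beta_sq < 1 emphasizes precision over recall.
    return (1 + beta_sq) * precision * recall / (beta_sq * precision + recall)


# Toy 3x3 example (hypothetical data): 3 true positives, 1 false positive,
# 1 false negative, so precision and recall are both 0.75.
pred = [[0.9, 0.8, 0.1],
        [0.7, 0.2, 0.1],
        [0.1, 0.1, 0.6]]
gt = [[1, 1, 0],
      [1, 1, 0],
      [0, 0, 0]]
score = f_measure(pred, gt)
```

In this toy case precision and recall are equal, so the F-measure equals both regardless of the weighting. Benchmark protocols typically sweep the threshold and average over a whole dataset, which this sketch leaves out.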
Pages: 13527-13548
Page count: 21