The RGB-D salient object detection algorithm simulates human attention behavior and attempts to locate the most visually prominent object(s) in a pair of RGB and depth images. Existing works often follow a deterministic decoding network, and few methods explicitly consider how to establish connections between features at different levels. To this end, we propose a cascaded refined RGB-D salient object detection network based on the attention mechanism (CRNet), whose primary contribution is a cascaded refined upsampling network layout. Specifically, we develop an adaptive channel transformation ratio α in the micro-modification module of convolutional block attention (MM), which adaptively adjusts the feature channel conversion ratio according to the level of the original input depth feature so as to maximize the integration of contextual information during the feature extraction phase. For multi-modal feature interaction, we propose a contextual feature aggregation module (ACF) consisting of separable convolution, dilated convolution, and adaptive average pooling, which enlarges the receptive fields of the multi-modal fused features, reduces redundant information, and suppresses background noise. Furthermore, we propose a novel cascaded refined upsampling network, a precise refining process comprising personal refinement, team expansion, and sequential execution operations, most of which are carried out in a new attention-based sequential refinement module (SRM-Wm). CRNet is trained under the supervision of a new hybrid loss function. Experimental results show that our model is structurally simple yet highly effective, outperforming 19 state-of-the-art methods on six public datasets under four metrics (∼1.6% improvement in F-measure over the top-ranked model, BBSNet-TIP2021). The code and results of our method are available at https://github.com/guanyuzong/CR-Net.
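To make the ACF design described above concrete, the following is a minimal PyTorch sketch of a three-branch aggregation block of this kind. The class and parameter names (`ACFSketch`, `channels`) are hypothetical; this illustrates the general combination of separable convolution, dilated convolution, and adaptive average pooling rather than the authors' implementation, which is available at the repository linked above.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ACFSketch(nn.Module):
    """Hypothetical sketch of a contextual feature aggregation block:
    separable-conv, dilated-conv, and global-pooling branches fused by
    a 1x1 convolution. Not the authors' implementation."""

    def __init__(self, channels: int):
        super().__init__()
        # Depthwise-separable convolution branch: local context, few parameters.
        self.separable = nn.Sequential(
            nn.Conv2d(channels, channels, 3, padding=1, groups=channels),
            nn.Conv2d(channels, channels, 1),
        )
        # Dilated convolution branch: enlarged receptive field.
        self.dilated = nn.Conv2d(channels, channels, 3, padding=2, dilation=2)
        # Adaptive average pooling branch: image-level context.
        self.pool = nn.AdaptiveAvgPool2d(1)
        self.pool_proj = nn.Conv2d(channels, channels, 1)
        # 1x1 convolution fusing the three branches back to `channels`.
        self.fuse = nn.Conv2d(3 * channels, channels, 1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        s = self.separable(x)
        d = self.dilated(x)
        g = self.pool_proj(self.pool(x))
        # Broadcast the pooled global descriptor back to the spatial size.
        g = F.interpolate(g, size=x.shape[-2:], mode="nearest")
        return self.fuse(torch.cat([s, d, g], dim=1))

# Usage: aggregate a fused RGB-D feature map with 64 channels.
feat = torch.randn(2, 64, 28, 28)
out = ACFSketch(64)(feat)  # shape: (2, 64, 28, 28)
```

Each branch contributes context at a different scale: the separable convolution captures local detail cheaply, the dilated convolution widens the receptive field without adding parameters over a plain 3×3 convolution, and the pooled branch injects global context, which is one plausible way such a module can suppress background noise in the fused features.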