RCNet: Related Context-Driven Network with Hierarchical Attention for Salient Object Detection

被引:4
|
作者
Xia, Chenxing [1 ]
Sun, Yanguang [1 ,2 ]
Li, Kuan-Ching [3 ]
Ge, Bin [1 ]
Zhang, Hanling [4 ]
Jiang, Bo [5 ]
Zhang, Ji [6 ]
机构
[1] Anhui Univ Sci & Technol, Coll Comp Sci & Engn, Huainan, Anhui, Peoples R China
[2] Nanjing Univ Sci & Technol, Coll Comp Sci & Engn, Nanjing, Jiangsu, Peoples R China
[3] Providence Univ, Dept Comp Sci & Informat Engn, Taichung, Taiwan
[4] Hunan Univ, Sch Design, Changsha 410082, Peoples R China
[5] Anhui Univ, Sch Comp Sci & Technol, Hefei 230601, Peoples R China
[6] Univ Southern Queensland, Sch Math Phys & Comp, Brisbane, Qld, Australia
关键词
Attention mechanism; Multi-scale contextual information; Salient object detection; MODEL;
D O I
10.1016/j.eswa.2023.121441
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent progress in salient object detection (SOD) mainly depends on dilated convolution with different receptive fields to capture contextual information for multi-scale learning. Intuitively, contextual information in different scales is conducive to understanding the image content, and thus can help us identify and locate salient objects in real-world scenes. However, the sparsity inside the dilated convolution kernel may cause the problem of local information loss, limiting the predictive accuracy of the model. In addition, the inequality of feature channels should also be considered, and they often feature different deviations for salient objects or background noises. Although some channel attention mechanisms have been proposed in SOD, their ability to capture global information is limited, and the problem of high complexity is still a great challenge. To alleviate the abovementioned problems, we propose a Related Context-Driven Network (RCNet) with Hierarchical Attention for Salient Object Detection, consisting of a cascaded multi-scale context exploration (CMCE) module and a hierarchical feature aggregation (HFA) module. The CMCE module is to capture multi-scale contextual information through using multi-receptive-field dilated convolutions in a diamond hierarchical structure, where a feature reconstruction operation is deployed to improve the correlation of features, effectively avoiding the gridding problems and local information loss. Meanwhile, the HFA module adaptively interacts with the complementary information of the multi-level features to further capture the important information from within the feature channel by a multi-source hybrid channel attention (MHCA) mechanism to generate powerful and robust feature representations. Extensive experiments on six benchmark datasets demonstrate that the proposed RCNet method consistently outperforms 20 existing the state-of-the-art SOD methods in terms of accuracy, generalization capacity and robustness.
引用
下载
收藏
页数:14
相关论文
共 50 条
  • [1] Flow driven attention network for video salient object detection
    Zhou, Feng
    Shuai, Hui
    Liu, Qingshan
    Guo, Guodong
    IET IMAGE PROCESSING, 2020, 14 (06) : 997 - 1004
  • [2] CDD-Net: A Context-Driven Detection Network for Multiclass Object Detection
    Wu, Yulin
    Zhang, Ke
    Wang, Jingyu
    Wang, Yezi
    Wang, Qi
    Li, Qiang
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [3] Hierarchical U-Shape Attention Network for Salient Object Detection
    Zhou, Sanping
    Wang, Jinjun
    Zhang, Jimuyang
    Wang, Le
    Huang, Dong
    Du, Shaoyi
    Zheng, Nanning
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 8417 - 8428
  • [4] Salient Object Detection Using Recurrent Guidance Network With Hierarchical Attention Features
    Lu, Shanmei
    Guo, Qiang
    Zhang, Yongxia
    IEEE ACCESS, 2020, 8 : 151325 - 151334
  • [5] Hierarchical Feature Fusion Network for Salient Object Detection
    Li, Xuelong
    Song, Dawei
    Dong, Yongsheng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 (29) : 9165 - 9175
  • [6] Hierarchical Salient Object Detection Network with Dense Connections
    Zhang, Qing
    Shi, Jianchen
    Zuo, Baochuan
    Dai, Meng
    Dong, Tianzhen
    Qi, Xiao
    IMAGE AND GRAPHICS, ICIG 2019, PT I, 2019, 11901 : 454 - 466
  • [7] Curiosity-Driven Salient Object Detection With Fragment Attention
    Wang, Zheng
    Wang, Pengzhi
    Han, Yahong
    Zhang, Xue
    Sun, Meijun
    Tian, Qi
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 5989 - 6001
  • [8] Neural Related Work Summarization with a Joint Context-driven Attention Mechanism
    Wang, Yongzhen
    Liu, Xiaozhong
    Gao, Zheng
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 1776 - 1786
  • [9] Context driven focus of attention for object detection
    Perko, Roland
    Leonardis, Ales
    ATTENTION IN COGNITIVE SYSTEMS: THEORIES AND SYSTEMS FROM AN INTERDISCIPLINARY VIEWPOINT, 2007, 4840 : 216 - 233
  • [10] CONTEXT-DRIVEN MOVING OBJECT DETECTION IN AERIAL SCENES WITH USER INPUT
    Guilmart, C.
    Herbin, S.
    Perez, P.
    2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011, : 1781 - 1784