RCNet: Related Context-Driven Network with Hierarchical Attention for Salient Object Detection

Cited by: 4
Authors
Xia, Chenxing [1]
Sun, Yanguang [1, 2]
Li, Kuan-Ching [3]
Ge, Bin [1]
Zhang, Hanling [4]
Jiang, Bo [5]
Zhang, Ji [6]
Affiliations
[1] Anhui Univ Sci & Technol, Coll Comp Sci & Engn, Huainan, Anhui, Peoples R China
[2] Nanjing Univ Sci & Technol, Coll Comp Sci & Engn, Nanjing, Jiangsu, Peoples R China
[3] Providence Univ, Dept Comp Sci & Informat Engn, Taichung, Taiwan
[4] Hunan Univ, Sch Design, Changsha 410082, Peoples R China
[5] Anhui Univ, Sch Comp Sci & Technol, Hefei 230601, Peoples R China
[6] Univ Southern Queensland, Sch Math Phys & Comp, Brisbane, Qld, Australia
Keywords
Attention mechanism; Multi-scale contextual information; Salient object detection; MODEL
DOI
10.1016/j.eswa.2023.121441
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Recent progress in salient object detection (SOD) mainly depends on dilated convolutions with different receptive fields to capture contextual information for multi-scale learning. Intuitively, contextual information at different scales is conducive to understanding image content, and thus helps identify and locate salient objects in real-world scenes. However, the sparsity inside the dilated convolution kernel may cause local information loss, limiting the predictive accuracy of the model. In addition, the inequality of feature channels should also be considered, since channels often show different biases toward salient objects or background noise. Although some channel attention mechanisms have been proposed for SOD, their ability to capture global information is limited, and their high complexity remains a great challenge. To alleviate these problems, we propose a Related Context-Driven Network (RCNet) with Hierarchical Attention for Salient Object Detection, consisting of a cascaded multi-scale context exploration (CMCE) module and a hierarchical feature aggregation (HFA) module. The CMCE module captures multi-scale contextual information using multi-receptive-field dilated convolutions in a diamond hierarchical structure, where a feature reconstruction operation is deployed to improve the correlation of features, effectively avoiding the gridding problem and local information loss. Meanwhile, the HFA module adaptively exchanges complementary information among the multi-level features and further captures important information within the feature channels through a multi-source hybrid channel attention (MHCA) mechanism, generating powerful and robust feature representations. Extensive experiments on six benchmark datasets demonstrate that the proposed RCNet consistently outperforms 20 existing state-of-the-art SOD methods in terms of accuracy, generalization capacity, and robustness.
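The gridding problem mentioned in the abstract can be illustrated with a small, generic sketch. It is not the RCNet or CMCE implementation: it only enumerates which input positions a stack of 1D dilated convolutions can reach, assuming a kernel size of 3. The function names `receptive_offsets` and `coverage` are hypothetical helpers introduced for this illustration.

```python
from itertools import product


def receptive_offsets(dilations, kernel_size=3):
    """Return the sorted input offsets that can reach one output position
    after stacking 1D dilated convolutions with the given dilation rates.

    Illustrative helper only; not taken from the paper.
    """
    half = kernel_size // 2
    offsets = {0}
    for d in dilations:
        # A dilated kernel samples the input at multiples of the dilation rate.
        taps = [k * d for k in range(-half, half + 1)]
        offsets = {o + t for o, t in product(offsets, taps)}
    return sorted(offsets)


def coverage(dilations, kernel_size=3):
    """Fraction of positions inside the receptive field that are actually sampled."""
    offs = receptive_offsets(dilations, kernel_size)
    span = offs[-1] - offs[0] + 1
    return len(offs) / span


# Two layers with the same dilation rate skip every odd offset (gridding).
print(receptive_offsets([2, 2]), coverage([2, 2]))
# Mixing dilation rates 1 and 2 samples every position in the field.
print(receptive_offsets([1, 2]), coverage([1, 2]))
```

Under these assumptions, repeating the same dilation rate leaves holes inside the receptive field, which is one way to see the local information loss the abstract attributes to sparse dilated kernels. Mixing rates fills those holes in this toy setting.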
Pages: 14