RCNet: Related Context-Driven Network with Hierarchical Attention for Salient Object Detection

被引:4
|
作者
Xia, Chenxing [1 ]
Sun, Yanguang [1 ,2 ]
Li, Kuan-Ching [3 ]
Ge, Bin [1 ]
Zhang, Hanling [4 ]
Jiang, Bo [5 ]
Zhang, Ji [6 ]
机构
[1] Anhui Univ Sci & Technol, Coll Comp Sci & Engn, Huainan, Anhui, Peoples R China
[2] Nanjing Univ Sci & Technol, Coll Comp Sci & Engn, Nanjing, Jiangsu, Peoples R China
[3] Providence Univ, Dept Comp Sci & Informat Engn, Taichung, Taiwan
[4] Hunan Univ, Sch Design, Changsha 410082, Peoples R China
[5] Anhui Univ, Sch Comp Sci & Technol, Hefei 230601, Peoples R China
[6] Univ Southern Queensland, Sch Math Phys & Comp, Brisbane, Qld, Australia
关键词
Attention mechanism; Multi-scale contextual information; Salient object detection; MODEL;
D O I
10.1016/j.eswa.2023.121441
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent progress in salient object detection (SOD) mainly depends on dilated convolution with different receptive fields to capture contextual information for multi-scale learning. Intuitively, contextual information in different scales is conducive to understanding the image content, and thus can help us identify and locate salient objects in real-world scenes. However, the sparsity inside the dilated convolution kernel may cause the problem of local information loss, limiting the predictive accuracy of the model. In addition, the inequality of feature channels should also be considered, and they often feature different deviations for salient objects or background noises. Although some channel attention mechanisms have been proposed in SOD, their ability to capture global information is limited, and the problem of high complexity is still a great challenge. To alleviate the abovementioned problems, we propose a Related Context-Driven Network (RCNet) with Hierarchical Attention for Salient Object Detection, consisting of a cascaded multi-scale context exploration (CMCE) module and a hierarchical feature aggregation (HFA) module. The CMCE module is to capture multi-scale contextual information through using multi-receptive-field dilated convolutions in a diamond hierarchical structure, where a feature reconstruction operation is deployed to improve the correlation of features, effectively avoiding the gridding problems and local information loss. Meanwhile, the HFA module adaptively interacts with the complementary information of the multi-level features to further capture the important information from within the feature channel by a multi-source hybrid channel attention (MHCA) mechanism to generate powerful and robust feature representations. Extensive experiments on six benchmark datasets demonstrate that the proposed RCNet method consistently outperforms 20 existing the state-of-the-art SOD methods in terms of accuracy, generalization capacity and robustness.
引用
下载
收藏
页数:14
相关论文
共 50 条
  • [41] Bilateral Attention Network for RGB-D Salient Object Detection
    Zhang, Zhao
    Lin, Zheng
    Xu, Jun
    Jin, Wen-Da
    Lu, Shao-Ping
    Fan, Deng-Ping
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 1949 - 1961
  • [42] Group attention retention network for co-salient object detection
    Jing Liu
    Jiaxiang Wang
    Zhiwei Fan
    Min Yuan
    Weikang Wang
    Jiexiao Yu
    Machine Vision and Applications, 2023, 34
  • [43] Global contextual guided residual attention network for salient object detection
    Jun Wang
    Zhengyun Zhao
    Shangqin Yang
    Xiuli Chai
    Wanjun Zhang
    Miaohui Zhang
    Applied Intelligence, 2022, 52 : 6208 - 6226
  • [44] CONTEXT-AWARE HIERARCHICAL FEATURE ATTENTION NETWORK FOR MULTI-SCALE OBJECT DETECTION
    Xu, Xuelong
    Luo, Xiangfeng
    Ma, Liyan
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 2011 - 2015
  • [45] Spatiotemporal context-aware network for video salient object detection
    Tianyou Chen
    Jin Xiao
    Xiaoguang Hu
    Guofeng Zhang
    Shaojie Wang
    Neural Computing and Applications, 2022, 34 : 16861 - 16877
  • [46] COCCI: Context-Driven Clothing Classification Network
    Jiang, Minghua
    Liu, Shuqing
    Shi, Yankang
    Du, Chenghu
    Tang, Guangyu
    Liu, Li
    Peng, Tao
    Hu, Xinrong
    Yu, Feng
    ADVANCES IN COMPUTER GRAPHICS, CGI 2023, PT I, 2024, 14495 : 69 - 80
  • [47] Salient Object Detection with Multiscale Context Enhanced Fully Convolutional Network
    Ling Y.
    Chen Y.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2019, 31 (11): : 2007 - 2016
  • [48] Spatiotemporal context-aware network for video salient object detection
    Chen, Tianyou
    Xiao, Jin
    Hu, Xiaoguang
    Zhang, Guofeng
    Wang, Shaojie
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (19): : 16861 - 16877
  • [49] Context-Driven Satire Detection With Deep Learning
    Razali, Md Saifullah
    Halin, Alfian Abdul
    Chow, Yang-Wai
    Norowi, Noris Mohd
    Doraisamy, Shyamala
    IEEE ACCESS, 2022, 10 : 78780 - 78787
  • [50] Hierarchical Alternate Interaction Network for RGB-D Salient Object Detection
    Li, Gongyang
    Liu, Zhi
    Chen, Minyu
    Bai, Zhen
    Lin, Weisi
    Ling, Haibin
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 3528 - 3542