PileNet: A high-and-low pass complementary filter with multi-level feature refinement for salient object detection

被引:0
|
作者
Yang, Xiaoqi [1 ]
Duan, Liangliang [1 ]
Zhou, Quanqiang [1 ]
机构
[1] Qingdao Univ Technol, Sch Informat & Control Engn, Qingdao 266520, Peoples R China
关键词
Salient object detection; Complementary filter; Feature refinement; Feature aggregation; NETWORK;
D O I
10.1016/j.jvcir.2024.104186
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multi-head self-attentions (MSAs) in Transformer are low-pass filters, which will tend to reduce high-frequency signals. Convolutional layers (Convs) in Convolutional Neural Network (CNN) are high-pass filters, which will tend to capture high-frequency components of the images. Therefore, CNN and Transformer contain complementary information, and the combination of the two is necessary for satisfactory detection results. In this work, we propose a novel framework PileNet that efficiently combine CNN and Transformer for accurate salient object detection (SOD). Specifically in PileNet, we introduce complementary encoder that extracts multi-level complementary saliency features. Next, we simplify the complementary features by adjusting the number of channels for all features to a fixed value. By introducing the multi-level feature aggregation (MLFA) and multi-level feature refinement (MLFR) units, the lowand high-level features can easily be transmitted to feature blocks at various pyramid levels. Finally, we fuse all the refined saliency features in a Unet-like structure from top to bottom and use multi-point supervision mechanism to produce the final saliency maps. Extensive experimental results over five widely used saliency benchmark datasets clearly demonstrate that our proposed model can accurately locate the entire salient objects with clear object boundaries and outperform sixteen previous state-of-the-art saliency methods in terms of a wide range of metrics.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Salient object detection network with multi-scale feature refinement and boundary feedback
    Zhang, Qing
    Li, Xiang
    [J]. IMAGE AND VISION COMPUTING, 2021, 116
  • [22] MFCINet: multi-level feature and context information fusion network for RGB-D salient object detection
    Chenxing Xia
    Difeng Chen
    Xiuju Gao
    Bin Ge
    Kuan-Ching Li
    Xianjin Fang
    Yan Zhang
    Ke Yang
    [J]. The Journal of Supercomputing, 2024, 80 : 2487 - 2513
  • [23] Multi-level and multi-scale deep saliency network for salient object detection
    Zhang, Qing
    Lin, Jiajun
    Zhuge, Jingling
    Yuan, Wenhao
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 59 : 415 - 424
  • [24] Multi-level feature fusion pyramid network for object detection
    Zebin Guo
    Hui Shuai
    Guangcan Liu
    Yisheng Zhu
    Wenqing Wang
    [J]. The Visual Computer, 2023, 39 : 4267 - 4277
  • [25] Multi-level feature fusion pyramid network for object detection
    Guo, Zebin
    Shuai, Hui
    Liu, Guangcan
    Zhu, Yisheng
    Wang, Wenqing
    [J]. VISUAL COMPUTER, 2023, 39 (09): : 4267 - 4277
  • [26] SALIENT OBJECT DETECTION BY MULTI-LEVEL FEATURES LEARNING DETERMINED SPARSE RECONSTRUCTION
    Yan, Xiaoyun
    Wang, Yuehuan
    Song, Qiong
    Dai, Kaiheng
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 2762 - 2766
  • [27] CEMINet: Context exploration and multi-level interaction network for salient object detection
    Xia, Chenxing
    Chen, Xinyu
    Sun, Yanguang
    Ge, Bin
    Fang, Xianjin
    Gao, Xiuju
    Li, Kuan-Ching
    Zhang, Hanling
    Zhang, Yan
    [J]. DIGITAL SIGNAL PROCESSING, 2024, 147
  • [28] 3MNet: Multi-task, multi-level and multi-channel feature aggregation network for salient object detection
    Yan, Xinghe
    Chen, Zhenxue
    Wu, Q. M. Jonathan
    Lu, Mengxu
    Sun, Luna
    [J]. MACHINE VISION AND APPLICATIONS, 2021, 32 (02)
  • [29] Multiple salient object detection through multi-level foreground segmentation strategy
    Paramanandam, Kokila
    Kanagavalli, R.
    [J]. INTERNATIONAL JOURNAL OF SYSTEM ASSURANCE ENGINEERING AND MANAGEMENT, 2023,
  • [30] 3MNet: Multi-task, multi-level and multi-channel feature aggregation network for salient object detection
    Xinghe Yan
    Zhenxue Chen
    Q. M. Jonathan Wu
    Mengxu Lu
    Luna Sun
    [J]. Machine Vision and Applications, 2021, 32