A novel embedded cross framework for high-resolution salient object detection

被引:0
|
作者
Wang, Baoyu [1 ,2 ]
Yang, Mao [1 ]
Cao, Pingping [2 ]
Liu, Yan [3 ]
机构
[1] Northeast Elect Power Univ, Key Lab Modern Power Syst Simulat & Control & Rene, Minist Educ, Jilin 132012, Peoples R China
[2] Criminal Invest Police Univ China, Coll Basic Educ & Res, Shenyang 110854, Peoples R China
[3] Northeastern Univ, Fac Robot Sci & Engn, Shenyang 110819, Peoples R China
关键词
Salient object detection; Embedded cross framework; Dual-path transformer; Unit fusion module; ATTENTION; NETWORK;
D O I
10.1007/s10489-024-06073-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Salient object detection (SOD) is a fundamental research topic in computer vision and has attracted significant interest from various fields, it has revealed two issues while driving the rapid development of salient detection. (1) The salient regions in high-resolution images exhibit significant differences in location, structure, and edge details, which makes them difficult to recognize and depict. (2) The traditional salient detection architecture is insensitive to detecting targets in high-resolution feature spaces, which leads to incomplete saliency predictions. To address these limitations, this paper proposes a novel embedded cross framework with a dual-path transformer (ECF-DT) for high-resolution SOD. The framework consists of a dual-path transformer and a unit fusion module for partitioning the salient targets. Specifically, we first design a cross network as a baseline model for salient object detection. Then, the dual-path transformer is embedded into the cross network with the objective of integrating fine-grained visual contextual information and target details while suppressing the disparity of the feature space. To generate more robust feature representations, we also introduce a unit fusion module, which highlights the positive information in the feature channels and encourages saliency prediction. Extensive experiments are conducted on nine benchmark databases, and the performance of the ECF-DT is compared with that of other existing state-of-the-art methods. The results indicate that our method outperforms its competitors and accurately detects the targets in high-resolution images with large objects, cluttered backgrounds, and complex scenes. It achieves MAEs of 0.017, 0.026, and 0.031 on three high-resolution public databases. Moreover, it reaches S-measure rates of 0.909, 0.876, 0.936, 0.854, 0.929, and 0.826 on six low-resolution public databases.
引用
收藏
页数:19
相关论文
共 50 条
  • [11] An edge-aware high-resolution framework for camouflaged object detection
    Ma, Jingyuan
    Chen, Tianyou
    Xiao, Jin
    Hu, Xiaoguang
    Wang, Yingxun
    IMAGE AND VISION COMPUTING, 2025, 157
  • [12] Saliency bagging: a novel framework for robust salient object detection
    Vivek Kumar Singh
    Nitin Kumar
    The Visual Computer, 2020, 36 : 1423 - 1441
  • [13] Saliency Boosting: a novel framework to refine salient object detection
    Vivek Kumar Singh
    Nitin Kumar
    Suresh Madhavan
    Artificial Intelligence Review, 2020, 53 : 3731 - 3772
  • [14] A novel multi-graph framework for salient object detection
    Lu, Ye
    Zhou, Kedong
    Wu, Xiyin
    Gong, Penghan
    VISUAL COMPUTER, 2019, 35 (11): : 1683 - 1699
  • [15] Saliency bagging: a novel framework for robust salient object detection
    Singh, Vivek Kumar
    Kumar, Nitin
    VISUAL COMPUTER, 2020, 36 (07): : 1423 - 1441
  • [16] Saliency Boosting: a novel framework to refine salient object detection
    Singh, Vivek Kumar
    Kumar, Nitin
    Madhavan, Suresh
    ARTIFICIAL INTELLIGENCE REVIEW, 2020, 53 (05) : 3731 - 3772
  • [17] A novel multi-graph framework for salient object detection
    Ye Lu
    Kedong Zhou
    Xiyin Wu
    Penghan Gong
    The Visual Computer, 2019, 35 : 1683 - 1699
  • [18] Cross-scale resolution consistent network for salient object detection
    Huang, Xiaoyu
    Liu, Wei
    Li, Minghui
    Nie, Hangyu
    IET IMAGE PROCESSING, 2024, 18 (10) : 2788 - 2799
  • [19] A Universal Framework for Salient Object Detection
    Lei, Jianjun
    Wang, Bingren
    Fang, Yuming
    Lin, Weisi
    Le Callet, Patrick
    Ling, Nam
    Hou, Chunping
    IEEE TRANSACTIONS ON MULTIMEDIA, 2016, 18 (09) : 1783 - 1795
  • [20] Exploring class-agnostic pixels for scribble-supervised high-resolution salient object detection
    Qingpeng Yang
    Yi Zhou
    Xiuli Chai
    Miaohui Zhang
    Wanjun Zhang
    Jun Wang
    Neural Computing and Applications, 2023, 35 : 3469 - 3482