A novel embedded cross framework for high-resolution salient object detection

被引:0
|
作者
Wang, Baoyu [1 ,2 ]
Yang, Mao [1 ]
Cao, Pingping [2 ]
Liu, Yan [3 ]
机构
[1] Northeast Elect Power Univ, Key Lab Modern Power Syst Simulat & Control & Rene, Minist Educ, Jilin 132012, Peoples R China
[2] Criminal Invest Police Univ China, Coll Basic Educ & Res, Shenyang 110854, Peoples R China
[3] Northeastern Univ, Fac Robot Sci & Engn, Shenyang 110819, Peoples R China
关键词
Salient object detection; Embedded cross framework; Dual-path transformer; Unit fusion module; ATTENTION; NETWORK;
D O I
10.1007/s10489-024-06073-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Salient object detection (SOD) is a fundamental research topic in computer vision and has attracted significant interest from various fields, it has revealed two issues while driving the rapid development of salient detection. (1) The salient regions in high-resolution images exhibit significant differences in location, structure, and edge details, which makes them difficult to recognize and depict. (2) The traditional salient detection architecture is insensitive to detecting targets in high-resolution feature spaces, which leads to incomplete saliency predictions. To address these limitations, this paper proposes a novel embedded cross framework with a dual-path transformer (ECF-DT) for high-resolution SOD. The framework consists of a dual-path transformer and a unit fusion module for partitioning the salient targets. Specifically, we first design a cross network as a baseline model for salient object detection. Then, the dual-path transformer is embedded into the cross network with the objective of integrating fine-grained visual contextual information and target details while suppressing the disparity of the feature space. To generate more robust feature representations, we also introduce a unit fusion module, which highlights the positive information in the feature channels and encourages saliency prediction. Extensive experiments are conducted on nine benchmark databases, and the performance of the ECF-DT is compared with that of other existing state-of-the-art methods. The results indicate that our method outperforms its competitors and accurately detects the targets in high-resolution images with large objects, cluttered backgrounds, and complex scenes. It achieves MAEs of 0.017, 0.026, and 0.031 on three high-resolution public databases. Moreover, it reaches S-measure rates of 0.909, 0.876, 0.936, 0.854, 0.929, and 0.826 on six low-resolution public databases.
引用
收藏
页数:19
相关论文
共 50 条
  • [31] ANALYSIS OF HIGH-RESOLUTION AERIAL IMAGES FOR OBJECT DETECTION
    TRIVEDI, MM
    BOKIL, AG
    TAKLA, MB
    MAKSYMONKO, GB
    BROACH, JT
    ADVANCES IN IMAGE COMPRESSION AND AUTOMATIC TARGET RECOGNITION, 1989, 1099 : 58 - 65
  • [32] Learning Cross-Modality High-Resolution Representation for Thermal Small-Object Detection
    Zhang, Yan
    Lei, Xu
    Hu, Qian
    Xu, Chang
    Yang, Wen
    Xia, Gui-Song
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 15
  • [33] Particle filter framework for salient object detection in videos
    Muthuswamy, Karthik
    Rajan, Deepu
    IET COMPUTER VISION, 2015, 9 (03) : 428 - 438
  • [34] Deepside: A general deep framework for salient object detection
    Fu, Keren
    Zhao, Qijun
    Gu, Irene Yu-Hua
    Yang, Jie
    NEUROCOMPUTING, 2019, 356 : 69 - 82
  • [35] Multi-attention embedded network for salient object detection
    He, Wei
    Pan, Chen
    Xu, Wenlong
    Zhang, Ning
    SOFT COMPUTING, 2021, 25 (20) : 13053 - 13067
  • [36] A Causal Debiasing Framework for Unsupervised Salient Object Detection
    Lin, Xiangru
    Wu, Ziyi
    Chen, Guanqi
    Li, Guanbin
    Yu, Yizhou
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 1610 - 1619
  • [37] Cross refinement network with edge detection for salient object detection
    Xiang, Junjiang
    Hu, Xiao
    Ding, Jiayu
    Tan, Xiangyue
    Yang, Jiaxin
    IET SIGNAL PROCESSING, 2021, 15 (07) : 425 - 436
  • [38] Depth Injection Framework for RGBD Salient Object Detection
    Yao, Shunyu
    Zhang, Miao
    Piao, Yongri
    Qiu, Chaoyi
    Lu, Huchuan
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 5340 - 5352
  • [39] Multi-attention embedded network for salient object detection
    Wei He
    Chen Pan
    Wenlong Xu
    Ning Zhang
    Soft Computing, 2021, 25 : 13053 - 13067
  • [40] ESOD: Efficient Small Object Detection on High-Resolution Images
    Liu, Kai
    Fu, Zhihang
    Jin, Sheng
    Chen, Ze
    Zhou, Fan
    Jiang, Rongxin
    Chen, Yaowu
    Ye, Jieping
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2025, 34 : 183 - 195