A novel embedded cross framework for high-resolution salient object detection

Cited by: 0

Authors:

Wang, Baoyu [1,2]

Yang, Mao [1]

Cao, Pingping [2]

Liu, Yan [3]

Affiliations:

[1] Northeast Elect Power Univ, Key Lab Modern Power Syst Simulat & Control & Rene, Minist Educ, Jilin 132012, Peoples R China

[2] Criminal Invest Police Univ China, Coll Basic Educ & Res, Shenyang 110854, Peoples R China

[3] Northeastern Univ, Fac Robot Sci & Engn, Shenyang 110819, Peoples R China

Source:

APPLIED INTELLIGENCE | 2025 / Vol. 55 / Issue 04

Keywords:

Salient object detection; Embedded cross framework; Dual-path transformer; Unit fusion module; ATTENTION; NETWORK;

DOI:

10.1007/s10489-024-06073-x

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Salient object detection (SOD) is a fundamental research topic in computer vision and has attracted significant interest from various fields. However, the rapid development of salient detection has revealed two issues. (1) The salient regions in high-resolution images exhibit significant differences in location, structure, and edge details, which makes them difficult to recognize and depict. (2) The traditional salient detection architecture is insensitive to targets in high-resolution feature spaces, which leads to incomplete saliency predictions. To address these limitations, this paper proposes a novel embedded cross framework with a dual-path transformer (ECF-DT) for high-resolution SOD. The framework consists of a dual-path transformer and a unit fusion module for partitioning the salient targets. Specifically, we first design a cross network as a baseline model for salient object detection. Then, the dual-path transformer is embedded into the cross network with the objective of integrating fine-grained visual contextual information and target details while suppressing the disparity of the feature space. To generate more robust feature representations, we also introduce a unit fusion module, which highlights the positive information in the feature channels and encourages saliency prediction. Extensive experiments are conducted on nine benchmark databases, and the performance of the ECF-DT is compared with that of other existing state-of-the-art methods. The results indicate that our method outperforms its competitors and accurately detects the targets in high-resolution images with large objects, cluttered backgrounds, and complex scenes. It achieves MAEs of 0.017, 0.026, and 0.031 on three high-resolution public databases. Moreover, it reaches S-measure rates of 0.909, 0.876, 0.936, 0.854, 0.929, and 0.826 on six low-resolution public databases.
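
The MAE figures in the abstract refer to mean absolute error, which in SOD evaluation is conventionally the per-pixel average of the absolute difference between a predicted saliency map and its binary ground-truth mask, with both scaled to [0, 1]. The following is a minimal sketch of that conventional definition only; it is not the authors' evaluation code, and the function name `mean_absolute_error` and the toy 2x2 maps are hypothetical values chosen for illustration.

```python
def mean_absolute_error(pred, gt):
    """Per-pixel mean absolute error between two same-sized 2-D maps.

    Both maps are assumed (for this sketch) to be lists of rows with
    values already normalized to the range [0, 1].
    """
    if len(pred) != len(gt) or any(len(p) != len(g) for p, g in zip(pred, gt)):
        raise ValueError("prediction and ground truth must have the same shape")

    total = 0.0
    count = 0
    for pred_row, gt_row in zip(pred, gt):
        for p, g in zip(pred_row, gt_row):
            total += abs(p - g)  # absolute error at one pixel
            count += 1

    if count == 0:
        raise ValueError("maps must not be empty")
    return total / count


# Hypothetical toy data: a 2x2 predicted saliency map and its binary mask.
pred = [[0.9, 0.1],
        [0.8, 0.0]]
gt = [[1.0, 0.0],
      [1.0, 0.0]]

mae = mean_absolute_error(pred, gt)
print(f"MAE = {mae:.3f}")
```

Lower values are better: a map identical to its ground truth scores 0, so the reported MAEs of 0.017 to 0.031 correspond to an average per-pixel deviation of roughly 2 to 3 percent of the value range.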


Pages: 19
