A novel embedded cross framework for high-resolution salient object detection

被引:0
|
作者
Wang, Baoyu [1 ,2 ]
Yang, Mao [1 ]
Cao, Pingping [2 ]
Liu, Yan [3 ]
机构
[1] Northeast Elect Power Univ, Key Lab Modern Power Syst Simulat & Control & Rene, Minist Educ, Jilin 132012, Peoples R China
[2] Criminal Invest Police Univ China, Coll Basic Educ & Res, Shenyang 110854, Peoples R China
[3] Northeastern Univ, Fac Robot Sci & Engn, Shenyang 110819, Peoples R China
关键词
Salient object detection; Embedded cross framework; Dual-path transformer; Unit fusion module; ATTENTION; NETWORK;
D O I
10.1007/s10489-024-06073-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Salient object detection (SOD) is a fundamental research topic in computer vision and has attracted significant interest from various fields, it has revealed two issues while driving the rapid development of salient detection. (1) The salient regions in high-resolution images exhibit significant differences in location, structure, and edge details, which makes them difficult to recognize and depict. (2) The traditional salient detection architecture is insensitive to detecting targets in high-resolution feature spaces, which leads to incomplete saliency predictions. To address these limitations, this paper proposes a novel embedded cross framework with a dual-path transformer (ECF-DT) for high-resolution SOD. The framework consists of a dual-path transformer and a unit fusion module for partitioning the salient targets. Specifically, we first design a cross network as a baseline model for salient object detection. Then, the dual-path transformer is embedded into the cross network with the objective of integrating fine-grained visual contextual information and target details while suppressing the disparity of the feature space. To generate more robust feature representations, we also introduce a unit fusion module, which highlights the positive information in the feature channels and encourages saliency prediction. Extensive experiments are conducted on nine benchmark databases, and the performance of the ECF-DT is compared with that of other existing state-of-the-art methods. The results indicate that our method outperforms its competitors and accurately detects the targets in high-resolution images with large objects, cluttered backgrounds, and complex scenes. It achieves MAEs of 0.017, 0.026, and 0.031 on three high-resolution public databases. Moreover, it reaches S-measure rates of 0.909, 0.876, 0.936, 0.854, 0.929, and 0.826 on six low-resolution public databases.
引用
收藏
页数:19
相关论文
共 50 条
  • [21] Exploring class-agnostic pixels for scribble-supervised high-resolution salient object detection
    Yang, Qingpeng
    Zhou, Yi
    Chai, Xiuli
    Zhang, Miaohui
    Zhang, Wanjun
    Wang, Jun
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (04): : 3469 - 3482
  • [22] A novel graph-based optimization framework for salient object detection
    Zhang, Jinxia
    Ehinger, Krista A.
    Wei, Haikun
    Zhang, Kanjian
    Yang, Jingyu
    PATTERN RECOGNITION, 2017, 64 : 39 - 50
  • [23] TF-SOD: a novel transformer framework for salient object detection
    Wang, Zhenyu
    Zhang, Yunzhou
    Liu, Yan
    Wang, Zhuo
    Coleman, Sonya
    Kerr, Dermot
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (14): : 11789 - 11806
  • [24] TF-SOD: a novel transformer framework for salient object detection
    Zhenyu Wang
    Yunzhou Zhang
    Yan Liu
    Zhuo Wang
    Sonya Coleman
    Dermot Kerr
    Neural Computing and Applications, 2022, 34 : 11789 - 11806
  • [25] A novel seminar learning framework for weakly supervised salient object detection
    Liu, Yan
    Zhang, Yunzhou
    Wang, Zhenyu
    Yang, Fei
    Qiu, Feng
    Coleman, Sonya
    Kerr, Dermot
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 126
  • [26] Fast Object Detection in High-Resolution Videos
    Tran, Ryan
    Kanaujia, Atul
    Parameswaran, Vasu
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 1461 - 1470
  • [27] Revisiting Image Pyramid Structure for High Resolution Salient Object Detection
    Kim, Taehun
    Kim, Kunhee
    Lee, Joonyeong
    Cha, Dongmin
    Lee, Jiho
    Kim, Daijin
    COMPUTER VISION - ACCV 2022, PT VII, 2023, 13847 : 257 - 273
  • [28] A COARSE-TO-FINE OBJECT DETECTION FRAMEWORK FOR HIGH-RESOLUTION IMAGES WITH SPARSE OBJECTS
    Liu, Jinyan
    Yan, Longbin
    Chen, Jie
    2021 IEEE 31ST INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2021,
  • [29] Label Decoupling Framework for Salient Object Detection
    Wei, Jun
    Wang, Shuhui
    Wu, Zhe
    Su, Chi
    Huang, Qingming
    Tian, Qi
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, : 13022 - 13031
  • [30] Salient Object Detection: Integrate Salient Features in the Deep Learning Framework
    Chen, Qixin
    Liu, Tie
    Shang, Yuanyuan
    Shao, Zhuhong
    Ding, Hui
    IEEE ACCESS, 2019, 7 : 152483 - 152492