SOD-diffusion: Salient Object Detection via Diffusion-Based Image Generators

Citations: 0
Authors
Zhang, Shuo [1, 2]
Huang, Jiaming [2]
Chen, Shizhe [2]
Wu, Yan [2]
Hu, Tao [2]
Liu, Jing [1]
Affiliations
[1] East China Normal Univ, Shanghai Key Lab Trustworthy Comp, Shanghai, Peoples R China
[2] Huolala, Shenzhen, Peoples R China
Keywords
All Open Access; Hybrid Gold;
DOI
10.1111/cgf.15251
Chinese Library Classification (CLC) number
TP31 [Computer software];
Discipline classification code
081202; 0835;
Abstract
Salient Object Detection (SOD) is a challenging task that aims to precisely identify and segment the salient objects in an image. However, existing SOD methods still struggle to make explicit predictions near object edges and often lack end-to-end training capabilities. To alleviate these problems, we propose SOD-diffusion, a novel framework that formulates salient object detection as a denoising diffusion process from noisy masks to object masks. Specifically, object masks are diffused from ground-truth masks to a random distribution in latent space, and the model learns to reverse this noising process to reconstruct the object masks. To enhance the denoising learning process, we design an attention feature interaction module (AFIM) and a specific fine-tuning protocol that integrate conditional semantic features from the input image with the diffusion noise embedding. Extensive experiments on five widely used SOD benchmark datasets demonstrate that SOD-diffusion achieves favorable performance compared to previous well-established methods. Furthermore, leveraging the strong generalization capability of SOD-diffusion, we apply it to publicly available images, generating high-quality masks that serve as an additional SOD benchmark test set.
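The noising and denoising of masks described in the abstract can be illustrated with a minimal sketch of the standard DDPM forward process applied to a binary mask. This is not the authors' implementation. It assumes a linear beta schedule, works directly on a small pixel-space mask rather than in a latent space, and omits the AFIM conditioning and the trained network entirely. All function names and parameter values below are hypothetical, chosen for illustration only.

```python
import numpy as np

def linear_beta_schedule(num_steps, beta_start=1e-4, beta_end=0.02):
    # Assumed schedule (common DDPM default); the paper's schedule is not stated here.
    return np.linspace(beta_start, beta_end, num_steps)

def forward_diffuse(x0, t, alpha_bar, rng):
    # Closed-form forward step: x_t = sqrt(abar_t) * x0 + sqrt(1 - abar_t) * eps
    noise = rng.standard_normal(x0.shape)
    x_t = np.sqrt(alpha_bar[t]) * x0 + np.sqrt(1.0 - alpha_bar[t]) * noise
    return x_t, noise

def recover_x0(x_t, t, alpha_bar, predicted_noise):
    # Invert the forward step given a noise estimate. A trained denoiser
    # would supply predicted_noise; here the true noise is passed instead.
    return (x_t - np.sqrt(1.0 - alpha_bar[t]) * predicted_noise) / np.sqrt(alpha_bar[t])

rng = np.random.default_rng(0)
betas = linear_beta_schedule(1000)
alpha_bar = np.cumprod(1.0 - betas)  # cumulative signal-retention factor

# Toy ground-truth mask: a 4x4 salient square inside an 8x8 image.
mask = np.zeros((8, 8))
mask[2:6, 2:6] = 1.0
x0 = 2.0 * mask - 1.0  # rescale {0, 1} to {-1, 1}

x_t, noise = forward_diffuse(x0, 500, alpha_bar, rng)
x0_hat = recover_x0(x_t, 500, alpha_bar, noise)  # oracle noise stands in for the model
```

With the true noise supplied, `recover_x0` returns the original mask up to floating-point error. In a real system the noise estimate would come from a network conditioned on the input image, so the reconstruction would only be approximate.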
Pages: 12
Related papers
50 in total
  • [31] Diffusion-based microalloying via reaction sintering
    Bishop, D. P.
    Kipouros, G. J.
    Caley, W. F.
    Journal of Materials Science, 1997, 32 : 2353 - 2358
  • [32] Salient Object Detection via Google Image Retrieval
    Tan, Weimin
    Yan, Bo
    IMAGE AND GRAPHICS (ICIG 2017), PT I, 2017, 10666 : 97 - 107
  • [33] Diffusion-based network for unsupervised landmark detection
    Wu, Tao
    Wang, Kai
    Tang, Chuanming
    Zhang, Jianlin
    KNOWLEDGE-BASED SYSTEMS, 2024, 292
  • [34] Microfluidics - Microfluidic diffusion-based separation and detection
    Weigl, BH
    Yager, P
    SCIENCE, 1999, 283 (5400) : 346 - 347
  • [35] Detection Techniques for Diffusion-based Molecular Communication
    Llatser, Ignacio
    Cabellos-Aparicio, Albert
    Pierobon, Massimiliano
    Alarcon, Eduard
    IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2013, 31 (12) : 726 - 734
  • [36] Error Detection in Diffusion-based Molecular Communication
    Einolghozati, Arash
    Fekri, Faramarz
    2015 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2015, : 1128 - 1133
  • [37] Diffusion-based image denoising combining curvelet and wavelet
    Ashamol, V. G.
    Sreelekha, G.
    Sathidevi, P. S.
    PROCEEDINGS OF IWSSIP 2008: 15TH INTERNATIONAL CONFERENCE ON SYSTEMS, SIGNALS AND IMAGE PROCESSING, 2008, : 169 - 172
  • [38] Learning Sparse Masks for Diffusion-Based Image Inpainting
    Alt, Tobias
    Peter, Pascal
    Weickert, Joachim
    PATTERN RECOGNITION AND IMAGE ANALYSIS (IBPRIA 2022), 2022, 13256 : 528 - 539
  • [39] Text-image Alignment for Diffusion-based Perception
    Kondapaneni, Neehar
    Marks, Markus
    Knott, Manuel
    Guimaraes, Rogerio
    Perona, Pietro
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 13883 - 13893
  • [40] Linear Hyperbolic Diffusion-Based Image Denoising Technique
    Barbu, Tudor
    NEURAL INFORMATION PROCESSING, PT III, 2015, 9491 : 471 - 478