SOD-diffusion: Salient Object Detection via Diffusion-Based Image Generators

被引:0
|
作者
Zhang, Shuo [1 ,2 ]
Huang, Jiaming [2 ]
Chen, Shizhe [2 ]
Wu, Yan [2 ]
Hu, Tao [2 ]
Liu, Jing [1 ]
机构
[1] East China Normal Univ, Shanghai Key Lab Trustworthy Comp, Shanghai, Peoples R China
[2] Huolala, Shenzhen, Peoples R China
关键词
All Open Access; Hybrid Gold;
D O I
10.1111/cgf.15251
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Salient Object Detection (SOD) is a challenging task that aims to precisely identify and segment the salient objects. However, existing SOD methods still face challenges in making explicit predictions near the edges and often lack end-to-end training capabilities. To alleviate these problems, we propose SOD-diffusion, a novel framework that formulates salient object detection as a denoising diffusion process from noisy masks to object masks. Specifically, object masks diffuse from ground-truth masks to random distribution in latent space, and the model learns to reverse this noising process to reconstruct object masks. To enhance the denoising learning process, we design an attention feature interaction module (AFIM) and a specific fine-tuning protocol to integrate conditional semantic features from the input image with diffusion noise embedding. Extensive experiments on five widely used SOD benchmark datasets demonstrate that our proposed SOD-diffusion achieves favorable performance compared to previous well-established methods. Furthermore, leveraging the outstanding generalization capability of SOD-diffusion, we applied it to publicly available images, generating high-quality masks that serve as an additional SOD benchmark testset.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Diffusion-Based Image Compression in Steganography
    Mainberger, Markus
    Schmaltz, Christian
    Berg, Matthias
    Weickert, Joachim
    Backes, Michael
    ADVANCES IN VISUAL COMPUTING, ISVC 2012, PT II, 2012, 7432 : 219 - 228
  • [22] Cauchy graph embedding based diffusion model for salient object detection
    Tan, Yihua
    Li, Yansheng
    Chen, Chen
    Yu, Jin-Gang
    Tian, Jinwen
    JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 2016, 33 (05) : 887 - 898
  • [23] Diffusion-based cooperative space object tracking
    Jia, Bin
    Pham, Khanh
    Blasch, Erik
    Chen, Genshe
    Shen, Dan
    OPTICAL ENGINEERING, 2019, 58 (04)
  • [24] Salient Object Detection Based on Laplace Diffusion Models with Sink Points
    Wang B.
    Zhang T.
    Wang X.
    Wang, Baoyan (wangbaoyan2005@163.com), 1934, Science Press (39): : 1934 - 1941
  • [25] Conditional Diffusion Models for Camouflaged and Salient Object Detection
    Sun, Ke
    Chen, Zhongxi
    Lin, Xianming
    Sun, Xiaoshuai
    Liu, Hong
    Ji, Rongrong
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, 47 (04) : 2833 - 2848
  • [26] ITERATIVE DIFFUSION-BASED ANOMALY DETECTION
    Mishne, Gal
    Cohen, Israel
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 1682 - 1686
  • [27] Enhanced Diffusion-Based Analysis for Fast Defect Detection in ECPT Image
    Liang, Yiping
    Bai, Libing
    Tian, Lulu
    Zhang, Xu
    Ren, Chao
    Shao, Dan
    Ma, Zhenzhong
    Sun, Mosi
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (02) : 2884 - 2896
  • [28] Diffusion-Based Image-to-Image Translation by Noise Correction via Prompt Interpolation
    Lee, Junsung
    Kang, Minsoo
    Han, Bohyung
    COMPUTER VISION - ECCV 2024, PT XII, 2025, 15070 : 289 - 304
  • [29] Diffusion-based image inpainting with internal learning
    Cherel, Nicolas
    Almansa, Andres
    Gousseau, Yann
    Newson, Alasdair
    32ND EUROPEAN SIGNAL PROCESSING CONFERENCE, EUSIPCO 2024, 2024, : 446 - 450
  • [30] Diffusion-based microalloying via reaction sintering
    Bishop, DP
    Kipouros, GJ
    Caley, WF
    JOURNAL OF MATERIALS SCIENCE, 1997, 32 (09) : 2353 - 2358