SOD-diffusion: Salient Object Detection via Diffusion-Based Image Generators

被引:0
|
作者
Zhang, Shuo [1 ,2 ]
Huang, Jiaming [2 ]
Chen, Shizhe [2 ]
Wu, Yan [2 ]
Hu, Tao [2 ]
Liu, Jing [1 ]
机构
[1] East China Normal Univ, Shanghai Key Lab Trustworthy Comp, Shanghai, Peoples R China
[2] Huolala, Shenzhen, Peoples R China
关键词
All Open Access; Hybrid Gold;
D O I
10.1111/cgf.15251
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Salient Object Detection (SOD) is a challenging task that aims to precisely identify and segment the salient objects. However, existing SOD methods still face challenges in making explicit predictions near the edges and often lack end-to-end training capabilities. To alleviate these problems, we propose SOD-diffusion, a novel framework that formulates salient object detection as a denoising diffusion process from noisy masks to object masks. Specifically, object masks diffuse from ground-truth masks to random distribution in latent space, and the model learns to reverse this noising process to reconstruct object masks. To enhance the denoising learning process, we design an attention feature interaction module (AFIM) and a specific fine-tuning protocol to integrate conditional semantic features from the input image with diffusion noise embedding. Extensive experiments on five widely used SOD benchmark datasets demonstrate that our proposed SOD-diffusion achieves favorable performance compared to previous well-established methods. Furthermore, leveraging the outstanding generalization capability of SOD-diffusion, we applied it to publicly available images, generating high-quality masks that serve as an additional SOD benchmark testset.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Anti-forensics of diffusion-based image inpainting
    Dou, Liyun
    Qian, Zhenxing
    Qin, Chuan
    Feng, Guorui
    Zhang, Xinpeng
    JOURNAL OF ELECTRONIC IMAGING, 2020, 29 (04)
  • [42] Diffusion-Based Wireless Semantic Communication for VR Image
    Zhang, Haoming
    Bao, Zhicheng
    Liang, Haotai
    Liu, Yucheng
    Dong, Chen
    Li, Lin
    IEEE/CIC INTERNATIONAL CONFERENCE ON COMMUNICATIONS IN CHINA, ICCC WORKSHOPS 2024, 2024, : 639 - 644
  • [43] Improving Diffusion-Based Image Synthesis with Context Prediction
    Yang, Ling
    Liu, Jingwei
    Hong, Shenda
    Zhang, Zhilong
    Huang, Zhilin
    Cai, Zheming
    Zhang, Wentao
    Cui, Bin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [44] Diffusion-Based Data Augmentation for Nuclei Image Segmentation
    Yu, Xinyi
    Li, Guanbin
    Lou, Wei
    Liu, Siqi
    Wan, Xiang
    Chen, Yan
    Li, Haofeng
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT VIII, 2023, 14227 : 592 - 602
  • [45] VideoBooth: Diffusion-based Video Generation with Image Prompts
    Jiang, Yuming
    Wu, Tianxing
    Yang, Shuai
    Si, Chenyang
    Lin, Dahua
    Qiao, Yu
    Loy, Chen Change
    Liu, Ziwei
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, : 6689 - 6700
  • [46] Diffusion-based remote sensing image fusion for classification
    Jiang, Yuling
    Liu, Shujun
    Wang, Huajun
    APPLIED INTELLIGENCE, 2025, 55 (03)
  • [47] Suppressing parasitic flow in membraneless diffusion-based microfluidic gradient generators
    Khandan, Vahid
    Chiechi, Ryan C.
    Verpoorte, Elisabeth
    Mathwig, Klaus
    LAB ON A CHIP, 2025,
  • [48] The Crack Diffusion Model: An Innovative Diffusion-Based Method for Pavement Crack Detection
    Zhang, Haoyuan
    Chen, Ning
    Li, Mei
    Mao, Shanjun
    REMOTE SENSING, 2024, 16 (06)
  • [49] Diffusion-Based Image Inpainting Forensics Via Gradient Domain Guided Filtering Enhancement
    Liu Tingting
    Zhang Yujin
    Wu Fei
    Xiong Shiting
    LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (08)
  • [50] Diffusion-based image inpainting forensics via weighted least squares filtering enhancement
    Zhang, Yujin
    Liu, Tingting
    Cattani, Carlo
    Cui, Qing
    Liu, Shuxian
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (20) : 30725 - 30739