Dataset Diffusion: Diffusion-based Synthetic Dataset Generation for Pixel-Level Semantic Segmentation

被引：0

作者：

Quang Nguyen ^{[1
,2
]}

Truong Vu ^{[1
]}

Anh Tran ^{[1
]}

Khoi Nguyen ^{[1
]}

机构：

[1] VinAI Res, Ho Chi Minh City, Vietnam

[2] Ho Chi Minh City Univ Technol, VNU HCM, Ho Chi Minh City, Vietnam

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023) | 2023年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Preparing training data for deep vision models is a labor-intensive task. To address this, generative models have emerged as an effective solution for generating synthetic data. While current generative models produce image-level category labels, we propose a novel method for generating pixel-level semantic segmentation labels using the text-to-image generative model Stable Diffusion (SD). By utilizing the text prompts, cross-attention, and self-attention of SD, we introduce three new techniques: class-prompt appending, class-prompt cross-attention, and self-attention exponentiation. These techniques enable us to generate segmentation maps corresponding to synthetic images. These maps serve as pseudo-labels for training semantic segmenters, eliminating the need for labor-intensive pixel-wise annotation. To account for the imperfections in our pseudo-labels, we incorporate uncertainty regions into the segmentation, allowing us to disregard loss from those regions. We conduct evaluations on two datasets, PASCAL VOC and MSCOCO, and our approach significantly outperforms concurrent work. Our benchmarks and code will be released at https://github.com/VinAIResearch/Dataset-Diffusion.

引用

下载

页数：21

共 50 条

[1] Highly realistic synthetic dataset for pixel-level DensePose estimation via diffusion model
Liu, Qiong (liuqiong@scut.edu.cn), 2025, 159
[2] Temporal Pixel-Level Semantic Understanding Through the VSPW Dataset
Miao, Jiaxu
Wei, Yunchao
Wang, Xiaohan
Yang, Yi
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (09) : 11297 - 11308
[3] DiffuMask: Synthesizing Images with Pixel-level Annotations for Semantic Segmentation Using Diffusion Models
Wu, Weijia
Zhao, Yuzhong
Shou, Mike Zheng
Zhou, Hong
Shen, Chunhua
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 1206 - 1217
[4] COCO_TS Dataset: Pixel-Level Annotations Based on Weak Supervision for Scene Text Segmentation
Bonechi, Simone
Andreini, Paolo
Bianchini, Monica
Scarselli, Franco
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: IMAGE PROCESSING, PT III, 2019, 11729 : 238 - 250
[5] Generating a synthetic diffusion tensor dataset
Bergmann, O
Lundervold, A
Steihaug, T
18th IEEE Symposium on Computer-Based Medical Systems, Proceedings, 2005, : 277 - 281
[6] Semantic Segmentation of Panoramic Images Using a Synthetic Dataset
Xu, Yuanyou
Wang, Kaiwei
Yang, Kailun
Sun, Dongming
Fu, Jia
ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING IN DEFENSE APPLICATIONS, 2019, 11169
[7] The Effect of Training Dataset Size on Discriminative and Diffusion-Based Speech Enhancement Systems
Gonzalez, Philippe
Tan, Zheng-Hua
Ostergaard, Jan
Jensen, Jesper
Alstrom, Tommy Sonne
May, Tobias
IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 2225 - 2229
[8] Combining Pixel-Level and Structure-Level Adaptation for Semantic Segmentation
Bi, Xiwen
Chen, Dubing
Huang, He
Wang, Shidong
Zhang, Haofeng
NEURAL PROCESSING LETTERS, 2023, 55 (07) : 9669 - 9684
[9] Combining Pixel-Level and Structure-Level Adaptation for Semantic Segmentation
Xiwen Bi
Dubing Chen
He Huang
Shidong Wang
Haofeng Zhang
Neural Processing Letters, 2023, 55 : 9669 - 9684
[10] Pixel-level Intra-domain Adaptation for Semantic Segmentation
Yan, Zizheng
Yu, Xianggang
Qin, Yipeng
Wu, Yushuang
Han, Xiaoguang
Cui, Shuguang
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 404 - 413

← 1 2 3 4 5 →