DataDream: Few-Shot Guided Dataset Generation

Cited by: 0

Authors:

Kim, Jae Myung [1,2,3]

Bader, Jessica [2,3,4]

Alaniz, Stephan [2,3]

Schmid, Cordelia [5]

Akata, Zeynep [2,3,4]

Affiliations:

[1] Univ Tubingen, Tubingen, Germany

[2] Helmholtz Munich, Munich, Germany

[3] MCML, Munich, Germany

[4] TUM, Munich, Germany

[5] PSL Res Univ, CNRS, Ecole Normale Super, INRIA, Paris, France

Source:

COMPUTER VISION - ECCV 2024, PT LXXI | 2025 / Vol. 15129

Funding:

European Research Council

DOI:

10.1007/978-3-031-73209-6_15

Chinese Library Classification (CLC) code:

TP18 [Artificial intelligence theory]

Subject classification codes:

081104; 0812; 0835; 1405

Abstract:

While text-to-image diffusion models have been shown to achieve state-of-the-art results in image synthesis, they have yet to prove their effectiveness in downstream applications. Previous work has proposed generating data for image classifier training when access to real data is limited. However, these methods struggle to generate in-distribution images or depict fine-grained features, thereby hindering the generalization of classification models trained on synthetic datasets. We propose DataDream, a framework for synthesizing classification datasets that more faithfully represent the real data distribution when guided by few-shot examples of the target classes. DataDream fine-tunes LoRA weights for the image generation model on the few real images before generating the training data using the adapted model. We then fine-tune LoRA weights for CLIP using the synthetic data to improve downstream image classification over previous approaches on a large variety of datasets. We demonstrate the efficacy of DataDream through extensive experiments, surpassing state-of-the-art classification accuracy with few-shot data on 7 out of 10 datasets, while being competitive on the other 3. Additionally, we provide insights into the impact of various factors on model performance, such as the number of real shots, the number of generated images, and the fine-tuning compute. The code is available at https://github.com/ExplainableML/DataDream.
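
The abstract relies on LoRA fine-tuning for both the image generator and CLIP. As a rough illustration of the general LoRA idea only (not the DataDream code), the sketch below keeps a pretrained weight matrix frozen and adds a trainable low-rank update scaled by alpha / rank. The class name `LoRALinear`, its methods, and the default values are hypothetical and chosen for this example; a real pipeline would use a deep learning framework and train `A` and `B` by gradient descent.

```python
import random


def matmul(a, b):
    """Multiply two matrices stored as nested lists."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


class LoRALinear:
    """Hypothetical toy layer: frozen weight W plus low-rank update (alpha / rank) * B @ A."""

    def __init__(self, weight, rank, alpha=1.0, seed=0):
        rng = random.Random(seed)
        self.weight = weight  # frozen pretrained weight, shape (out_dim, in_dim)
        out_dim, in_dim = len(weight), len(weight[0])
        self.scale = alpha / rank
        # A starts small and random, B starts at zero, so the update is zero at initialization.
        self.A = [[rng.gauss(0.0, 0.01) for _ in range(in_dim)] for _ in range(rank)]
        self.B = [[0.0] * rank for _ in range(out_dim)]

    def effective_weight(self):
        """Return W + scale * (B @ A), the weight actually applied to inputs."""
        delta = matmul(self.B, self.A)  # shape (out_dim, in_dim)
        return [
            [w + self.scale * d for w, d in zip(w_row, d_row)]
            for w_row, d_row in zip(self.weight, delta)
        ]

    def forward(self, x):
        """Apply the adapted weight to one input vector x of length in_dim."""
        return [sum(w * xi for w, xi in zip(row, x)) for row in self.effective_weight()]

    def num_trainable(self):
        """Count only the adapter parameters (A and B); W stays frozen."""
        return len(self.A) * len(self.A[0]) + len(self.B) * len(self.B[0])
```

Because `B` is initialized to zero, the adapted layer reproduces the pretrained layer exactly before any training, and only the `rank * (in_dim + out_dim)` adapter parameters need to be updated, which is what makes this kind of adaptation practical with only a few real images per class.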


Pages: 252-268

Page count: 17
