DFSGAN: Introducing editable and representative attributes for few-shot image generation

Cited by: 6
Authors
Yang, Mengping [1, 2]
Niu, Saisai [3]
Wang, Zhe [1, 2]
Li, Dongdong [2]
Du, Wenli [1]
Affiliations
[1] East China Univ Sci & Technol, Key Lab Smart Mfg Energy Chem Proc, Minist Educ, Shanghai 200237, Peoples R China
[2] East China Univ Sci & Technol, Dept Comp Sci & Engn, Shanghai 200237, Peoples R China
[3] Shanghai Aerosp Control Technol Inst, Shanghai 201108, Peoples R China
Funding
U.S. National Science Foundation;
Keywords
Generative adversarial networks; Few shot image generation; Latent code; Intermediate representation; ADVERSARIAL NETWORKS; TEXT;
DOI
10.1016/j.engappai.2022.105519
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Training generative adversarial networks (GANs) usually requires large-scale data and massive computational resources. The performance of GANs plummets when data is limited, because the discriminator overfits and then provides meaningless feedback to the generator during adversarial training. Existing few-shot GANs are primarily concerned with transferring knowledge from models pre-trained on large-scale datasets or with using data augmentation to expand the training sets. However, previous methods consistently take latent codes sampled from a single distribution as the generator's input. We contend that more complicated latent codes can provide the generator with more editable attributes. In this paper, we propose DFSGAN for few-shot image generation, which takes dynamic Gaussian mixture (DGM) latent codes as the generator's input. Our DFSGAN can select the Gaussian components of the latent codes quantitatively. We also design two techniques that strengthen the representative ability of the intermediate features in the generating process, improving fidelity and maintaining the content and layout information of the synthesized images. Our DGM and intermediate representation enhancement techniques complement each other and improve synthesis quality. We conduct extensive experiments on 15 few-shot datasets with different resolutions, spanning from art paintings to realistic photos. Qualitative and quantitative results demonstrate the superiority and effectiveness of our model.
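As a rough illustration of what drawing a generator input from a Gaussian mixture, rather than from a single distribution, can look like, the following is a minimal sketch using only the Python standard library. It is not the authors' DGM implementation: the function name `sample_mixture_latents`, the uniform mixing weights, the unit variance, and the randomly drawn component means are all assumptions made for this example, and the abstract does not specify how DFSGAN parameterizes or selects its components.

```python
import random


def sample_mixture_latents(batch_size, latent_dim, num_components, seed=0, spread=2.0):
    """Draw latent codes from an isotropic Gaussian mixture (illustrative only)."""
    rng = random.Random(seed)
    # Hypothetical component means; a real model would presumably learn these.
    means = [
        [rng.gauss(0.0, spread) for _ in range(latent_dim)]
        for _ in range(num_components)
    ]
    codes, component_ids = [], []
    for _ in range(batch_size):
        # Assumption: components are chosen with uniform mixing weights.
        k = rng.randrange(num_components)
        # z = mu_k + eps, with eps ~ N(0, 1) per dimension (unit variance assumed).
        codes.append([m + rng.gauss(0.0, 1.0) for m in means[k]])
        component_ids.append(k)
    return codes, component_ids


# The number of components is a plain argument here, so it can be varied per call.
codes, ids = sample_mixture_latents(batch_size=8, latent_dim=16, num_components=3)
print(len(codes), len(codes[0]), sorted(set(ids)))
```

Each returned code would be fed to a generator in place of a sample from a single standard normal; the component index is returned alongside it so that the chosen component can be inspected.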
Pages: 12