DFSGAN: Introducing editable and representative attributes for few-shot image generation

被引:6
|
作者
Yang, Mengping [1 ,2 ]
Niu, Saisai [3 ]
Wang, Zhe [1 ,2 ]
Li, Dongdong [2 ]
Du, Wenli [1 ]
机构
[1] East China Univ Sci & Technol, Key Lab Smart Mfg Energy Chem Proc, Minist Educ, Shanghai 200237, Peoples R China
[2] East China Univ Sci & Technol, Dept Comp Sci & Engn, Shanghai 200237, Peoples R China
[3] Shanghai Aerosp Control Technol Inst, Shanghai 201108, Peoples R China
基金
美国国家科学基金会;
关键词
Generative adversarial networks; Few shot image generation; Latent code; Intermediate representation; ADVERSARIAL NETWORKS; TEXT;
D O I
10.1016/j.engappai.2022.105519
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Training generative adversarial networks (GANs) usually requires large-scale data and massive computation resources. The performance of GANs plummets when given limited data due to the discriminator overfitting, thus providing meaningless feedback to the generator during the adversarial training. Existing few-shot GANs are primarily concerned with transferring knowledge from models that have been pre-trained on large-scale datasets or using data augmentation to expand the training sets. However, previous methods consistently take latent codes sampled from a single distribution as the generator's input. We contend that more complicated latent codes can provide the generator with more editable attributes. In this paper, we propose DFSGAN for few-shot image generation, which takes dynamic Gaussian mixture (DGM) latent codes as the generator's input. Our DFSGAN can select the Gaussian components of the latent codes quantitatively. We also design two techniques to strengthen the representative ability of intermediate features of the generating process to improve the fidelity and maintain the content and layout information of the synthesized images. Our DGM and intermediate representation enhancement techniques complement each other and improve synthesis quality. We conduct extensive experiments on 15 few-shot datasets with different resolutions spanning from art paintings to realistic photos. Qualitative and quantitative results demonstrate the superiority and effectiveness of our model.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Few-shot Image Generation via Cross-domain Correspondence
    Ojha, Utkarsh
    Li, Yijun
    Lu, Jingwan
    Efros, Alexei A.
    Lee, Yong Jae
    Shechtman, Eli
    Zhang, Richard
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 10738 - 10747
  • [22] Few-Shot Image Generation via Style Adaptation and Content Preservation
    He, Xiaosheng
    Yang, Fan
    Liu, Fayao
    Lin, Guosheng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [23] WeditGAN: Few-Shot Image Generation via Latent Space Relocation
    Duan, Yuxuan
    Niu, Li
    Hong, Yan
    Zhang, Liqing
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 2, 2024, : 1653 - 1661
  • [24] Few-Shot Image Generation with Mixup-Based Distance Learning
    Kong, Chaerin
    Kim, Jeesoo
    Han, Donghoon
    Kwak, Nojun
    COMPUTER VISION - ECCV 2022, PT XV, 2022, 13675 : 563 - 580
  • [25] Fast Adaptive Meta-Learning for Few-Shot Image Generation
    Phaphuangwittayakul, Aniwat
    Guo, Yi
    Ying, Fangli
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 2205 - 2217
  • [26] REPRESENTATIVE LOCAL FEATURE MINING FOR FEW-SHOT LEARNING
    Yan, Kun
    Liu, Lingbo
    Hou, Jun
    Wang, Ping
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 1730 - 1734
  • [27] Shaping Visual Representations With Attributes for Few-Shot Recognition
    Chen, Haoxing
    Li, Huaxiong
    Li, Yaohui
    Chen, Chunlin
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 1397 - 1401
  • [28] Partition-A-Medical-Image: Extracting Multiple Representative Subregions for Few-Shot Medical Image Segmentation
    Zhu, Yazhou
    Wang, Shidong
    Xin, Tong
    Zhang, Zheng
    Zhang, Haofeng
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 : 1 - 12
  • [29] Few-shot biomedical image segmentation using diffusion models: Beyond image generation
    Khosravi, Bardia
    Rouzrokh, Pouria
    Mickley, John P.
    Faghani, Shahriar
    Mulford, Kellen
    Yang, Linjun
    Larson, A. Noelle
    Howe, Benjamin M.
    Erickson, Bradley J.
    Taunton, Michael J.
    Wyles, Cody C.
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2023, 242
  • [30] Few-Shot Unsupervised Image-to-Image Translation
    Liu, Ming-Yu
    Huang, Xun
    Mallya, Arun
    Karras, Tero
    Aila, Timo
    Lehtinen, Jaakko
    Kautz, Jan
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 10550 - 10559