Improving Few-shot Image Generation by Structural Discrimination and Textural Modulation

被引:0
|
作者
Yang, Mengping [1 ]
Wang, Zhe [1 ]
Feng, Wenyi [1 ]
Zhang, Qian [2 ]
Xiao, Ting [2 ]
机构
[1] East China Univ Sci & Technol, Dept Comp Sci & Engn, Key Lab Smart Mfg Energy Chem Proc, Shanghai, Peoples R China
[2] East China Univ Sci & Technol, Dept Comp Sci & Engn, Shanghai, Peoples R China
关键词
Few-shot Learning; Image Generation; Textural Modulation; Structural Discrimination;
D O I
10.1145/3581783.3611763
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Few-shot image generation, which aims to produce plausible and diverse images for one category given a fewimages from this category, has drawn extensive attention. Existing approaches either globally interpolate different images or fuse local representations with pre-defined coefficients. However, such an intuitive combination of images/features only exploits the most relevant information for generation, leading to poor diversity and coarse-grained semantic fusion. To remedy this, this paper proposes a novel textural modulation (TexMod) mechanism to inject external semantic signals into internal local representations. Parameterized by the feedback from the discriminator, our TexMod enables more fined-grained semantic injection while maintaining the synthesis fidelity. Moreover, a global structural discriminator (StructD) is developed to explicitly guide the model to generate images with reasonable layout and outline. Furthermore, the frequency awareness of the model is reinforced by encouraging the model to distinguish frequency signals. Together with these techniques, we build a novel and effective model for few-shot image generation. The effectiveness of our model is identified by extensive experiments on three popular datasets and various settings. Besides achieving state-of-the-art synthesis performance on these datasets, our proposed techniques could be seamlessly integrated into existing models for a further performance boost. Our code and models are available at here.
引用
收藏
页码:7837 / 7848
页数:12
相关论文
共 50 条
  • [21] Few-Shot Learning for Image Denoising
    Jiang, Bo
    Lu, Yao
    Zhang, Bob
    Lu, Guangming
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (09) : 4741 - 4753
  • [22] DFSGAN: Introducing editable and representative attributes for few-shot image generation
    Yang, Mengping
    Niu, Saisai
    Wang, Zhe
    Li, Dongdong
    Du, Wenli
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 117
  • [23] Few-shot Image Generation via Cross-domain Correspondence
    Ojha, Utkarsh
    Li, Yijun
    Lu, Jingwan
    Efros, Alexei A.
    Lee, Yong Jae
    Shechtman, Eli
    Zhang, Richard
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 10738 - 10747
  • [24] Few-Shot Image Generation via Style Adaptation and Content Preservation
    He, Xiaosheng
    Yang, Fan
    Liu, Fayao
    Lin, Guosheng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [25] WeditGAN: Few-Shot Image Generation via Latent Space Relocation
    Duan, Yuxuan
    Niu, Li
    Hong, Yan
    Zhang, Liqing
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 2, 2024, : 1653 - 1661
  • [26] Few-Shot Image Generation with Mixup-Based Distance Learning
    Kong, Chaerin
    Kim, Jeesoo
    Han, Donghoon
    Kwak, Nojun
    COMPUTER VISION - ECCV 2022, PT XV, 2022, 13675 : 563 - 580
  • [27] Fast Adaptive Meta-Learning for Few-Shot Image Generation
    Phaphuangwittayakul, Aniwat
    Guo, Yi
    Ying, Fangli
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 2205 - 2217
  • [28] Few-shot biomedical image segmentation using diffusion models: Beyond image generation
    Khosravi, Bardia
    Rouzrokh, Pouria
    Mickley, John P.
    Faghani, Shahriar
    Mulford, Kellen
    Yang, Linjun
    Larson, A. Noelle
    Howe, Benjamin M.
    Erickson, Bradley J.
    Taunton, Michael J.
    Wyles, Cody C.
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2023, 242
  • [29] Improving Augmentation Efficiency for Few-Shot Learning
    Cho, Wonhee
    Kim, Eunwoo
    IEEE ACCESS, 2022, 10 : 17697 - 17706
  • [30] Few-Shot Unsupervised Image-to-Image Translation
    Liu, Ming-Yu
    Huang, Xun
    Mallya, Arun
    Karras, Tero
    Aila, Timo
    Lehtinen, Jaakko
    Kautz, Jan
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 10550 - 10559