Improving Few-shot Image Generation by Structural Discrimination and Textural Modulation

Authors
Yang, Mengping [1]
Wang, Zhe [1]
Feng, Wenyi [1]
Zhang, Qian [2]
Xiao, Ting [2]
Affiliations
[1] East China Univ Sci & Technol, Dept Comp Sci & Engn, Key Lab Smart Mfg Energy Chem Proc, Shanghai, Peoples R China
[2] East China Univ Sci & Technol, Dept Comp Sci & Engn, Shanghai, Peoples R China
Keywords
Few-shot Learning; Image Generation; Textural Modulation; Structural Discrimination
DOI
10.1145/3581783.3611763
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Few-shot image generation, which aims to produce plausible and diverse images for one category given a few images from this category, has drawn extensive attention. Existing approaches either globally interpolate different images or fuse local representations with pre-defined coefficients. However, such an intuitive combination of images/features only exploits the most relevant information for generation, leading to poor diversity and coarse-grained semantic fusion. To remedy this, this paper proposes a novel textural modulation (TexMod) mechanism to inject external semantic signals into internal local representations. Parameterized by the feedback from the discriminator, TexMod enables more fine-grained semantic injection while maintaining synthesis fidelity. Moreover, a global structural discriminator (StructD) is developed to explicitly guide the model to generate images with reasonable layout and outline. Furthermore, the frequency awareness of the model is reinforced by encouraging the model to distinguish frequency signals. Combining these techniques, we build a novel and effective model for few-shot image generation. The effectiveness of our model is verified by extensive experiments on three popular datasets under various settings. Besides achieving state-of-the-art synthesis performance on these datasets, our proposed techniques can be seamlessly integrated into existing models for a further performance boost. Our code and models are available online.
Pages: 7837-7848
Number of pages: 12