Improving Few-shot Image Generation by Structural Discrimination and Textural Modulation

Cited by: 0
Authors
Yang, Mengping [1]
Wang, Zhe [1]
Feng, Wenyi [1]
Zhang, Qian [2]
Xiao, Ting [2]
Affiliations
[1] East China Univ Sci & Technol, Dept Comp Sci & Engn, Key Lab Smart Mfg Energy Chem Proc, Shanghai, Peoples R China
[2] East China Univ Sci & Technol, Dept Comp Sci & Engn, Shanghai, Peoples R China
Keywords
Few-shot Learning; Image Generation; Textural Modulation; Structural Discrimination
DOI
10.1145/3581783.3611763
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Few-shot image generation, which aims to produce plausible and diverse images for one category given a few images from this category, has drawn extensive attention. Existing approaches either globally interpolate different images or fuse local representations with pre-defined coefficients. However, such an intuitive combination of images/features only exploits the most relevant information for generation, leading to poor diversity and coarse-grained semantic fusion. To remedy this, this paper proposes a novel textural modulation (TexMod) mechanism to inject external semantic signals into internal local representations. Parameterized by the feedback from the discriminator, TexMod enables more fine-grained semantic injection while maintaining synthesis fidelity. Moreover, a global structural discriminator (StructD) is developed to explicitly guide the model to generate images with reasonable layout and outline. Furthermore, the frequency awareness of the model is reinforced by encouraging the model to distinguish frequency signals. Together, these techniques form a novel and effective model for few-shot image generation. The effectiveness of the model is verified by extensive experiments on three popular datasets and various settings. Besides achieving state-of-the-art synthesis performance on these datasets, the proposed techniques can be seamlessly integrated into existing models for a further performance boost. Our code and models are publicly available.
Pages: 7837-7848
Page count: 12
Related papers
50 in total
  • [31] Few-Shot Object Detection via Association and Discrimination
    Cao, Yuhang
    Wang, Jiaqi
    Jin, Ying
    Wu, Tong
    Chen, Kai
    Liu, Ziwei
    Lin, Dahua
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [32] Underwater Acoustic Object Discrimination for Few-shot Learning
    Chen, Yuan
    Ma, QiMing
    Yu, Jie
    Chen, Tuo
    2019 4TH INTERNATIONAL CONFERENCE ON MECHANICAL, CONTROL AND COMPUTER ENGINEERING (ICMCCE 2019), 2019, : 430 - 434
  • [33] Few-Shot Object Detection Based on Association and Discrimination
    Jia Jianli
    Han Huiyan
    Kuang Liqun
    Han Fangzheng
    Zheng Xinyi
    Zhang Xiuquan
    LASER & OPTOELECTRONICS PROGRESS, 2024, 61 (08)
  • [34] Rethinking cross-domain semantic relation for few-shot image generation
    Yao Gou
    Min Li
    Yilong Lv
    Yusen Zhang
    Yuhang Xing
    Yujie He
    Applied Intelligence, 2023, 53 : 22391 - 22404
  • [35] Rethinking cross-domain semantic relation for few-shot image generation
    Gou, Yao
    Li, Min
    Lv, Yilong
    Zhang, Yusen
    Xing, Yuhang
    He, Yujie
    APPLIED INTELLIGENCE, 2023, 53 (19) : 22391 - 22404
  • [36] Cross-Modulated Few-Shot Image Generation for Colorectal Tissue Classification
    Kumar, Amandeep
    Bhunia, Ankan Kumar
    Narayan, Sanath
    Cholakkal, Hisham
    Anwer, Rao Muhammad
    Laaksonen, Jorma
    Khan, Fahad Shahbaz
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT III, 2023, 14222 : 128 - 137
  • [37] SAGAN: Skip attention generative adversarial networks for few-shot image generation
    Aldhubri, Ali
    Lu, Jianfeng
    Fu, Guanyiman
    DIGITAL SIGNAL PROCESSING, 2024, 149
  • [38] Semantic Mask Reconstruction and Category Semantic Learning for few-shot image generation
    Xiao, Ting
    Cai, Yunjie
    Guan, Jiaoyan
    Wang, Zhe
    NEURAL NETWORKS, 2025, 183
  • [39] EVA: EVOLVING GIANT PRETRAINED MODEL FOR FEW-SHOT CONDITIONAL IMAGE GENERATION
    Zhai Zhixin
    2022 19TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2022,
  • [40] Decision fusion for few-shot image classification
    Tianhao Yuan
    Weifeng Liu
    Fei Yan
    Baodi Liu
    International Journal of Multimedia Information Retrieval, 2023, 12