Positional Encoding as Spatial Inductive Bias in GANs

Cited by: 30
Authors
Xu, Rui [1]
Wang, Xintao [3]
Chen, Kai [4, 5]
Zhou, Bolei [1]
Loy, Chen Change [2]
Affiliations
[1] Chinese Univ Hong Kong, CUHK SenseTime Joint Lab, Hong Kong, Peoples R China
[2] Nanyang Technol Univ, S Lab, Singapore, Singapore
[3] Tencent PCG, Appl Res Ctr, Shenzhen, Peoples R China
[4] SenseTime Res, Hong Kong, Peoples R China
[5] Shanghai AI Lab, Shanghai, Peoples R China
DOI
10.1109/CVPR46437.2021.01336
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
SinGAN shows impressive capability in learning the internal patch distribution of an image despite its limited effective receptive field. We are interested in knowing how such a translation-invariant convolutional generator could capture the global structure with just a spatially i.i.d. input. In this work, taking SinGAN and StyleGAN2 as examples, we show that such capability, to a large extent, is brought by the implicit positional encoding when using zero padding in the generators. Such positional encoding is indispensable for generating images with high fidelity. The same phenomenon is observed in other generative architectures such as DCGAN and PGGAN. We further show that zero padding leads to an unbalanced spatial bias with a vague relation between locations. To offer a better spatial inductive bias, we investigate alternative positional encodings and analyze their effects. Based on a more flexible, explicit positional encoding, we propose a new multi-scale training strategy and demonstrate its effectiveness in the state-of-the-art unconditional generator StyleGAN2. In addition, the explicit spatial inductive bias substantially improves SinGAN for more versatile image manipulation.
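The abstract's central claim, that zero padding gives a convolutional generator an implicit positional signal, can be illustrated with a small sketch. The code below is not taken from the paper: it is a minimal pure-Python 1D example, and the function names, the box-filter kernel, and the constant input are all assumptions chosen for illustration. A position-free (constant) input is passed through stacked convolutions. With zero padding, the outputs near the borders differ from the interior, and that difference spreads inward by one position per layer. With circular padding, the output stays constant and carries no positional information.

```python
def conv1d_same(signal, kernel, mode="zero"):
    """Same-size 1D cross-correlation with zero or circular padding.

    Hypothetical helper for illustration only, not the paper's code.
    """
    half = len(kernel) // 2
    n = len(signal)
    out = []
    for i in range(n):
        acc = 0.0
        for j, weight in enumerate(kernel):
            idx = i + j - half
            if 0 <= idx < n:
                acc += weight * signal[idx]
            elif mode == "circular":
                # Wrap around, so every position sees the same neighborhood.
                acc += weight * signal[idx % n]
            # mode == "zero": out-of-range taps contribute nothing.
        out.append(acc)
    return out


def stack(signal, kernel, depth, mode="zero"):
    """Apply the same convolution `depth` times (no nonlinearity)."""
    for _ in range(depth):
        signal = conv1d_same(signal, kernel, mode)
    return signal


# Assumed toy setup: a constant input carries no positional information.
constant_input = [1.0] * 9
box_kernel = [1.0, 1.0, 1.0]

zero_one_layer = stack(constant_input, box_kernel, depth=1, mode="zero")
zero_two_layers = stack(constant_input, box_kernel, depth=2, mode="zero")
circular_two_layers = stack(constant_input, box_kernel, depth=2, mode="circular")

print(zero_one_layer)       # borders differ from the interior
print(zero_two_layers)      # the border effect has moved one step inward
print(circular_two_layers)  # constant everywhere
```

In this toy setting the zero-padded outputs are symmetric about the center and vary only near the borders, so positions in the interior remain indistinguishable from one another until enough layers are stacked. This is a simplified 1D analogue of a border-driven spatial signal and is not a reproduction of the paper's experiments.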
Pages: 13564-13573
Number of pages: 10