Positional Encoding as Spatial Inductive Bias in GANs

被引:30
|
作者
Xu, Rui [1 ]
Wang, Xintao [3 ]
Chen, Kai [4 ,5 ]
Zhou, Bolei [1 ]
Loy, Chen Change [2 ]
机构
[1] Chinese Univ Hong Kong, CUHK SenseTime Joint Lab, Hong Kong, Peoples R China
[2] Nanyang Technol Univ, S Lab, Singapore, Singapore
[3] Tencent PCG, Appl Res Ctr, Shenzhen, Peoples R China
[4] SenseTime Res, Hong Kong, Peoples R China
[5] Shanghai AI Lab, Shanghai, Peoples R China
关键词
D O I
10.1109/CVPR46437.2021.01336
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
SinGAN shows impressive capability in learning internal patch distribution despite its limited effective receptive field. We are interested in knowing how such a translation-invariant convolutional generator could capture the global structure with just a spatially i.i.d. input. In this work, taking SinGAN and StyleGAN2 as examples, we show that such capability, to a large extent, is brought by the implicit positional encoding when using zero padding in the generators. Such positional encoding is indispensable for generating images with high fidelity. The same phenomenon is observed in other generative architectures such as DCGAN and PGGAN. We further show that zero padding leads to an unbalanced spatial bias with a vague relation between locations. To offer a better spatial inductive bias, we investigate alternative positional encodings and analyze their effects. Based on a more flexible positional encoding explicitly, we propose a new multi-scale training strategy and demonstrate its effectiveness in the state-of-the-art unconditional generator StyleGAN2. Besides, the explicit spatial inductive bias substantially improves SinGAN for more versatile image manipulation.
引用
收藏
页码:13564 / 13573
页数:10
相关论文
共 50 条
  • [21] Contrastive Learning Relies More on Spatial Inductive Bias Than Supervised Learning: An Empirical Study
    Zhong, Yuanyi
    Tang, Haoran
    Chen, Jun-Kun
    Wang, Yu-Xiong
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 16281 - 16290
  • [22] Is the positional bias an artefact? Distinguishing positional concerns from egalitarian concerns
    Celse, Jeremy
    JOURNAL OF SOCIO-ECONOMICS, 2012, 41 (03): : 277 - 283
  • [23] INDUCTIVE POLICY - THE PRAGMATICS OF BIAS SELECTION
    PROVOST, FJ
    BUCHANAN, BG
    MACHINE LEARNING, 1995, 20 (1-2) : 35 - 61
  • [24] Scaling MLPs: A Tale of Inductive Bias
    Bachmann, Gregor
    Anagnostidis, Sotiris
    Hofmann, Thomas
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [25] On the Inductive Bias of Neural Tangent Kernels
    Bietti, Alberto
    Mairal, Julien
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [26] Towards Exact Computation of Inductive Bias
    Boopathy, Akhilan
    Yue, William
    Hwang, Jaedong
    Iyer, Abhiram
    Fiete, Ila
    PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 3733 - 3741
  • [27] Towards Fairness in AI: Addressing Bias in Data Using GANs
    Rajabi, Amirarsalan
    Garibay, Ozlem O.
    HCI INTERNATIONAL 2021 - LATE BREAKING PAPERS: MULTIMODALITY, EXTENDED REALITY, AND ARTIFICIAL INTELLIGENCE, 2021, 13095 : 509 - 518
  • [28] Positional Encoding to Control Output Sequence Length
    Takase, Sho
    Okazaki, Naoaki
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 3999 - 4004
  • [29] The Impact of Positional Encoding on Length Generalization in Transformers
    Kazemnejad, Amirhossein
    Padhi, Inkit
    Ramamurthy, Karthikeyan Natesan
    Das, Payel
    Reddy, Siva
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [30] Relative Positional Encoding for Transformers with Linear Complexity
    Liutkus, Antoine
    Cifka, Ondiej
    Wu, Shih-Lun
    Simsekli, Umut
    Yang, Yi-Hsuan
    Richard, Gael
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139