The SSL Interplay: Augmentations, Inductive Bias, and Generalization

Authors
Cabannes, Vivien [1]
Kiani, Bobak T. [2]
Balestriero, Randall [1]
LeCun, Yann [1]
Bietti, Alberto [1]
Affiliations
[1] Meta AI, New York, NY 10003 USA
[2] MIT, Dept Elect Engn & Comp Sci, Cambridge, MA USA
DOI
Not available
Chinese Library Classification number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Self-supervised learning (SSL) has emerged as a powerful framework to learn representations from raw data without supervision. Yet in practice, engineers face issues such as instability in tuning optimizers and collapse of representations during training. Such challenges motivate the need for a theory to shed light on the complex interplay between the choice of data augmentation, network architecture, and training algorithm. We study this interplay with a precise analysis of generalization performance on both pretraining and downstream tasks in a theory-friendly setup, and highlight several insights for SSL practitioners that arise from our theory.
Pages: 47