The SSL Interplay: Augmentations, Inductive Bias, and Generalization

被引:0
|
作者
Cabannes, Vivien [1 ]
Kiani, Bobak T. [2 ]
Balestriero, Randall [1 ]
LeCun, Yann [1 ]
Bietti, Alberto [1 ]
机构
[1] Meta AI, New York, NY 10003 USA
[2] MIT, Dept Elect Engn & Comp Sci, Cambridge, MA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Self-supervised learning (SSL) has emerged as a powerful framework to learn representations from raw data without supervision. Yet in practice, engineers face issues such as instability in tuning optimizers and collapse of representations during training. Such challenges motivate the need for a theory to shed light on the complex interplay between the choice of data augmentation, network architecture, and training algorithm. We study such an interplay with a precise analysis of generalization performance on both pretraining and downstream tasks in a theory friendly setup, and highlight several insights for SSL practitioners that arise from our theory.
引用
收藏
页数:47
相关论文
共 50 条
  • [1] Cascade Generalization: Is SVMs' Inductive Bias Useful?
    Barakat, Nahla
    IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2010), 2010,
  • [2] Intrinsic dimensionality and generalization properties of the R-norm inductive bias
    Ardeshir, Navid
    Hsu, Daniel
    Sanford, Clayton
    THIRTY SIXTH ANNUAL CONFERENCE ON LEARNING THEORY, VOL 195, 2023, 195
  • [3] Data Augmentations for Improved (Large) Language Model Generalization
    Feder, Amir
    Wald, Yoav
    Shi, Claudia
    Saria, Suchi
    Blei, David
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [4] Development of Inductive Generalization
    Fisher, Anna V.
    CHILD DEVELOPMENT PERSPECTIVES, 2015, 9 (03) : 172 - 177
  • [5] INBIASED: INDUCTIVE BIAS DISTILLATION TO IMPROVE GENERALIZATION AND ROBUSTNESS THROUGH SHAPE-AWARENESS
    Gowda, Shruthi
    Zonooz, Bahram
    Arani, Elahe
    CONFERENCE ON LIFELONG LEARNING AGENTS, VOL 199, 2022, 199
  • [6] AdvST: Revisiting Data Augmentations for Single Domain Generalization
    Zheng, Guangtao
    Huai, Mengdi
    Zhang, Aidong
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 19, 2024, : 21832 - 21840
  • [7] On the Importance of Attention and Augmentations for Hypothesis Transfer in Domain Adaptation and Generalization
    Thomas, Georgi
    Sahay, Rajat
    Jahan, Chowdhury Sadman
    Manjrekar, Mihir
    Popp, Dan
    Savakis, Andreas
    SENSORS, 2023, 23 (20)
  • [8] Feature Selection for Inductive Generalization
    Yu, Na-Yung
    Yamauchi, Takashi
    Yang, Huei-Fang
    Chen, Yen-Lin
    Gutierrez-Osuna, Ricardo
    COGNITIVE SCIENCE, 2010, 34 (08) : 1574 - 1593
  • [9] Electrophysiological Correlates of Inductive Generalization
    Li, Fuhong
    Luo, Yuejia
    Cao, Bihua
    Li, Hong
    JOURNAL OF PSYCHOPHYSIOLOGY, 2009, 23 (01) : 27 - 34
  • [10] Logics for Qualitative Inductive Generalization
    Diderik Batens
    Studia Logica, 2011, 97 : 61 - 80