The SSL Interplay: Augmentations, Inductive Bias, and Generalization

被引:0
|
作者
Cabannes, Vivien [1 ]
Kiani, Bobak T. [2 ]
Balestriero, Randall [1 ]
LeCun, Yann [1 ]
Bietti, Alberto [1 ]
机构
[1] Meta AI, New York, NY 10003 USA
[2] MIT, Dept Elect Engn & Comp Sci, Cambridge, MA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Self-supervised learning (SSL) has emerged as a powerful framework to learn representations from raw data without supervision. Yet in practice, engineers face issues such as instability in tuning optimizers and collapse of representations during training. Such challenges motivate the need for a theory to shed light on the complex interplay between the choice of data augmentation, network architecture, and training algorithm. We study such an interplay with a precise analysis of generalization performance on both pretraining and downstream tasks in a theory friendly setup, and highlight several insights for SSL practitioners that arise from our theory.
引用
收藏
页数:47
相关论文
共 50 条
  • [31] Lifelong learning and inductive bias
    Amit, Ron
    Meir, Ron
    CURRENT OPINION IN BEHAVIORAL SCIENCES, 2019, 29 : 51 - 54
  • [32] The Inductive Bias of Quantum Kernels
    Kuebler, Jonas M.
    Buchholz, Simon
    Schoelkopf, Bernhard
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021,
  • [33] A Model of Inductive Bias Learning
    Baxter, Jonathan
    1600, Morgan Kaufmann Publishers (12):
  • [34] A model of inductive bias learning
    Baxter, J
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2000, 12 : 149 - 198
  • [35] Probing as Quantifying Inductive Bias
    Immer, Alexander
    Hennigen, Lucas Torroba
    Fortuin, Vincent
    Cotterell, Ryan
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 1839 - 1851
  • [36] RoboAgent: Generalization and Efficiency in Robot Manipulation via Semantic Augmentations and Action Chunking
    Bharadhwaj, Homanga
    Vakil, Jay
    Sharma, Mohit
    Gupta, Abhinav
    Tulsiani, Shubham
    Kumar, Vikash
    2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2024, 2024, : 4788 - 4795
  • [37] Checking safety by inductive generalization of counterexamples to induction
    Bradley, Aaron R.
    Manna, Zohar
    FMCAD 2007: FORMAL METHODS IN COMPUTER AIDED DESIGN, PROCEEDINGS, 2007, : 173 - 180
  • [38] Novelty and Inductive Generalization in Human Reinforcement Learning
    Gershman, Samuel J.
    Niv, Yael
    TOPICS IN COGNITIVE SCIENCE, 2015, 7 (03) : 391 - 415
  • [39] From Features to Categories: The Development of Inductive Generalization
    Ralston, Robert W.
    Sloutsky, Vladimir M.
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY-LEARNING MEMORY AND COGNITION, 2023, 49 (10) : 1615 - 1634
  • [40] Inductive Generalization of Analytically Learned Goal Hierarchies
    Koenik, Tolga
    Nejati, Negin
    Kuter, Ugur
    INDUCTIVE LOGIC PROGRAMMING, 2010, 5989 : 65 - +