Reverse Engineering Self-Supervised Learning

被引:0
|
作者
Ben-Shaul, Ido [1 ,2 ]
Shwartz-Ziv, Ravid [3 ]
Galanti, Tomer [4 ]
Dekel, Shai [1 ]
LeCun, Yann [3 ,5 ]
机构
[1] Tel Aviv Univ, Dept Appl Math, Tel Aviv, Israel
[2] eBay Res, San Jose, CA USA
[3] NYU, New York, NY 10003 USA
[4] MIT, Cambridge, MA 02139 USA
[5] Meta AI, FAIR, New York, NY USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Self-supervised learning (SSL) is a powerful tool in machine learning, but understanding the learned representations and their underlying mechanisms remains a challenge. This paper presents an in-depth empirical analysis of SSL-trained representations, encompassing diverse models, architectures, and hyperparameters. Our study reveals an intriguing aspect of the SSL training process: it inherently facilitates the clustering of samples with respect to semantic labels, which is surprisingly driven by the SSL objective's regularization term. This clustering process not only enhances downstream classification but also compresses the data information. Furthermore, we establish that SSL-trained representations align more closely with semantic classes rather than random classes. Remarkably, we show that learned representations align with semantic classes across various hierarchical levels, and this alignment increases during training and when moving deeper into the network. Our findings provide valuable insights into SSL's representation learning mechanisms and their impact on performance across different sets of classes.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] Self-Supervised Dialogue Learning
    Wu, Jiawei
    Wang, Xin
    Wang, William Yang
    [J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 3857 - 3867
  • [2] Longitudinal self-supervised learning
    Zhao, Qingyu
    Liu, Zixuan
    Adeli, Ehsan
    Pohl, Kilian M.
    [J]. MEDICAL IMAGE ANALYSIS, 2021, 71
  • [3] Self-supervised learning model
    Saga, Kazushie
    Sugasaka, Tamami
    Sekiguchi, Minoru
    [J]. Fujitsu Scientific and Technical Journal, 1993, 29 (03): : 209 - 216
  • [4] Self-Supervised Learning for Electroencephalography
    Rafiei, Mohammad H.
    Gauthier, Lynne V.
    Adeli, Hojjat
    Takabi, Daniel
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (02) : 1457 - 1471
  • [5] Self-Supervised Learning for Recommendation
    Huang, Chao
    Xia, Lianghao
    Wang, Xiang
    He, Xiangnan
    Yin, Dawei
    [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 5136 - 5139
  • [6] Credal Self-Supervised Learning
    Lienen, Julian
    Huellermeier, Eyke
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [7] Quantum self-supervised learning
    Jaderberg, B.
    Anderson, L. W.
    Xie, W.
    Albanie, S.
    Kiffner, M.
    Jaksch, D.
    [J]. QUANTUM SCIENCE AND TECHNOLOGY, 2022, 7 (03):
  • [8] A New Self-supervised Method for Supervised Learning
    Yang, Yuhang
    Ding, Zilin
    Cheng, Xuan
    Wang, Xiaomin
    Liu, Ming
    [J]. INTERNATIONAL CONFERENCE ON COMPUTER VISION, APPLICATION, AND DESIGN (CVAD 2021), 2021, 12155
  • [9] Self-supervised Learning: A Succinct Review
    Rani, Veenu
    Nabi, Syed Tufael
    Kumar, Munish
    Mittal, Ajay
    Kumar, Krishan
    [J]. ARCHIVES OF COMPUTATIONAL METHODS IN ENGINEERING, 2023, 30 (04) : 2761 - 2775
  • [10] Self-Supervised Learning for Recommender System
    Huang, Chao
    Wang, Xiang
    He, Xiangnan
    Yin, Dawei
    [J]. PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 3440 - 3443