Reverse Engineering Self-Supervised Learning

被引：0

作者：

Ben-Shaul, Ido ^{[1
,2
]}

Shwartz-Ziv, Ravid ^{[3
]}

Galanti, Tomer ^{[4
]}

Dekel, Shai ^{[1
]}

LeCun, Yann ^{[3
,5
]}

机构：

[1] Tel Aviv Univ, Dept Appl Math, Tel Aviv, Israel

[2] eBay Res, San Jose, CA USA

[3] NYU, New York, NY 10003 USA

[4] MIT, Cambridge, MA 02139 USA

[5] Meta AI, FAIR, New York, NY USA

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023) | 2023年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Self-supervised learning (SSL) is a powerful tool in machine learning, but understanding the learned representations and their underlying mechanisms remains a challenge. This paper presents an in-depth empirical analysis of SSL-trained representations, encompassing diverse models, architectures, and hyperparameters. Our study reveals an intriguing aspect of the SSL training process: it inherently facilitates the clustering of samples with respect to semantic labels, which is surprisingly driven by the SSL objective's regularization term. This clustering process not only enhances downstream classification but also compresses the data information. Furthermore, we establish that SSL-trained representations align more closely with semantic classes rather than random classes. Remarkably, we show that learned representations align with semantic classes across various hierarchical levels, and this alignment increases during training and when moving deeper into the network. Our findings provide valuable insights into SSL's representation learning mechanisms and their impact on performance across different sets of classes.

引用

页数：22

共 50 条

[1] Self-Supervised Dialogue Learning
Wu, Jiawei
Wang, Xin
Wang, William Yang
[J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 3857 - 3867
[2] Longitudinal self-supervised learning
Zhao, Qingyu
Liu, Zixuan
Adeli, Ehsan
Pohl, Kilian M.
[J]. MEDICAL IMAGE ANALYSIS, 2021, 71
[3] Self-supervised learning model
Saga, Kazushie
Sugasaka, Tamami
Sekiguchi, Minoru
[J]. Fujitsu Scientific and Technical Journal, 1993, 29 (03): : 209 - 216
[4] Self-Supervised Learning for Electroencephalography
Rafiei, Mohammad H.
Gauthier, Lynne V.
Adeli, Hojjat
Takabi, Daniel
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (02) : 1457 - 1471
[5] Self-Supervised Learning for Recommendation
Huang, Chao
Xia, Lianghao
Wang, Xiang
He, Xiangnan
Yin, Dawei
[J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 5136 - 5139
[6] Credal Self-Supervised Learning
Lienen, Julian
Huellermeier, Eyke
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[7] Quantum self-supervised learning
Jaderberg, B.
Anderson, L. W.
Xie, W.
Albanie, S.
Kiffner, M.
Jaksch, D.
[J]. QUANTUM SCIENCE AND TECHNOLOGY, 2022, 7 (03):
[8] A New Self-supervised Method for Supervised Learning
Yang, Yuhang
Ding, Zilin
Cheng, Xuan
Wang, Xiaomin
Liu, Ming
[J]. INTERNATIONAL CONFERENCE ON COMPUTER VISION, APPLICATION, AND DESIGN (CVAD 2021), 2021, 12155
[9] Self-supervised Learning: A Succinct Review
Rani, Veenu
Nabi, Syed Tufael
Kumar, Munish
Mittal, Ajay
Kumar, Krishan
[J]. ARCHIVES OF COMPUTATIONAL METHODS IN ENGINEERING, 2023, 30 (04) : 2761 - 2775
[10] Self-Supervised Learning for Recommender System
Huang, Chao
Wang, Xiang
He, Xiangnan
Yin, Dawei
[J]. PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 3440 - 3443

← 1 2 3 4 5 →