Align Representations with Base: A New Approach to Self-Supervised Learning

被引:1
|
作者
Zhang, Shaofeng [1 ]
Qiu, Lyn [1 ]
Zhu, Feng [2 ]
Yan, Junchi [1 ]
Zhang, Hengrui [1 ]
Zhao, Rui [1 ,2 ,3 ]
Li, Hongyang [2 ]
Yang, Xiaokang [1 ]
机构
[1] Shanghai Jiao Tong Univ, Artificial Intelligence Inst, MoE Key Lab Artificial Intelligence, Shanghai, Peoples R China
[2] SenseTime Res, Hong Kong, Peoples R China
[3] Shanghai Jiao Tong Univ, Qing Yuan Res Inst, Shanghai, Peoples R China
关键词
D O I
10.1109/CVPR52688.2022.01610
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing symmetric contrastive learning methods suffer from collapses (complete and dimensional) or quadratic complexity of objectives. Departure from these methods which maximize mutual information of two generated views, along either instance or feature dimension, the proposed paradigm introduces intermediate variables at the feature level, and maximizes the consistency between variables and representations of each view. Spec(fically, the proposed intermediate variables are the nearest group of base vectors to representations. Hence, we call the proposed method ARB (Align Representations with Base). Compared with other symmetric approaches, ARB 1) does not require negative pairs, which leads the complexity of the overall objective function is in linear order, 2) reduces feature redundancy, increasing the information density of training samples, 3) is more robust to output dimension size, which outperforms previous feature-wise arts over 28% Top-1 accuracy on ImageNet-100 under low-dimension settings.
引用
收藏
页码:16579 / 16588
页数:10
相关论文
共 50 条
  • [41] Quantum self-supervised learning
    Jaderberg, B.
    Anderson, L. W.
    Xie, W.
    Albanie, S.
    Kiffner, M.
    Jaksch, D.
    QUANTUM SCIENCE AND TECHNOLOGY, 2022, 7 (03):
  • [42] Self-supervised Phonotactic Representations for Language Identification
    Ramesh, G.
    Kumar, C. Shiva
    Murty, K. Sri Rama
    INTERSPEECH 2021, 2021, : 1514 - 1518
  • [43] Self-Supervised and Invariant Representations for Wireless Localization
    Salihu, Artan
    Rupp, Markus
    Schwarz, Stefan
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (08) : 8281 - 8296
  • [44] SIMILARITY ANALYSIS OF SELF-SUPERVISED SPEECH REPRESENTATIONS
    Chung, Yu-An
    Belinkov, Yonatan
    Glass, James
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 3040 - 3044
  • [45] Utilizing Self-supervised Representations for MOS Prediction
    Tseng, Wei-Cheng
    Huang, Chien-yu
    Kao, Wei-Tsung
    Lin, Yist Y.
    Lee, Hung-yi
    INTERSPEECH 2021, 2021, : 2781 - 2785
  • [46] Enriching Chest Radiography Representations: Self-Supervised Learning With a Recalibrating and Importance Scaling
    Kong, Heesan
    Kim, Donghee
    Kim, Kwangsu
    IEEE ACCESS, 2023, 11 : 108697 - 108704
  • [47] Self-Supervised Learning of Multi-Level Audio Representations for Music Segmentation
    Buisson, Morgan
    McFee, Brian
    Essid, Slim
    Crayencour, Helene C.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 2141 - 2152
  • [48] Audio-guided self-supervised learning for disentangled visual speech representations
    Feng, Dalu
    Yang, Shuang
    Shan, Shiguang
    Chen, Xilin
    FRONTIERS OF COMPUTER SCIENCE, 2024, 18 (06)
  • [49] DOCENT: Learning Self-Supervised Entity Representations from Large Document Collections
    Zemlyanskiy, Yury
    Gandhe, Sudeep
    He, Ruining
    Kanagal, Bhargav
    Ravula, Anirudh
    Gottweis, Juraj
    Sha, Fei
    Eckstein, Ilya
    16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 2540 - 2549
  • [50] Learning self-supervised molecular representations for drug-drug interaction prediction
    Kpanou, Rogia
    Dallaire, Patrick
    Rousseau, Elsa
    Corbeil, Jacques
    BMC BIOINFORMATICS, 2024, 25 (01)