SimCSE: Simple Contrastive Learning of Sentence Embeddings

被引:0
|
作者
Gao, Tianyu [1 ]
Yao, Xingcheng [2 ]
Chen, Danqi [1 ]
机构
[1] Princeton Univ, Dept Comp Sci, Princeton, NJ 08544 USA
[2] Tsinghua Univ, Inst Interdisciplinary Informat Sci, Beijing, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents SimCSE, a simple contrastive learning framework that greatly advances the state-of-the-art sentence embeddings. We first describe an unsupervised approach, which takes an input sentence and predicts itself in a contrastive objective, with only standard dropout used as noise. This simple method works surprisingly well, performing on par with previous supervised counterparts. We find that dropout acts as minimal data augmentation and removing it leads to a representation collapse. Then, we propose a supervised approach, which incorporates annotated pairs from natural language inference datasets into our contrastive learning framework, by using "entailment" pairs as positives and "contradiction" pairs as hard negatives. We evaluate SimCSE on standard semantic textual similarity (STS) tasks, and our unsupervised and supervised models using BERTbase achieve an average of 76.3% and 81.6% Spearman's correlation respectively, a 4.2% and 2.2% improvement compared to previous best results. We also show-both theoretically and empirically-that contrastive learning objective regularizes pre-trained embeddings' anisotropic space to be more uniform, and it better aligns positive pairs when supervised signals are available.(1)
引用
收藏
页码:6894 / 6910
页数:17
相关论文
共 50 条
  • [1] Composition-contrastive Learning for Sentence Embeddings
    Chanchani, Sachin
    Huang, Ruihong
    [J]. PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 15836 - 15848
  • [2] MCSE: Multimodal Contrastive Learning of Sentence Embeddings
    Zhang, Miaoran
    Mosbach, Marius
    Adelani, David Ifeoluwa
    Hedderich, Michael A.
    Klakow, Dietrich
    [J]. NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 5959 - 5969
  • [3] AdCSE: An Adversarial Method for Contrastive Learning of Sentence Embeddings
    Li, Renhao
    Duan, Lei
    Xie, Guicai
    Xiao, Shan
    Jiang, Weipeng
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2022, PT III, 2022, : 165 - 180
  • [4] Simple Data Transformations for Mitigating the Syntactic Similarity to Improve Sentence Embeddings at Supervised Contrastive Learning
    Kim, Minji
    Cho, Whanhee
    Kim, Soohyeong
    Choi, Yong Suk
    [J]. ADVANCED INTELLIGENT SYSTEMS, 2024, 6 (08)
  • [5] DialogueCSE: Dialogue-based Contrastive Learning of Sentence Embeddings
    Liu, Che
    Wang, Rui
    Liu, Jinghua
    Sun, Jian
    Huang, Fei
    Si, Luo
    [J]. 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 2396 - 2406
  • [6] DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings
    Chuang, Yung-Sung
    Dangovski, Rumen
    Luo, Hongyin
    Zhang, Yang
    Chang, Shiyu
    Soljacic, Marin
    Li, Shang-Wen
    Yih, Wen-tau
    Kim, Yoon
    Glass, James
    [J]. NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 4207 - 4218
  • [7] WhitenedCSE: Whitening-based Contrastive Learning of Sentence Embeddings
    Zhuo, Wenjie
    Sun, Yifan
    Wang, Xiaohan
    Zhu, Linchao
    Yang, Yi
    [J]. PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 12135 - 12148
  • [8] Non-Linguistic Supervision for Contrastive Learning of Sentence Embeddings
    Jian, Yiren
    Gao, Chongyang
    Vosoughi, Soroush
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [9] miCSE: Mutual Information Contrastive Learning for Low-shot Sentence Embeddings
    Klein, Tassilo
    Nabi, Moin
    [J]. PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 6159 - 6177
  • [10] Contrastive learning for unsupervised sentence embeddings using negative samples with diminished semantics
    Zhiyi Yu
    Hong Li
    Jialin Feng
    [J]. The Journal of Supercomputing, 2024, 80 : 5428 - 5445