Ditto: A Simple and Efficient Approach to Improve Sentence Embeddings

Authors
Chen, Qian [1]
Wang, Wen [1]
Zhang, Qinglin [1]
Zheng, Siqi [1]
Deng, Chong [1]
Yu, Hai [1]
Liu, Jiaqing [1]
Ma, Yukun [1]
Zhang, Chong [1]
Affiliation
[1] Alibaba Group, Speech Lab, Hangzhou, People's Republic of China
DOI
Not available
Chinese Library Classification
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Prior studies diagnose the anisotropy problem in sentence representations from pre-trained language models, e.g., BERT, without fine-tuning. Our analysis reveals that sentence embeddings from BERT suffer from a bias towards uninformative words, which limits performance on semantic textual similarity (STS) tasks. To address this bias, we propose a simple and efficient unsupervised approach, Diagonal Attention Pooling (Ditto), which weights words with model-based importance estimations and computes the weighted average of word representations from pre-trained models as sentence embeddings. Ditto can be easily applied to any pre-trained language model as a postprocessing operation. Compared to prior sentence embedding approaches, Ditto neither adds parameters nor requires any learning. Empirical evaluations demonstrate that Ditto can alleviate the anisotropy problem and improve various pre-trained models on the STS benchmarks.
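The pooling step described in the abstract can be illustrated with a minimal sketch. It assumes, based only on the name "Diagonal Attention Pooling", that the importance of each word is read from the diagonal of a self-attention matrix; the abstract does not say which layer or head supplies that matrix, and the normalisation of the weights is also an assumption made here. The function names and the toy arrays are hypothetical and stand in for real model outputs.

```python
import numpy as np


def diagonal_attention_pooling(hidden_states, attention):
    """Weighted average of token vectors using diagonal attention as weights.

    hidden_states: (seq_len, dim) token representations from a pre-trained model.
    attention:     (seq_len, seq_len) self-attention matrix (assumed source of
                   the importance scores; layer/head choice is not specified here).
    """
    hidden_states = np.asarray(hidden_states, dtype=float)
    attention = np.asarray(attention, dtype=float)
    # Importance of token i is taken to be attention[i, i] (assumption).
    weights = np.diagonal(attention)
    # Normalise so the weights sum to 1 (assumption made for this sketch).
    weights = weights / weights.sum()
    return weights @ hidden_states


def mean_pooling(hidden_states):
    """Unweighted average of token vectors, shown for comparison."""
    return np.asarray(hidden_states, dtype=float).mean(axis=0)


# Toy inputs: three tokens with two-dimensional representations.
hidden = np.array([[1.0, 0.0],
                   [0.0, 1.0],
                   [1.0, 1.0]])
attn = np.array([[0.2, 0.4, 0.4],
                 [0.1, 0.6, 0.3],
                 [0.4, 0.4, 0.2]])

ditto_embedding = diagonal_attention_pooling(hidden, attn)
baseline_embedding = mean_pooling(hidden)
```

In this toy case the second token has the largest diagonal attention value, so it contributes most to the pooled vector, whereas mean pooling treats all three tokens equally. No parameters are learned in either function, which matches the postprocessing character described in the abstract.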
Pages: 5868-5875
Page count: 8