An Efficient Self-Supervised Cross-View Training For Sentence Embedding

Cited by: 0
Authors
Limkonchotiwat, Peerat [1]
Ponwitayarat, Wuttikorn [1]
Lowphansirikul, Lalita [1]
Udomcharoenchaikit, Can [1]
Chuangsuwanich, Ekapol [2]
Nutanong, Sarana [1]
Affiliations
[1] VISTEC, Sch Informat Sci & Technol, Rayong, Thailand
[2] Chulalongkorn Univ, Dept Comp Engn, Bangkok, Thailand
Keywords
Computational linguistics - Semantics;
DOI
10.1162/tacl_a_00620
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Self-supervised sentence representation learning is the task of constructing an embedding space for sentences without relying on human annotation efforts. One straightforward approach is to finetune a pretrained language model (PLM) with a representation learning method such as contrastive learning. While this approach achieves impressive performance on larger PLMs, the performance rapidly degrades as the number of parameters decreases. In this paper, we propose a framework called Self-supervised Cross-View Training (SCT) to narrow the performance gap between large and small PLMs. To evaluate the effectiveness of SCT, we compare it to five baseline and state-of-the-art competitors on seven Semantic Textual Similarity (STS) benchmarks using five PLMs with the number of parameters ranging from 4M to 340M. The experimental results show that SCT outperforms the competitors for PLMs with fewer than 100M parameters in 18 of 21 cases.
Pages: 1572 - 1587
Page count: 16
Related papers
50 results in total
  • [1] Learning Where to Learn in Cross-View Self-Supervised Learning
    Huang, Lang
    You, Shan
    Zheng, Mingkai
    Wang, Fei
    Qian, Chen
    Yamasaki, Toshihiko
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022: 14431 - 14440
  • [2] Self-supervised Cross-view Representation Reconstruction for Change Captioning
    Tu, Yunbin
    Li, Liang
    Su, Li
    Zha, Zheng-Jun
    Yan, Chenggang
    Huang, Qingming
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023: 2793 - 2803
  • [3] Self-supervised Feature Learning by Cross-modality and Cross-view Correspondences
    Jing, Longlong
    Zhang, Ling
    Tian, Yingli
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021: 1581 - 1591
  • [4] Cross-View Masked Model for Self-Supervised Graph Representation Learning
    Duan H.
    Yu B.
    Xie C.
    IEEE Transactions on Artificial Intelligence, 2024, 5 (11): 1 - 13
  • [5] Cross-View Temporal Contrastive Learning for Self-Supervised Video Representation
    Wang, Lulu
    Xu, Zengmin
    Zhang, Xuelian
    Meng, Ruxing
    Lu, Tao
    Computer Engineering and Applications, 2024, 60 (18) : 158 - 166
  • [6] On Robust Cross-view Consistency in Self-supervised Monocular Depth Estimation
    Zhao, Haimei
    Zhang, Jing
    Chen, Zhuo
    Yuan, Bo
    Tao, Dacheng
    MACHINE INTELLIGENCE RESEARCH, 2024, 21 (03) : 495 - 513
  • [7] On Robust Cross-view Consistency in Self-supervised Monocular Depth Estimation
    Haimei Zhao
    Jing Zhang
    Zhuo Chen
    Bo Yuan
    Dacheng Tao
    Machine Intelligence Research, 2024, 21 : 495 - 513
  • [8] Incremental Cross-view Mutual Distillation for Self-supervised Medical CT Synthesis
    Fang, Chaowei
    Wang, Liang
    Zhang, Dingwen
    Xu, Jun
    Yuan, Yixuan
    Han, Junwei
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022: 20645 - 20654
  • [9] CroCo: Self-Supervised Pre-training for 3D Vision Tasks by Cross-View Completion
    Weinzaepfel, Philippe
    Leroy, Vincent
    Lucas, Thomas
    Bregier, Romain
    Cabon, Yohann
    Arora, Vaibhav
    Antsfeld, Leonid
    Chidlovskii, Boris
    Csurka, Gabriela
    Revaud, Jerome
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022
  • [10] Group Identification via Transitional Hypergraph Convolution with Cross-view Self-supervised Learning
    Yang, Mingdai
    Liu, Zhiwei
    Yang, Liangwei
    Liu, Xiaolong
    Wang, Chen
    Peng, Hao
    Yu, Philip S.
    PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023: 2969 - 2979