Self-supervised scientific document recommendation based on contrastive learning

被引:1
|
作者
Tan, Shicheng [1 ,2 ,3 ]
Zhang, Tao [4 ]
Zhao, Shu [1 ,2 ,3 ]
Zhang, Yanping [1 ,2 ,3 ]
机构
[1] Anhui Univ, Artificial Intelligence Inst, Hefei 230601, Anhui, Peoples R China
[2] Anhui Univ, Sch Comp Sci & Technol, Hefei 230601, Anhui, Peoples R China
[3] Anhui Univ, Informat Mat & Intelligent Sensing Lab Anhui Prov, Hefei 230601, Anhui, Peoples R China
[4] Univ Illinois, Dept Comp Sci, Chicago, IL 60607 USA
基金
中国国家自然科学基金;
关键词
Scientific document recommendation; Contrastive learning; Self-supervision learning;
D O I
10.1007/s11192-023-04782-7
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Scientific document recommendation aims to recommend scientific documents that have similar content to a given target scientific document (e.g., paper or patent, etc.). With the explosive growth in scientific documents, how recommending relevant scientific documents from the massive number of scientific documents has become an extremely challenging problem. Existing unsupervised scientific document recommendation works use generic approaches of text representation learning, ignoring the relationships between paragraphs within scientific documents, which is important for highly logical scientific documents. This paper proposes a self-supervised learning method, coupled text pair embedding (CTPE) model, which captures paragraph relations within scientific documents based on contrastive learning. First, we divide the scientific document into two parts. The two parts from the same document are positive samples, and these from different documents are negative samples. Then, we uncover the paragraph relations by contrasting intra-document and inter-document pairs such that intra pairs have the maximum agreement via a contrastive loss in the document embedding space. Finally, we propose a similarity calculation among document embeddings to achieve scientific document recommendations. We perform experiments on three datasets for one patent and two paper recommendation tasks. The experimental results verify the effectiveness of the proposed model. The proposed model can help researchers to efficiently discover relevant literature, foster interdisciplinary connections, and guide their research efforts in the scientometrics community. (The code is available at https://github.com/aitsc/text-representation.)
引用
收藏
页码:5027 / 5049
页数:23
相关论文
共 50 条
  • [1] Self-supervised scientific document recommendation based on contrastive learning
    Shicheng Tan
    Tao Zhang
    Shu Zhao
    Yanping Zhang
    [J]. Scientometrics, 2023, 128 : 5027 - 5049
  • [2] Self-supervised Segment Contrastive Learning for Medical Document Representation
    Abro, Waheed Ahmed
    Kteich, Hanane
    Bouraoui, Zied
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, PT I, AIME 2024, 2024, 14844 : 312 - 321
  • [3] Self-supervised variational autoencoder towards recommendation by nested contrastive learning
    Jing Wang
    Jun Wu
    Caiyan Jia
    Zhifei Zhang
    [J]. Applied Intelligence, 2023, 53 : 18887 - 18897
  • [4] Self-supervised variational autoencoder towards recommendation by nested contrastive learning
    Wang, Jing
    Wu, Jun
    Jia, Caiyan
    Zhang, Zhifei
    [J]. APPLIED INTELLIGENCE, 2023, 53 (15) : 18887 - 18897
  • [5] Self-Supervised Learning for Recommendation
    Huang, Chao
    Xia, Lianghao
    Wang, Xiang
    He, Xiangnan
    Yin, Dawei
    [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 5136 - 5139
  • [6] A Survey on Contrastive Self-Supervised Learning
    Jaiswal, Ashish
    Babu, Ashwin Ramesh
    Zadeh, Mohammad Zaki
    Banerjee, Debapriya
    Makedon, Fillia
    [J]. TECHNOLOGIES, 2021, 9 (01)
  • [7] Adversarial Self-Supervised Contrastive Learning
    Kim, Minseon
    Tack, Jihoon
    Hwang, Sung Ju
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS (NEURIPS 2020), 2020, 33
  • [8] Self-Supervised Learning: Generative or Contrastive
    Liu, Xiao
    Zhang, Fanjin
    Hou, Zhenyu
    Mian, Li
    Wang, Zhaoyu
    Zhang, Jing
    Tang, Jie
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (01) : 857 - 876
  • [9] Self-Supervised Interest Transfer Network via Prototypical Contrastive Learning for Recommendation
    Sun, Guoqiang
    Shen, Yibin
    Zhou, Sijin
    Chen, Xiang
    Liu, Hongyan
    Wu, Chunming
    Lei, Chenyi
    Wei, Xianhui
    Fang, Fei
    [J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 4, 2023, : 4614 - 4622
  • [10] Self-Supervised Learning for Multimedia Recommendation
    Tao, Zhulin
    Liu, Xiaohao
    Xia, Yewei
    Wang, Xiang
    Yang, Lifang
    Huang, Xianglin
    Chua, Tat-Seng
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 5107 - 5116