A Short-Text Similarity Model Combining Semantic and Syntactic Information

被引:4
|
作者
Zhou, Ya [1 ]
Li, Cheng [1 ,2 ]
Huang, Guimin [1 ,2 ]
Guo, Qingkai [1 ,2 ]
Li, Hui [1 ,2 ]
Wei, Xiong [1 ,2 ]
机构
[1] Guilin Univ Elect Technol, Sch Comp Sci & Informat Secur, Guilin 541004, Peoples R China
[2] Guilin Univ Elect Technol, Guangxi Key Lab Image & Graph Intelligent Proc, Guilin 541004, Peoples R China
基金
中国国家自然科学基金;
关键词
short-text similarity; graph convolutional networks; BERT; constituency parse trees;
D O I
10.3390/electronics12143126
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As one of the prominent research directions in the field of natural language processing (NLP), short-text similarity has been widely used in search recommendation and question-and-answer systems. Most of the existing short textual similarity models focus on considering semantic similarity while overlooking the importance of syntactic similarity. In this paper, we first propose an enhanced knowledge language representation model based on graph convolutional networks (KEBERT-GCN), which effectively uses fine-grained word relations in the knowledge base to assess semantic similarity and model the relationship between knowledge structure and text structure. To fully leverage the syntactic information of sentences, we also propose a computational model of constituency parse trees based on tree kernels (CPT-TK), which combines syntactic information, semantic features, and attentional weighting mechanisms to evaluate syntactic similarity. Finally, we propose a comprehensive model that integrates both semantic and syntactic information to comprehensively evaluate short-text similarity. The experimental results demonstrate that our proposed short-text similarity model outperforms the models proposed in recent years, achieving a Pearson correlation coefficient of 0.8805 on the STS-B dataset.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Measuring the short text similarity based on semantic and syntactic information
    Yang, Jiaqi
    Li, Yongjun
    Gao, Congjie
    Zhang, Yinyin
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 114 : 169 - 180
  • [2] A Study of Using Syntactic Cues in Short-text Similarity Measure
    Huang, Po-Sen
    Chiu, Po-Sheng
    Chang, Jia-Wei
    Huang, Yueh-Min
    Lee, Ming-Che
    JOURNAL OF INTERNET TECHNOLOGY, 2019, 20 (03): : 839 - 850
  • [3] Combining Statistical Information and Semantic Similarity for Short Text Feature Extension
    Li, Xiaohong
    Su, Yun
    Ma, Huifang
    Cao, Lin
    INTELLIGENT INFORMATION PROCESSING VIII, 2016, 486 : 205 - 210
  • [4] Short-Text Semantic Similarity (STSS): Techniques, Challenges and Future Perspectives
    Amur, Zaira Hassan
    Hooi, Yew Kwang
    Bhanbhro, Hina
    Dahri, Kamran
    Soomro, Gul Muhammad
    APPLIED SCIENCES-BASEL, 2023, 13 (06):
  • [5] Exploiting Global Semantic Similarity Biterms for Short-text Topic Discovery
    Lu, Heng-yang
    Ge, Gao-jian
    Li, Yun
    Wang, Chong-jun
    Xie, Jun-yuan
    2018 IEEE 30TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2018, : 975 - 982
  • [6] SyMSS: A syntax-based measure for short-text semantic similarity
    Oliva, Jesus
    Ignacio Serrano, Jose
    Dolores del Castillo, Maria
    Iglesias, Angel
    DATA & KNOWLEDGE ENGINEERING, 2011, 70 (04) : 390 - 405
  • [7] Learning short-text semantic similarity with word embeddings and external knowledge sources
    Nguyen, Hien T.
    Duong, Phuc H.
    Cambria, Erik
    KNOWLEDGE-BASED SYSTEMS, 2019, 182
  • [8] Short Text Similarity Calculation Using Semantic Information
    Pu, Haoyu
    Fei, Gaolei
    Zhao, Hailin
    Hu, Guangmin
    Jiao, Chengbo
    Xu, Zhoujun
    2017 3RD INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING AND COMMUNICATIONS (BIGCOM), 2017, : 144 - 150
  • [9] Cross-Lingual Short-Text Semantic Similarity for Kannada-English Language Pair
    Muralikrishna, S. N.
    Holla, Raghurama
    Harivinod, N.
    Ganiga, Raghavendra
    COMPUTERS, 2024, 13 (09)
  • [10] Experimental study on short-text clustering using transformer-based semantic similarity measure
    Abdalgader, Khaled
    Matroud, Atheer A.
    Hossin, Khaled
    PEERJ COMPUTER SCIENCE, 2024, 10