A novel sentence similarity model with word embedding based on convolutional neural network

被引:20
|
作者
Yao, Haipeng [1 ]
Liu, Huiwen [1 ]
Zhang, Peiying [1 ,2 ]
机构
[1] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 100876, Peoples R China
[2] China Univ Petr, Coll Comp & Commun Engn, Qingdao, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
convolutional neural network; sentence similarity; word embedding;
D O I
10.1002/cpe.4415
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In this paper, we propose an effective model for the similarity metrics of English sentences. In the model, we first make use of word embedding and convolutional neural network (CNN) to produce a sentence vector and then leverage the information of the sentence vector pair to calculate the score of sentence similarity. Considering the case of long-range semantic dependencies between words, we propose a novel method transforming word embeddings to construct the three-dimensional sentence feature tensor. In addition, we incorporate the k-max pooling into the convolutional neural network to adapt to variable lengths of input sentences. The proposed model requires no external resource such as WordNet and parse tree. Meanwhile, it consumes very little time for training. Finally, we carried out extensive simulations to evaluate the performance of our model compared with other state-of-the-art works. Experimental results on SemEval 2014 task (SICK test corpus) indicated that our model can achieve a good performance in the terms of Pearson correlation coefficient, Spearman correlation coefficient, and mean squared errors. Furthermore, experimental results on Microsoft research paraphrase identification (MSRP) indicated that our model can achieve an excellent performance in the terms of F1 and Accuracy.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Security Enhanced Sentence Similarity Computing Model Based on Convolutional Neural Network
    Sun, Qifeng
    Huang, Xingzhe
    Kibalya, Godfrey
    Kumar, Neeraj
    Kumar, Santhosh S. V. N.
    Zhang, Peiying
    Xie, Dongliang
    IEEE ACCESS, 2021, 9 (09): : 104183 - 104196
  • [2] Sentence Embedding and Convolutional Neural Network for Semantic Textual Similarity Detection in Arabic Language
    Mahmoud, Adnen
    Zrigui, Mounir
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2019, 44 (11) : 9263 - 9274
  • [3] Sentence Embedding and Convolutional Neural Network for Semantic Textual Similarity Detection in Arabic Language
    Adnen Mahmoud
    Mounir Zrigui
    Arabian Journal for Science and Engineering, 2019, 44 : 9263 - 9274
  • [4] Building Energy Consumption Prediction Based on Word Embedding and Convolutional Neural Network
    Ji, Tianyao
    Wang, Tingshao
    Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science), 2021, 49 (06): : 40 - 48
  • [5] A novel model for semantic similarity measurement based on wordnet and word embedding
    Zhao, Fuqiang
    Zhu, Zhengyu
    Han, Ping
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 40 (05) : 9831 - 9842
  • [6] Sentence Semantic Similarity Model Using Convolutional Neural Networks
    Karthiga M.
    Sountharrajan S.
    Suganya E.
    Sankarananth S.
    EAI Endorsed Transactions on Energy Web, 2021, 8 (35) : 1 - 6
  • [7] Dependency-based Convolutional Neural Networks for Sentence Embedding
    Ma, Mingbo
    Huang, Liang
    Xiang, Bing
    Zhou, Bowen
    PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL) AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (IJCNLP), VOL 2, 2015, : 174 - 179
  • [8] Convolutional Neural Network with Contextualized Word Embedding for Text Classification
    Fan, Gaoyang
    Zhu, Cui
    Zhu, Wenjun
    2019 INTERNATIONAL CONFERENCE ON IMAGE AND VIDEO PROCESSING, AND ARTIFICIAL INTELLIGENCE, 2019, 11321
  • [9] Chinese Sentence Classification Based on Convolutional Neural Network
    Gu, Chengwei
    Wu, Ming
    Zhang, Chuang
    2017 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE APPLICATIONS AND TECHNOLOGIES (AIAAT 2017), 2017, 261
  • [10] Bilingual Word Embedding with Sentence Similarity Constraint for Machine Translation
    Wu, Kui
    Wang, Xuancong
    Aw, AiTi
    2017 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2017, : 119 - 122