A novel sentence similarity model with word embedding based on convolutional neural network

被引:20
|
作者
Yao, Haipeng [1 ]
Liu, Huiwen [1 ]
Zhang, Peiying [1 ,2 ]
机构
[1] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 100876, Peoples R China
[2] China Univ Petr, Coll Comp & Commun Engn, Qingdao, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
convolutional neural network; sentence similarity; word embedding;
D O I
10.1002/cpe.4415
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In this paper, we propose an effective model for the similarity metrics of English sentences. In the model, we first make use of word embedding and convolutional neural network (CNN) to produce a sentence vector and then leverage the information of the sentence vector pair to calculate the score of sentence similarity. Considering the case of long-range semantic dependencies between words, we propose a novel method transforming word embeddings to construct the three-dimensional sentence feature tensor. In addition, we incorporate the k-max pooling into the convolutional neural network to adapt to variable lengths of input sentences. The proposed model requires no external resource such as WordNet and parse tree. Meanwhile, it consumes very little time for training. Finally, we carried out extensive simulations to evaluate the performance of our model compared with other state-of-the-art works. Experimental results on SemEval 2014 task (SICK test corpus) indicated that our model can achieve a good performance in the terms of Pearson correlation coefficient, Spearman correlation coefficient, and mean squared errors. Furthermore, experimental results on Microsoft research paraphrase identification (MSRP) indicated that our model can achieve an excellent performance in the terms of F1 and Accuracy.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] HumourHindiNet: Humour detection in Hindi web series using word embedding and convolutional neural network
    Kumar, Akshi
    Mallik, Abhishek
    Kumar, Sanjay
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2024, 23 (07)
  • [42] Sentence Semantic Similarity based on Word FiImbedding and WordNet
    Farouk, Mamdouh
    PROCEEDINGS OF 2018 13TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND SYSTEMS (ICCES), 2018, : 33 - 37
  • [43] Spatial Steganalysis of Low Embedding Rate Based on Convolutional Neural Network
    Shen, Jun
    Liao, Xin
    Qin, Zheng
    Liu, Xu-Chong
    Ruan Jian Xue Bao/Journal of Software, 2021, 32 (09): : 2901 - 2915
  • [44] Clause Sentiment Identification Based on Convolutional Neural Network With Context Embedding
    Chen, Peng
    Xu, Bing
    Yang, Muyun
    Li, Sheng
    2016 12TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2016, : 1532 - 1538
  • [45] Knowledge Graph Embedding Based on Quaternion Transformation and Convolutional Neural Network
    Gao, Yabin
    Tian, Xiaoyun
    Zhou, Jing
    Zheng, Bin
    Li, Hairu
    Zhu, Zizhong
    ADVANCED DATA MINING AND APPLICATIONS, ADMA 2021, PT II, 2022, 13088 : 128 - 136
  • [46] The Euclidean embedding learning based on convolutional neural network for stereo matching
    Yang, Menglong
    Liu, Yiguang
    You, Zhisheng
    NEUROCOMPUTING, 2017, 267 : 195 - 200
  • [47] Chinese Sentence Similarity based on Word Context and Semantic
    Gu, Tianjiao
    Ren, Fuji
    IEEE NLP-KE 2009: PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, 2009, : 535 - 539
  • [48] DeepPatent: patent classification with convolutional neural networks and word embedding
    Li, Shaobo
    Hu, Jie
    Cui, Yuxin
    Hu, Jianjun
    SCIENTOMETRICS, 2018, 117 (02) : 721 - 744
  • [49] DeepPatent: patent classification with convolutional neural networks and word embedding
    Shaobo Li
    Jie Hu
    Yuxin Cui
    Jianjun Hu
    Scientometrics, 2018, 117 : 721 - 744
  • [50] Phenotype Extraction Extraction Based on Word Embedding to Sentence Embedding Cascaded Approach
    Xing, Wenhui
    Yuan, Xiaohui
    Li, Lin
    Hu, Lun
    Peng, Jing
    IEEE TRANSACTIONS ON NANOBIOSCIENCE, 2018, 17 (03) : 172 - 180