A novel sentence similarity model with word embedding based on convolutional neural network

被引:20
|
作者
Yao, Haipeng [1 ]
Liu, Huiwen [1 ]
Zhang, Peiying [1 ,2 ]
机构
[1] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 100876, Peoples R China
[2] China Univ Petr, Coll Comp & Commun Engn, Qingdao, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
convolutional neural network; sentence similarity; word embedding;
D O I
10.1002/cpe.4415
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In this paper, we propose an effective model for the similarity metrics of English sentences. In the model, we first make use of word embedding and convolutional neural network (CNN) to produce a sentence vector and then leverage the information of the sentence vector pair to calculate the score of sentence similarity. Considering the case of long-range semantic dependencies between words, we propose a novel method transforming word embeddings to construct the three-dimensional sentence feature tensor. In addition, we incorporate the k-max pooling into the convolutional neural network to adapt to variable lengths of input sentences. The proposed model requires no external resource such as WordNet and parse tree. Meanwhile, it consumes very little time for training. Finally, we carried out extensive simulations to evaluate the performance of our model compared with other state-of-the-art works. Experimental results on SemEval 2014 task (SICK test corpus) indicated that our model can achieve a good performance in the terms of Pearson correlation coefficient, Spearman correlation coefficient, and mean squared errors. Furthermore, experimental results on Microsoft research paraphrase identification (MSRP) indicated that our model can achieve an excellent performance in the terms of F1 and Accuracy.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Attention pooling-based convolutional neural network for sentence modelling
    Er, Meng Joo
    Zhang, Yong
    Wang, Ning
    Pratama, Mahardhika
    INFORMATION SCIENCES, 2016, 373 : 388 - 403
  • [32] A novel malware classification and augmentation model based on convolutional neural network
    Tekerek, Adem
    Yapici, Muhammed Mutlu
    COMPUTERS & SECURITY, 2022, 112
  • [33] A Tsallis' statistics based neural network model for novel word learning
    Hadzibeganovic, Tarik
    Cannas, Sergio A.
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2009, 388 (05) : 732 - 746
  • [34] Structured Word Embedding for Low Memory Neural Network Language Model
    Shi, Kaiyu
    Yu, Kai
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1254 - 1258
  • [35] Autoencoder-convolutional neural network-based embedding and extraction model for image watermarking
    Mahapatra, Debolina
    Amrit, Preetam
    Singh, Om Prakash
    Singh, Amit Kumar
    Agrawal, Amrit Kumar
    JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (02)
  • [36] Multi-omics Data Integration Model Based on UMAP Embedding and Convolutional Neural Network
    ElKarami, Bashier
    Alkhateeb, Abedalrhman
    Qattous, Hazem
    Alshomali, Lujain
    Shahrrava, Behnam
    CANCER INFORMATICS, 2022, 21
  • [37] Neural sentence embedding models for semantic similarity estimation in the biomedical domain
    Blagec, Kathrin
    Xu, Hong
    Agibetov, Asan
    Samwald, Matthias
    BMC BIOINFORMATICS, 2019, 20 (1)
  • [38] Multi-omics Data Integration Model Based on UMAP Embedding and Convolutional Neural Network
    ElKarami, Bashier
    Alkhateeb, Abedalrhman
    Qattous, Hazem
    Alshomali, Lujain
    Shahrrava, Behnam
    CANCER INFORMATICS, 2022, 21
  • [39] Neural sentence embedding models for semantic similarity estimation in the biomedical domain
    Kathrin Blagec
    Hong Xu
    Asan Agibetov
    Matthias Samwald
    BMC Bioinformatics, 20
  • [40] Learning Word and Sentence Embeddings Using a Generative Convolutional Network
    Vargas-Ocampo, Edgar
    Roman-Rangel, Edgar
    Hermosillo-Valadez, Jorge
    PATTERN RECOGNITION, 2018, 10880 : 135 - 144