Sentence modeling via multiple word embeddings and multi-level comparison for semantic textual similarity

Cited by: 43
Authors
Nguyen Huy Tien [1]
Nguyen Minh Le [1]
Tomohiro, Yamasaki [2]
Tatsuya, Izuha [2]
Affiliations
[1] Japan Advanced Institute of Science and Technology (JAIST), Nomi, Japan
[2] Toshiba Research & Development Center, Kawasaki, Kanagawa, Japan
Keywords
Multiple word embeddings; Sentence embedding; Semantic; Similarity; Multi-level comparison; Representation
DOI
10.1016/j.ipm.2019.102090
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Recently, representing words with a pretrained word embedding has proved successful in many natural language processing tasks. Depending on their objective functions, different word embedding models capture different aspects of linguistic properties. The Semantic Textual Similarity task, which evaluates the similarity/relation between two sentences, requires taking these linguistic aspects into account. Therefore, this research aims to encode various characteristics from multiple sets of word embeddings into one embedding and then learn the similarity/relation between sentences via this novel embedding. Representing each word by multiple word embeddings, the proposed MaxLSTM-CNN encoder generates a novel sentence embedding. We then learn the similarity/relation between our sentence embeddings via multi-level comparison. Our method, M-MaxLSTM-CNN, consistently shows strong performance on several tasks (i.e., measuring textual similarity, identifying paraphrases, and recognizing textual entailment). Our model does not use hand-crafted features (e.g., alignment features, n-gram overlaps, dependency features), nor does it require the pre-trained word embeddings to have the same dimension.
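The idea of representing each word with several embeddings of different dimensions, and then comparing two sentence vectors in more than one way, can be illustrated with a toy sketch. This is not the MaxLSTM-CNN encoder or the paper's multi-level comparison: the lookup tables `EMB_A` and `EMB_B`, the concatenation step, the element-wise max pooling, and the three comparison features are all assumptions chosen for illustration only.

```python
import math

# Hypothetical toy lookup tables: two embedding sets with different dimensions.
EMB_A = {"cat": [0.9, 0.1], "dog": [0.8, 0.2], "car": [0.1, 0.9]}                  # 2-d
EMB_B = {"cat": [0.7, 0.2, 0.1], "dog": [0.6, 0.3, 0.1], "car": [0.0, 0.1, 0.9]}  # 3-d


def word_vector(word):
    """Concatenate the word's vectors from every embedding set (assumed scheme)."""
    return EMB_A[word] + EMB_B[word]


def sentence_vector(words):
    """Element-wise max over word vectors, a simple stand-in for a learned encoder."""
    vectors = [word_vector(w) for w in words]
    return [max(column) for column in zip(*vectors)]


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm


def compare(u, v):
    """Comparison features: one whole-vector score and two per-dimension vectors."""
    return {
        "cosine": cosine(u, v),
        "abs_diff": [abs(a - b) for a, b in zip(u, v)],
        "product": [a * b for a, b in zip(u, v)],
    }


if __name__ == "__main__":
    s1 = sentence_vector(["cat", "dog"])
    s2 = sentence_vector(["cat", "car"])
    print(compare(s1, s2))
```

Because the embeddings are concatenated rather than summed, the two sets never need to share a dimension. In a trained model the comparison features would typically feed a classifier or regressor rather than be read directly.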
Pages: 11