An Algorithm of Semantic Similarity Between Words Based on Word Single-meaning Embedding Model

被引:0
|
作者
Li, Xiao-Tao [1 ]
You, Shu-Juan [1 ]
Chen, Wai [1 ]
机构
[1] China Mobile Research Institute, Beijing,100053, China
来源
关键词
Semantics - Classification (of information) - Natural language processing systems;
D O I
10.16383/j.aas.c180312
中图分类号
学科分类号
摘要
We propose a novel algorithm of semantic similarity between words, based on our word single-meaning embedding model, to address the issue of existing word-embedding-based approaches that have low computation accuracy in polysemous words, nonadjacent words and synonyms. Differently from the existing word embedding models, each polysemous word is decomposed into a series of monosemous words in our model, and there is a one-to-one correspondence between a word meaning and a vector. First of all, the word sense disambiguation (WSD) of polysemous words in different contexts of the corpus is achieved with the help of the prior classification information contained in Tongyici Cilin. Then, the word single-meaning embeddings are learned from the processed corpus and realize the precise expression for each word meaning, and as far as we know, no existing word embedding model could complete this task. At last, two test words are decomposed into marked monosemous words according to the number of meaning and expanded with synonyms, and then semantic relatedness between words is computed based on the word single-meaning embedding model and Tongyici Cilin. The experimental results showed our method can significantly improve the computation accuracy of polysemous words, nonadjacent words and synonyms. Copyright © 2020 Acta Automatica Sinica. All rights reserved.
引用
收藏
页码:1654 / 1669
相关论文
共 50 条
  • [1] Semantic Similarity of Inverse Morpheme Words Based on Word Embedding
    Zhou, Jiaomei
    Liu, Zhiying
    [J]. CHINESE LEXICAL SEMANTICS, CLSW 2021, PT I, 2022, 13249 : 452 - 463
  • [2] Enhancing Accuracy of Semantic Relatedness Measurement by Word Single-Meaning Embeddings
    Li, Xiaotao
    You, Shujuan
    Chen, Wai
    [J]. IEEE ACCESS, 2021, 9 : 117424 - 117433
  • [3] A novel model for semantic similarity measurement based on wordnet and word embedding
    Zhao, Fuqiang
    Zhu, Zhengyu
    Han, Ping
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 40 (05) : 9831 - 9842
  • [4] Concreteness effects in single-meaning, multi-meaning and newly acquired words
    Palmer, Shekeila D.
    MacGregor, Lucy J.
    Havelka, Jelena
    [J]. BRAIN RESEARCH, 2013, 1538 : 135 - 150
  • [5] Word Embedding based Textual Semantic Similarity Measure in Bengali
    Iqbal, Md Asif
    Sharif, Omar
    Hoque, Mohammed Moshiul
    Sarker, Iqbal H.
    [J]. 10TH INTERNATIONAL YOUNG SCIENTISTS CONFERENCE IN COMPUTATIONAL SCIENCE (YSC2021), 2021, 193 : 92 - 101
  • [6] An Improved Algorithm of Word Semantic Similarity Based on HowNet
    Kang, Bocheng
    Qi, Junpeng
    [J]. 2022 16TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP2022), VOL 1, 2022, : 266 - 271
  • [7] Combining Word Embedding and Semantic Lexicon for Chinese Word Similarity Computation
    Pei, Jiahuan
    Zhang, Cong
    Huang, Degen
    Ma, Jianjun
    [J]. NATURAL LANGUAGE UNDERSTANDING AND INTELLIGENT APPLICATIONS (NLPCC 2016), 2016, 10102 : 766 - 777
  • [8] Semantic Word Rank Algorithm Based on the Relation Degree of the Words
    Han, Huijian
    Fu, Kai
    Sun, Xiusheng
    Li, Zhenxian
    [J]. PROCEEDINGS OF 2016 12TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY (CIS), 2016, : 161 - 164
  • [9] A Laplacian Eigenmaps Based Semantic Similarity Measure between Words
    Wu, Yuming
    Cao, Cungen
    Wang, Shi
    Wang, Dongsheng
    [J]. INTELLIGENT INFORMATION PROCESSING V, 2010, 340 : 291 - 296
  • [10] Short Text Clustering based on Word Semantic Graph with Word Embedding Model
    Jinarat, Supakpong
    Manaskasemsak, Bundit
    Rungsawang, Arnon
    [J]. 2018 JOINT 10TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS (SCIS) AND 19TH INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS (ISIS), 2018, : 1427 - 1432