Modeling multi-prototype Chinese word representation learning for word similarity

Cited by: 2
Authors
Yin, Fulian [1]
Wang, Yanyan [1]
Liu, Jianbo [1]
Tosato, Marco [2]
Affiliations
[1] Commun Univ China, Inst Informat & Commun, Beijing 100024, Peoples R China
[2] York Univ, Lab Ind & Appl Math, Toronto, ON M3J 1P3, Canada
Funding
National Natural Science Foundation of China
Keywords
Chinese word representation; Multi-prototype; Synonym knowledge base; Word semantic disambiguation; ONTOLOGY-BASED METHODS; EMBEDDINGS; SENTIMENT;
DOI
10.1007/s40747-021-00482-y
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The word similarity task computes the similarity of any pair of words and is a basic technology of natural language processing (NLP). Existing methods are based on word embeddings, which fail to capture polysemy and are strongly affected by the quality of the corpus. In this paper, we propose a multi-prototype Chinese word representation model (MP-CWR) for word similarity based on a synonym knowledge base; it comprises a knowledge representation module and a word similarity module. In the first module, we propose a dual attention mechanism that combines semantic information to jointly learn word knowledge representations. The MP-CWR model uses synonyms as prior knowledge to supplement the relationships between words, which helps address the difficulty of semantic expression caused by insufficient data. In the word similarity module, we propose a multi-prototype representation for each word, then calculate and fuse the conceptual similarities of the two words to obtain the final result. Finally, we verify the effectiveness of our model on three public data sets against other baseline models. In addition, the experiments demonstrate the stability and scalability of the MP-CWR model under different corpora.
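As a rough illustration of the multi-prototype idea in the abstract, the sketch below gives each word several sense vectors, scores every pair of senses with cosine similarity, and fuses the scores into one value. The abstract does not state the fusion rule or how the prototypes are learned, so the "max" and "mean" options, the toy vectors, and the function names `cosine` and `multi_prototype_similarity` are all assumptions made for illustration and are not the MP-CWR implementation.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def multi_prototype_similarity(protos_a, protos_b, fuse="max"):
    """Fuse the pairwise sense similarities of two words into one score.

    protos_a, protos_b: lists of sense (prototype) vectors, one list per word.
    fuse: "max" keeps the best-matching sense pair, "mean" averages all pairs.
    Both fusion rules are illustrative assumptions, not taken from the paper.
    """
    scores = [cosine(p, q) for p in protos_a for q in protos_b]
    if not scores:
        raise ValueError("each word needs at least one prototype vector")
    if fuse == "max":
        return max(scores)
    if fuse == "mean":
        return sum(scores) / len(scores)
    raise ValueError(f"unknown fusion rule: {fuse}")


# Toy example: a polysemous word with two senses versus a word with one sense.
word_a = [[1.0, 0.0], [0.0, 1.0]]  # hypothetical sense vectors
word_b = [[1.0, 0.0]]
print(multi_prototype_similarity(word_a, word_b, fuse="max"))
print(multi_prototype_similarity(word_a, word_b, fuse="mean"))
```

With a single-vector embedding the two senses of the first word would be averaged into one point; keeping them separate lets the best-matching sense drive the score when "max" fusion is used.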
Pages: 2977-2990
Page count: 14