Exploring Chinese word embedding with similar context and reinforcement learning

被引:1
|
作者
Zhang, Yun [1 ]
Liu, Yongguo [1 ]
Li, Dongxiao [2 ]
Zhai, Shuangqing [3 ]
机构
[1] Univ Elect Sci & Technol China, Sch Informat & Software Engn, Knowledge & Data Engn Lab Chinese Med, Chengdu 610054, Peoples R China
[2] Sichuan Acad Chinese Med Sci, Chengdu 610041, Peoples R China
[3] Beijing Univ Chinese Med, Sch Basic Med Sci, Beijing 100029, Peoples R China
来源
NEURAL COMPUTING & APPLICATIONS | 2022年 / 34卷 / 24期
基金
国家重点研发计划;
关键词
Chinese word embedding; Irrelevant neighbouring word; Similar context; Reinforcement learning;
D O I
10.1007/s00521-022-07672-w
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Chinese word embedding has attracted considerable attention in the field of natural language processing. Existing methods model the relation between target and neighbouring contextual words. However, with the phenomenon of irrelevant neighbouring words in Chinese, these methods are limited in capturing and understanding the semantics of Chinese words. In this study, we designed sc2vec to explore Chinese word embeddings by proposing a similar context to reduce the influence of the above problem and comprehend relevant semantics of Chinese words. Meanwhile, to enhance the learning architecture, sc2vec was modelled with reinforcement learning to generate high-quality Chinese word embeddings, regarding continuous bag-of-words and skip-gram models as two actions of an agent over a corpus. The results on word analogy, word similarity, named entity recognition, and text classification tasks demonstrate that the proposed model outperforms most state-of-the-art approaches.
引用
收藏
页码:22287 / 22302
页数:16
相关论文
共 50 条
  • [41] Waste Not: Meta-Embedding of Word and Context Vectors
    Degirmenci, Selin
    Gerek, Aydin
    Ganiz, Murat Can
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS (NLDB 2019), 2019, 11608 : 393 - 401
  • [42] Learning from context: A mutual reinforcement model for Chinese microblog opinion retrieval
    Jingjing Wei
    Xiangwen Liao
    Houdong Zheng
    Guolong Chen
    Xueqi Cheng
    Frontiers of Computer Science, 2018, 12 : 714 - 724
  • [43] Learning from context: A mutual reinforcement model for Chinese microblog opinion retrieval
    Wei, Jingjing
    Liao, Xiangwen
    Zheng, Houdong
    Chen, Guolong
    Cheng, Xueqi
    FRONTIERS OF COMPUTER SCIENCE, 2018, 12 (04) : 714 - 724
  • [44] Word and Document Embedding with vMF-Mixture Priors on Context Word Vectors
    Jameel, Shoaib
    Schockaert, Steven
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 3319 - 3328
  • [45] Attention-Enabled Multi-layer Subword Joint Learning for Chinese Word Embedding
    Xue, Pengpeng
    Xiong, Jing
    Tan, Liang
    Liu, Zhongzhu
    Liu, Kanglong
    COGNITIVE COMPUTATION, 2025, 17 (02)
  • [46] Context and repetition in word learning
    Horst, Jessica S.
    FRONTIERS IN PSYCHOLOGY, 2013, 4
  • [47] TransPhrase: A new method for generating phrase embedding from word embedding in Chinese
    Li, Rongsheng
    Huang, Shaobin
    Mao, Xiangke
    He, Jie
    Shen, Linshan
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 168
  • [48] An Improved Embedding Matching Model for Chinese Word Segmentation
    Deng, Xiaolong
    Sun, Yingfei
    2018 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND BIG DATA (ICAIBD), 2018, : 195 - 200
  • [49] Chinese Textual Entailment Recognition Enhanced with Word Embedding
    Zhang, Zhichang
    Yao, Dongren
    Pang, Yali
    Lu, Xiaoyong
    CHINESE COMPUTATIONAL LINGUISTICS AND NATURAL LANGUAGE PROCESSING BASED ON NATURALLY ANNOTATED BIG DATA (CCL 2015), 2015, 9427 : 89 - 100
  • [50] Mapping Senses in BabelNet to Chinese Based on Word Embedding
    Meng, Fanqing
    Lu, Wenpeng
    Xue, Ruojuan
    2017 10TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI), 2017,