Exploring Chinese word embedding with similar context and reinforcement learning

被引:1
|
作者
Zhang, Yun [1 ]
Liu, Yongguo [1 ]
Li, Dongxiao [2 ]
Zhai, Shuangqing [3 ]
机构
[1] Univ Elect Sci & Technol China, Sch Informat & Software Engn, Knowledge & Data Engn Lab Chinese Med, Chengdu 610054, Peoples R China
[2] Sichuan Acad Chinese Med Sci, Chengdu 610041, Peoples R China
[3] Beijing Univ Chinese Med, Sch Basic Med Sci, Beijing 100029, Peoples R China
来源
NEURAL COMPUTING & APPLICATIONS | 2022年 / 34卷 / 24期
基金
国家重点研发计划;
关键词
Chinese word embedding; Irrelevant neighbouring word; Similar context; Reinforcement learning;
D O I
10.1007/s00521-022-07672-w
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Chinese word embedding has attracted considerable attention in the field of natural language processing. Existing methods model the relation between target and neighbouring contextual words. However, with the phenomenon of irrelevant neighbouring words in Chinese, these methods are limited in capturing and understanding the semantics of Chinese words. In this study, we designed sc2vec to explore Chinese word embeddings by proposing a similar context to reduce the influence of the above problem and comprehend relevant semantics of Chinese words. Meanwhile, to enhance the learning architecture, sc2vec was modelled with reinforcement learning to generate high-quality Chinese word embeddings, regarding continuous bag-of-words and skip-gram models as two actions of an agent over a corpus. The results on word analogy, word similarity, named entity recognition, and text classification tasks demonstrate that the proposed model outperforms most state-of-the-art approaches.
引用
收藏
页码:22287 / 22302
页数:16
相关论文
共 50 条
  • [21] Hate Speech Detection using Word Embedding and Deep Learning in the Arabic Language Context
    Faris, Hossam
    Aljarah, Ibrahim
    Habib, Maria
    Castillo, Pedro A.
    ICPRAM: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS, 2020, : 453 - 460
  • [22] Exploring the effectiveness of word embedding based deep learning model for improving email classification
    Asudani, Deepak Suresh
    Nagwani, Naresh Kumar
    Singh, Pradeep
    DATA TECHNOLOGIES AND APPLICATIONS, 2022, 56 (04) : 483 - 505
  • [23] Industrial Memories: Exploring the Findings of Government Inquiries with Neural Word Embedding and Machine Learning
    Leavy, Susan
    Pine, Emilie
    Keane, Mark T.
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2018, PT III, 2019, 11053 : 687 - 690
  • [24] Interaction, modality, and word engagement as factors in lexical learning in a Chinese context
    Niu, Ruiying
    Helms-Park, Rena
    LANGUAGE TEACHING RESEARCH, 2014, 18 (03) : 345 - 372
  • [25] Attention-Based Chinese Word Embedding
    Liang, Yiyuan
    Zhang, Wei
    Yang, Kehua
    CLOUD COMPUTING AND SECURITY, PT IV, 2018, 11066 : 277 - 287
  • [26] Learning Dimensional Sentiment of Traditional Chinese Words with Word Embedding and Support Vector Regression
    Li, Baoli
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2016, : 324 - 327
  • [27] Integrating Character Representations into Chinese Word Embedding
    Leshan Normal University, China
    Lect. Notes Comput. Sci.,
  • [28] Integrating Character Representations into Chinese Word Embedding
    Chen, Xingyuan
    Jin, Peng
    McCarthy, Diana
    Carroll, John
    CHINESE LEXICAL SEMANTICS, CLSW 2016, 2016, 10085 : 335 - 349
  • [29] Resolving Chinese Zero Pronoun with Word Embedding
    Liu, Bingquan
    Du, Xinkai
    Liu, Ming
    Sun, Chengjie
    Zheng, Guidong
    Zou, Chao
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2017, 2018, 10619 : 828 - 838
  • [30] Pronunciation-Enhanced Chinese Word Embedding
    Yang, Qinjuan
    Xie, Haoran
    Cheng, Gary
    Wang, Fu Lee
    Rao, Yanghui
    COGNITIVE COMPUTATION, 2021, 13 (03) : 688 - 697