Chinese Named Entity Recognition with Character-Word Mixed Embedding

被引:19
|
作者
Shijia, E. [1 ]
Xiang, Yang [1 ]
机构
[1] Tongji Univ, Shanghai 201804, Peoples R China
基金
中国国家自然科学基金;
关键词
named entity recognition; word embedding; character embedding;
D O I
10.1145/3132847.3133088
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Named Entity Recognition (NER) is an important basis for the tasks in natural language processing such as relation extraction, entity linking and so on. The common method of existing Chinese NER systems is to use the character sequence as the input, and the intention is to avoid the word segmentation. However, the character sequence cannot express enough semantic information, so that the recognition accuracy of Chinese NER is not as good as western language such as English. To solve this issue, we propose a Chinese NER method based on Character-Word Mixed Embedding (CWME), and the method is in accord with the pipeline of Chinese natural language processing. Our experiments show that incorporating CWME can effectively improve the performance for the Chinese corpus with state-of-the-art neural architectures widely used in NER, and the proposed method yields nearly 9% absolute improvement over previously results.
引用
收藏
页码:2055 / 2058
页数:4
相关论文
共 50 条
  • [1] Chinese Named Entity Recognition Based on Character-Word Vector Fusion
    Ye, Na
    Qin, Xin
    Dong, Lili
    Zhang, Xiang
    Sun, Kangkang
    [J]. WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2020, 2020
  • [2] Enhanced character embedding for Chinese named entity recognition
    Jia, Bingjing
    Wu, Zhongli
    Wu, Bin
    Liu, Yutong
    Zhou, Pengpeng
    [J]. MEASUREMENT & CONTROL, 2020, 53 (9-10): : 1669 - 1681
  • [3] Resolving Entity Morphs based on Character-Word Embedding
    Sha, Ying
    Shi, Zhenhui
    Li, Rui
    Liang, Qi
    Wang, Bin
    [J]. INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE (ICCS 2017), 2017, 108 : 48 - 57
  • [4] Named Entity Recognition in Government Audit Texts Based on ChineseBERT and Character-Word Fusion
    Huang, Baohua
    Lin, Yunjie
    Pang, Si
    Fu, Long
    [J]. APPLIED SCIENCES-BASEL, 2024, 14 (04):
  • [5] Word-Character Graph Convolution Network for Chinese Named Entity Recognition
    Tang, Zhuo
    Wan, Boyan
    Yang, Li
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 1520 - 1532
  • [6] A Chinese Named Entity Recognition Method Based on Fusion of Character and Word Features
    Chai, Wenguang
    Wang, Jiazhen
    [J]. 2022 IEEE 14TH INTERNATIONAL CONFERENCE ON ADVANCED INFOCOMM TECHNOLOGY (ICAIT 2022), 2022, : 308 - 313
  • [7] Research on named entity recognition of chinese electronic medical records based on multi-head attention mechanism and character-word information fusion
    Zhang, Qinghui
    Wu, Meng
    Lv, Pengtao
    Zhang, Mengya
    Yang, Hongwei
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 42 (04) : 4105 - 4116
  • [8] Entity slot recognition based on data enhancement and character-word fusion features
    Liu, Zhenyuan
    Xu, Mingyang
    Wang, Chengtao
    [J]. Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2022, 50 (11): : 101 - 106
  • [9] Chinese Named Entity Recognition Method in Electricity Based on Combining Character Sequence and Word Sequence
    Yang, Shuaisong
    Gao, Yankun
    Wang, Jingdong
    Meng, Fanqi
    Guo, Shuqiang
    Zhou, Lina
    [J]. Journal of Network Intelligence, 2022, 7 (04): : 1066 - 1082
  • [10] Hierarchical Lexicon Embedding Architecture for Chinese Named Entity Recognition
    Hu, Jiahao
    Ouyang, Yuanxin
    Li, Chen
    Wang, Chuanrui
    Rong, Wenge
    Xiong, Zhang
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2021, PT V, 2021, 12895 : 345 - 356