Word-Character Graph Convolution Network for Chinese Named Entity Recognition

被引:23
|
作者
Tang, Zhuo [1 ,2 ]
Wan, Boyan [1 ,2 ]
Yang, Li [3 ]
机构
[1] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha 410082, Peoples R China
[2] Natl Supercomp Ctr Changsha, Changsha 410082, Peoples R China
[3] Changsha Univ Sci & Technol, Coll Comp & Commun Engn, Changsha 410076, Peoples R China
基金
中国国家自然科学基金;
关键词
Task analysis; Lattices; Standards; Training; Earth Observing System; Speech processing; Convolution; Named entity recognition; graph convolutional network; attention mechanism; word-character DAGs;
D O I
10.1109/TASLP.2020.2994436
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Recent researches try to integrate word information into the character-based Chinese NER by modifying the structure of the standard BiLSTM-CRF model. They follow the paradigm of explicitly modeling forward and backward sequences, adopting an LSTM variant that takes both characters and words as input for each direction. Though enriching the representations, these models cannot fully exploit the interaction between future and past contexts. In this paper, we propose a novel word-character graph convolution network (WC-GCN) which uses a cross GCN block to simultaneously process the word-character directed acyclic graphs (DAGs) of two directions. To improve the capture of long-distance dependency, a global attention GCN block is introduced to learn node representations conditioned on a global context. In both blocks, unlike previous works where each word is attached to its associated character or taken as a shortcut between LSTM cells, words and characters are treated equally as nodes in the graph and have their instance-specific representations. Experiments on four widely used datasets show that our proposed model can work standalone or with the standard BiLSTM. Both forms can outperform previous LSTM-based models without training on extra corpora while only an external lexicon and its corresponding pretrained character and word embeddings are needed.
引用
收藏
页码:1520 / 1532
页数:13
相关论文
共 50 条
  • [21] Enriching Word Information Representation for Chinese Cybersecurity Named Entity Recognition
    Yang, Dongying
    Lian, Tao
    Zheng, Wen
    Zhao, Cai
    [J]. NEURAL PROCESSING LETTERS, 2023, 55 (06) : 7689 - 7707
  • [22] Incorporating word⁃set attention into Chinese named entity recognition Method
    Zhong, Shi-Sheng
    Chen, Xi
    Zhao, Ming-Hang
    Zhang, Yong-Jian
    [J]. Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2022, 52 (05): : 1098 - 1105
  • [23] Enriching Word Information Representation for Chinese Cybersecurity Named Entity Recognition
    Dongying Yang
    Tao Lian
    Wen Zheng
    Cai Zhao
    [J]. Neural Processing Letters, 2023, 55 : 7689 - 7707
  • [24] Robust Chinese Named Entity Recognition Based on Fusion Graph Embedding
    Song, Xuhui
    Yu, Hongtao
    Li, Shaomei
    Wang, Huansha
    [J]. ELECTRONICS, 2023, 12 (03)
  • [25] Named Entity Recognition as Graph Classification
    Harrando, Ismail
    Troncy, Raphael
    [J]. SEMANTIC WEB: ESWC 2021 SATELLITE EVENTS, 2021, 12739 : 103 - 108
  • [26] Character-level neural network for biomedical named entity recognition
    Gridach, Mourad
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2017, 70 : 85 - 91
  • [27] Lexicon enhanced Chinese named entity recognition with pointer network
    Guo, Qian
    Guo, Yi
    [J]. NEURAL COMPUTING & APPLICATIONS, 2022, 34 (17): : 14535 - 14555
  • [28] Empirical Exploring Word-Character Relationship for Chinese Sentence Representation
    Wang, Shaonan
    Zhang, Jiajun
    Zong, Chengqing
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2018, 17 (03)
  • [29] Named Entity Recognition for Chinese microblog with Convolutional Neural Network
    Zhang, Liang
    Zhao, Huan
    [J]. 2017 13TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2017, : 87 - 92
  • [30] Lexicon enhanced Chinese named entity recognition with pointer network
    Qian Guo
    Yi Guo
    [J]. Neural Computing and Applications, 2022, 34 : 14535 - 14555