Word-Character Graph Convolution Network for Chinese Named Entity Recognition

被引:23
|
作者
Tang, Zhuo [1 ,2 ]
Wan, Boyan [1 ,2 ]
Yang, Li [3 ]
机构
[1] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha 410082, Peoples R China
[2] Natl Supercomp Ctr Changsha, Changsha 410082, Peoples R China
[3] Changsha Univ Sci & Technol, Coll Comp & Commun Engn, Changsha 410076, Peoples R China
基金
中国国家自然科学基金;
关键词
Task analysis; Lattices; Standards; Training; Earth Observing System; Speech processing; Convolution; Named entity recognition; graph convolutional network; attention mechanism; word-character DAGs;
D O I
10.1109/TASLP.2020.2994436
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Recent researches try to integrate word information into the character-based Chinese NER by modifying the structure of the standard BiLSTM-CRF model. They follow the paradigm of explicitly modeling forward and backward sequences, adopting an LSTM variant that takes both characters and words as input for each direction. Though enriching the representations, these models cannot fully exploit the interaction between future and past contexts. In this paper, we propose a novel word-character graph convolution network (WC-GCN) which uses a cross GCN block to simultaneously process the word-character directed acyclic graphs (DAGs) of two directions. To improve the capture of long-distance dependency, a global attention GCN block is introduced to learn node representations conditioned on a global context. In both blocks, unlike previous works where each word is attached to its associated character or taken as a shortcut between LSTM cells, words and characters are treated equally as nodes in the graph and have their instance-specific representations. Experiments on four widely used datasets show that our proposed model can work standalone or with the standard BiLSTM. Both forms can outperform previous LSTM-based models without training on extra corpora while only an external lexicon and its corresponding pretrained character and word embeddings are needed.
引用
收藏
页码:1520 / 1532
页数:13
相关论文
共 50 条
  • [1] Applications of Named Entity Recognition Using Graph Convolution Network
    Madan M.
    Rani A.
    Bhateja N.
    [J]. SN Computer Science, 4 (3)
  • [2] Chinese Named Entity Recognition with Character-Word Mixed Embedding
    Shijia, E.
    Xiang, Yang
    [J]. CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 2055 - 2058
  • [3] Chinese Named Entity Recognition Based on Character-Word Vector Fusion
    Ye, Na
    Qin, Xin
    Dong, Lili
    Zhang, Xiang
    Sun, Kangkang
    [J]. WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2020, 2020
  • [4] A Chinese Named Entity Recognition Method Based on Fusion of Character and Word Features
    Chai, Wenguang
    Wang, Jiazhen
    [J]. 2022 IEEE 14TH INTERNATIONAL CONFERENCE ON ADVANCED INFOCOMM TECHNOLOGY (ICAIT 2022), 2022, : 308 - 313
  • [5] Chinese Named Entity Recognition Based on Gated Graph Neural Network
    Zhong, Qing
    Tang, Yan
    [J]. KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT I, 2021, 12815 : 604 - 613
  • [6] Enhanced character embedding for Chinese named entity recognition
    Jia, Bingjing
    Wu, Zhongli
    Wu, Bin
    Liu, Yutong
    Zhou, Pengpeng
    [J]. MEASUREMENT & CONTROL, 2020, 53 (9-10): : 1669 - 1681
  • [7] Chinese named entity recognition based on Heterogeneous Graph and Dynamic Attention Network
    Wang, Yuke
    Lu, Ling
    Yang, Wu
    Chen, Yinong
    [J]. 2023 IEEE 15TH INTERNATIONAL SYMPOSIUM ON AUTONOMOUS DECENTRALIZED SYSTEM, ISADS, 2023, : 91 - 98
  • [8] Leverage Lexical Knowledge for Chinese Named Entity Recognition via Collaborative Graph Network
    Sui, Dianbo
    Chen, Yubo
    Liu, Kang
    Zhao, Jun
    Liu, Shengping
    [J]. 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 3830 - 3840
  • [9] Chinese word segmentation and named entity recognition: A pragmatic approach
    Gao, JF
    Li, M
    Wu, A
    Huang, CN
    [J]. COMPUTATIONAL LINGUISTICS, 2005, 31 (04) : 531 - 574
  • [10] GWBNER: A named entity recognition method based on character glyph and word boundary features for Chinese EHRs
    Zhang, Jinsong
    Yu, Xiaomei
    Wang, Zhichao
    Zheng, Xiangwei
    [J]. JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2023, 35 (08)