An Attention-Based BiLSTM-CRF Model for Chinese Clinic Named Entity Recognition

Cited by: 39
Authors
Wu, Guohua [1]
Tang, Guangen [2]
Wang, Zhongru [3, 4]
Zhang, Zhen [1]
Wang, Zhen [1]
Affiliations
[1] Hangzhou Dianzi Univ, Sch Cyberspace, Hangzhou 310018, Zhejiang, Peoples R China
[2] Hangzhou Dianzi Univ, Sch Comp Sci & Technol, Hangzhou 310018, Zhejiang, Peoples R China
[3] Beijing Univ Posts & Telecommun, Key Lab Trustworthy Distributed Comp & Serv BUPT, Minist Educ, Beijing 100876, Peoples R China
[4] Chinese Acad Cyberspace Studies, Beijing 100010, Peoples R China
Source
IEEE ACCESS | 2019 / Vol. 7
Keywords
Natural language processing; named entity recognition; neural networks; self-attention
DOI
10.1109/ACCESS.2019.2935223
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Clinic Named Entity Recognition (CNER) aims to recognize named entities such as body parts, diseases, and symptoms in Electronic Health Records (EHRs), which can benefit many intelligent biomedical systems. In recent years, increasing attention has been paid to end-to-end CNER with recurrent neural networks (RNNs), especially long short-term memory networks (LSTMs). However, capturing long-range dependencies remains a great challenge for RNNs. Moreover, Chinese presents additional challenges: it uses logograms instead of an alphabet, its words are often ambiguous, and it has no explicit word boundaries. In this work, we present a BiLSTM-CRF model with a self-attention mechanism (Att-BiLSTM-CRF) for the Chinese CNER task, which aims to address these problems. The self-attention mechanism can learn long-range dependencies by establishing a direct connection between each pair of characters. To learn more semantic information about Chinese characters, we propose a novel fine-grained character-level representation method. We also introduce part-of-speech (POS) labeling information into our model to capture the semantic information in the input sentence. We conduct experiments on the CCKS-2017 Shared Task 2 dataset to evaluate performance, and the experimental results indicate that our model outperforms other state-of-the-art methods.
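To illustrate the abstract's point that self-attention connects every character directly to every other one, here is a minimal sketch of scaled dot-product self-attention in plain Python. It is not the authors' implementation: the identity projections (no learned query, key, or value matrices), the function names `softmax` and `self_attention`, and the toy `hidden` vectors are all assumptions made for illustration. In the model described above, the inputs would presumably be hidden states from the BiLSTM layer.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(vectors):
    """Scaled dot-product self-attention with identity projections.

    vectors: list of equal-length float lists, one per character.
    Returns (outputs, weights), where weights[i][j] is how strongly
    character i attends to character j.
    """
    d = len(vectors[0])
    n = len(vectors)
    outputs, weights = [], []
    for q in vectors:
        # Score the query against every position, near or far.
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d) for k in vectors]
        w = softmax(scores)
        weights.append(w)
        # Each output is a weighted average of all input vectors.
        outputs.append([sum(w[j] * vectors[j][c] for j in range(n)) for c in range(d)])
    return outputs, weights


# Hypothetical hidden states for a 4-character sentence (made-up numbers).
hidden = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5], [1.0, 0.1]]
outputs, weights = self_attention(hidden)
```

Here `weights[0][3]` depends only on how similar the first and last vectors are, not on how far apart the characters sit in the sentence. A recurrent network, by contrast, has to carry that information through every intermediate step.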
Pages: 113942-113949
Page count: 8
Related papers
50 in total
  • [41] Named Entity Recognition by Using XLNet-BiLSTM-CRF
    Yan, Rongen
    Jiang, Xue
    Dang, Depeng
    [J]. NEURAL PROCESSING LETTERS, 2021, 53 (05) : 3339 - 3356
  • [42] NAMED ENTITY RECOGNITION IN THANGKA FIELD BASED ON BERT-BiLSTM-CRF-a
    Guo, Xiaoran
    Cheng, Sujie
    Wang, Weilan
    [J]. UNIVERSITY POLITEHNICA OF BUCHAREST SCIENTIFIC BULLETIN SERIES C-ELECTRICAL ENGINEERING AND COMPUTER SCIENCE, 2021, 83 (01) : 161 - 174
  • [43] Named entity recognition in thangka field based on bert-bilstm-crf-a
    Guo, Xiaoran
    Cheng, Sujie
    Wang, Weilan
    [J]. UPB Scientific Bulletin, Series C: Electrical Engineering and Computer Science, 2021, 83 (01) : 161 - 174
  • [44] An attention-based deep learning model for clinical named entity recognition of Chinese electronic medical records
    Li, Luqi
    Zhao, Jie
    Hou, Li
    Zhai, Yunkai
    Shi, Jinming
    Cui, Fangfang
    [J]. BMC MEDICAL INFORMATICS AND DECISION MAKING, 2019, 19 (01)
  • [45] An attention-based deep learning model for clinical named entity recognition of Chinese electronic medical records
    Luqi Li
    Jie Zhao
    Li Hou
    Yunkai Zhai
    Jinming Shi
    Fangfang Cui
    [J]. BMC Medical Informatics and Decision Making, 19
  • [46] Detecting Simultaneously Chinese Grammar Errors Based on a BiLSTM-CRF Model
    Liu, Yajun
    Zan, Hongying
    Zhong, Mengjie
    Ma, Hongchao
    [J]. NATURAL LANGUAGE PROCESSING TECHNIQUES FOR EDUCATIONAL APPLICATIONS, 2018, : 188 - 193
  • [47] Fusion of multiple features for Chinese Named Entity Recognition based on CRF model
    Zhang, Yuejie
    Xu, Zhiting
    Zhang, Tao
    [J]. INFORMATION RETRIEVAL TECHNOLOGY, 2008, 4993 : 95 - +
  • [48] Attention-based BiLSTM Network for Chinese Simile Recognition
    Guo, Jingjin
    Song, Wei
    Liu, Xianjun
    Liu, Lizhen
    Zhao, Xinlei
    [J]. PROCEEDINGS OF 2018 IEEE 9TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2018, : 144 - 147
  • [49] Chinese Clinical Entity Recognition via Attention-based CNN-LSTM-CRF
    Liu, Zengjian
    Wang, Xiaolong
    Chen, Qingcai
    Tang, Buzhou
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS WORKSHOPS (ICHI-W), 2018, : 68 - 69
  • [50] A BiLSTM-CRF Based Approach to Word Segmentation in Chinese
    Jin, Yuanyuan
    Tao, Shiyu
    Liu, Qi
    Liu, Xiaodong
    [J]. 2022 IEEE INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, INTL CONF ON CLOUD AND BIG DATA COMPUTING, INTL CONF ON CYBER SCIENCE AND TECHNOLOGY CONGRESS (DASC/PICOM/CBDCOM/CYBERSCITECH), 2022, : 568 - 571