An Attention-Based BiLSTM-CRF Model for Chinese Clinic Named Entity Recognition

被引:39
|
作者
Wu, Guohua [1 ]
Tang, Guangen [2 ]
Wang, Zhongru [3 ,4 ]
Zhang, Zhen [1 ]
Wang, Zhen [1 ]
机构
[1] Hangzhou Dianzi Univ, Sch Cyberspace, Hangzhou 310018, Zhejiang, Peoples R China
[2] Hangzhou Dianzi Univ, Sch Comp Sci & Technol, Hangzhou 310018, Zhejiang, Peoples R China
[3] Beijing Univ Posts & Telecommun, Key Lab Trustworthy Distributed Comp & Serv BUPT, Minist Educ, Beijing 100876, Peoples R China
[4] Chinese Acad Cyberspace Studies, Beijing 100010, Peoples R China
来源
IEEE ACCESS | 2019年 / 7卷
关键词
Natural language processing; named entity recognition; neural networks; self-attention;
D O I
10.1109/ACCESS.2019.2935223
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Clinic Named Entity Recognition (CNER) aims to recognize named entities such as body part, disease and symptom from Electronic Health Records (EHRs), which can benefit many intelligent biomedical systems. In recent years, more and more attention has been paid to the end-to-end CNER with recurrent neural networks (RNNs), especially for long short-term memory networks (LSTMs). However, it remains a great challenge for RNNs to capture long range dependencies. Moreover, Chinese presents additional challenges, since it uses logograms instead of alphabets, the ambiguities of Chinese word and has no word boundaries. In this work, we present a BiLSTM-CRF with self-attention mechanism (Att-BiLSTM-CRF) model for Chinese CNER task, which aims to address these problems. Self-attention mechanism can learn long range dependencies by establishing a direct connection between each character. In order to learn more semantic information about Chinese characters, we propose a novel fine-grained character-level representation method. We also introduce part-of-speech (POS) labeling information about our model to capture the semantic information in input sentence. We conduct the experiment by using CCKS-2017 Shared Task 2 dataset to evaluate performance, and the experimental results indicated that our model outperforms other state-of-the-art methods.
引用
收藏
页码:113942 / 113949
页数:8
相关论文
共 50 条
  • [1] An attention-based BiLSTM-CRF approach to document-level chemical named entity recognition
    Luo, Ling
    Yang, Zhihao
    Yang, Pei
    Zhang, Yin
    Wang, Lei
    Lin, Hongfei
    Wang, Jian
    [J]. BIOINFORMATICS, 2018, 34 (08) : 1381 - 1388
  • [2] Named Entity Recognition From Biomedical Texts Using a Fusion Attention-Based BiLSTM-CRF
    Wei, Hao
    Gao, Mingyuan
    Zhou, Ai
    Chen, Fei
    Qu, Wen
    Wang, Chunli
    Lu, Mingyu
    [J]. IEEE ACCESS, 2019, 7 : 73627 - 73636
  • [3] Named Entity Recognition of Traditional Chinese Medicine Patents Based on BiLSTM-CRF
    Deng, Na
    Fu, Hao
    Chen, Xu
    [J]. WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2021, 2021
  • [4] Document-level attention-based BiLSTM-CRF incorporating disease dictionary for disease named entity recognition
    Xu, Kai
    Yang, Zhenguo
    Kang, Peipei
    Wang, Qi
    Liu, Wenyin
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2019, 108 : 122 - 132
  • [5] BiLSTM-CRF for Persian Named-Entity Recognition
    Poostchi, Hanieh
    Borzeshi, Ehsan Zare
    Piccardi, Massimo
    [J]. PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 4427 - 4431
  • [6] Drug Specification Named Entity Recognition base on BiLSTM-CRF Model
    Li, Wei-Yan
    Song, Wen-Ai
    Jia, Xin-Hong
    Yang, Ji-Jiang
    Wang, Qing
    Lei, Yi
    Huang, Ke
    Li, Jun
    Yang, Ting
    [J]. 2019 IEEE 43RD ANNUAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC), VOL 2, 2019, : 429 - 433
  • [7] A BiLSTM-CRF Method to Chinese Electronic Medical Record Named Entity Recognition
    Ji, Bin
    Liu, Rui
    Li, ShaSha
    Tang, JinTao
    Yu, Jie
    Li, Qian
    Xu, WeiSang
    [J]. 2018 INTERNATIONAL CONFERENCE ON ALGORITHMS, COMPUTING AND ARTIFICIAL INTELLIGENCE (ACAI 2018), 2018,
  • [8] Arabic named entity recognition in social media based on BiLSTM-CRF using an attention mechanism
    Benali, B. Ait
    Mihi, S.
    Mlouk, A. Ait
    El Bazi, I
    Laachfoubi, N.
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 42 (06) : 5427 - 5436
  • [9] Chinese clinical named entity recognition via multi-head self-attention based BiLSTM-CRF
    An, Ying
    Xia, Xianyun
    Chen, Xianlai
    Wu, Fang-Xiang
    Wang, Jianxin
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, 2022, 127
  • [10] Named entity recognition of agricultural based entity-level masking BERT and BiLSTM-CRF
    Wei, Zijun
    Song, Ling
    Hu, Xiaochun
    Chen, Ningjiang
    [J]. Nongye Gongcheng Xuebao/Transactions of the Chinese Society of Agricultural Engineering, 2022, 38 (15): : 195 - 203