Character level and word level embedding with bidirectional LSTM - Dynamic recurrent neural network for biomedical named entity recognition from literature

被引:23
|
作者
Gajendran, Sudhakaran [1 ]
Manjula, D. [1 ]
Sugumaran, Vijayan [2 ,3 ]
机构
[1] Anna Univ, Coll Engn Guindy, Dept Comp Sci & Engn, Chennai, Tamil Nadu, India
[2] Oakland Univ, Ctr Data Sci & Big Data Analyt, Rochester, MI 48063 USA
[3] Oakland Univ, Sch Business Adm, Dept Decis & Informat Sci, Rochester, MI 48063 USA
关键词
Biomedical named entity recognition; Embeddings; Deep neural networks; Bidirectional LSTM; Dynamic RNN; CRF; INFORMATION EXTRACTION; MACHINE; SYSTEM; TEXT;
D O I
10.1016/j.jbi.2020.103609
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Named Entity Recognition is the process of identifying different entities in a given context. Biomedical Named Entity Recognition (BNER) is the task of extracting chemical names from biomedical texts to support biomedical and translational research. The aim of the system is to extract useful chemical names from biomedical literature text without a lot of handcrafted engineering features. This approach introduces a novel neural network architecture with the composition of bidirectional long short-term memory (BLSTM), dynamic recurrent neural network (RNN) and conditional random field (CRF) that uses character level and word level embedding as the only features to identify the chemical entities. Using this approach we have achieved the F1 score of 89.98 on BioCreAtIvE II GM corpus and 90.84 on NCBI corpus by outperforming the existing systems. Our system is based on the deep neural architecture that uses both character and word level embedding which captures the morphological and orthographic information eliminating the need for handcrafted engineering features. The proposed system outperforms the existing systems without a lot of handcrafted engineering features. The embedding concept along with the bidirectional LSTM network proved to be an effective method to identify most of the chemical entities.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] Terminologies augmented recurrent neural network model for clinical named entity recognition
    Lerner, Ivan
    Paris, Nicolas
    Tannier, Xavier
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2020, 102
  • [32] Disease named entity recognition by combining conditional random fields and bidirectional recurrent neural networks
    Wei, Qikang
    Chen, Tao
    Xu, Ruifeng
    He, Yulan
    Gui, Lin
    [J]. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2016, : 1 - 8
  • [33] Chinese Named Entity Recognition with Character-Level BLSTM and Soft Attention Model
    Jize Yin
    Senlin Luo
    Zhouting Wu
    Limin Pan
    [J]. Journal of Beijing Institute of Technology, 2020, 29 (01) : 60 - 71
  • [34] Chinese Named Entity Recognition with Character-Level BLSTM and Soft Attention Model
    Yin, Jize
    Luo, Senlin
    Wu, Zhouting
    Pan, Limin
    [J]. Journal of Beijing Institute of Technology (English Edition), 2020, 29 (01): : 60 - 71
  • [35] A neural network multi-task learning approach to biomedical named entity recognition
    Crichton, Gamal
    Pyysalo, Sampo
    Chiu, Billy
    Korhonen, Anna
    [J]. BMC BIOINFORMATICS, 2017, 18
  • [36] A neural network multi-task learning approach to biomedical named entity recognition
    Gamal Crichton
    Sampo Pyysalo
    Billy Chiu
    Anna Korhonen
    [J]. BMC Bioinformatics, 18
  • [37] Chinese Clinical Named Entity Recognition with Word-Level Information Incorporating Dictionaries
    Lu, Ningjie
    Zheng, Jun
    Wu, Wen
    Yang, Yan
    Chen, Kaiwei
    Hu, Wenxin
    [J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [38] Effective integration of morphological analysis and named entity recognition based on a recurrent neural network
    Lee, Hyeon-gu
    Park, Geonwoo
    Kim, Harksoo
    [J]. PATTERN RECOGNITION LETTERS, 2018, 112 : 361 - 365
  • [39] A Character-Level Deep Lifelong Learning Model for Named Entity Recognition in Vietnamese Text
    Ngoc-Vu Nguyen
    Thi-Lan Nguyen
    Cam-Van Nguyen Thi
    Mai-Vu Tran
    Quang-Thuy Ha
    [J]. INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2019, PT I, 2019, 11431 : 90 - 102
  • [40] Neural Chinese Named Entity Recognition via CNN-LSTM-CRF and Joint Training with Word Segmentation
    Wu, Fangzhao
    Liu, Junxin
    Wu, Chuhan
    Huang, Yongfeng
    Xie, Xing
    [J]. WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019), 2019, : 3342 - 3348