Character level and word level embedding with bidirectional LSTM - Dynamic recurrent neural network for biomedical named entity recognition from literature

被引:23
|
作者
Gajendran, Sudhakaran [1 ]
Manjula, D. [1 ]
Sugumaran, Vijayan [2 ,3 ]
机构
[1] Anna Univ, Coll Engn Guindy, Dept Comp Sci & Engn, Chennai, Tamil Nadu, India
[2] Oakland Univ, Ctr Data Sci & Big Data Analyt, Rochester, MI 48063 USA
[3] Oakland Univ, Sch Business Adm, Dept Decis & Informat Sci, Rochester, MI 48063 USA
关键词
Biomedical named entity recognition; Embeddings; Deep neural networks; Bidirectional LSTM; Dynamic RNN; CRF; INFORMATION EXTRACTION; MACHINE; SYSTEM; TEXT;
D O I
10.1016/j.jbi.2020.103609
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Named Entity Recognition is the process of identifying different entities in a given context. Biomedical Named Entity Recognition (BNER) is the task of extracting chemical names from biomedical texts to support biomedical and translational research. The aim of the system is to extract useful chemical names from biomedical literature text without a lot of handcrafted engineering features. This approach introduces a novel neural network architecture with the composition of bidirectional long short-term memory (BLSTM), dynamic recurrent neural network (RNN) and conditional random field (CRF) that uses character level and word level embedding as the only features to identify the chemical entities. Using this approach we have achieved the F1 score of 89.98 on BioCreAtIvE II GM corpus and 90.84 on NCBI corpus by outperforming the existing systems. Our system is based on the deep neural architecture that uses both character and word level embedding which captures the morphological and orthographic information eliminating the need for handcrafted engineering features. The proposed system outperforms the existing systems without a lot of handcrafted engineering features. The embedding concept along with the bidirectional LSTM network proved to be an effective method to identify most of the chemical entities.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] Incorporating token-level dictionary feature into neural model for named entity recognition
    Mu Xiaofeng
    Wang Wei
    Xu Aiping
    [J]. NEUROCOMPUTING, 2020, 375 : 43 - 50
  • [42] Recurrent neural networks with specialized word embeddings for health-domain named-entity recognition
    Unanue, Inigo Jauregi
    Borzeshi, Ehsan Zare
    Piccardi, Massimo
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2017, 76 : 102 - 109
  • [43] CHARACTER-LEVEL INCREMENTAL SPEECH RECOGNITION WITH RECURRENT NEURAL NETWORKS
    Hwang, Kyuyeon
    Sung, Wonyong
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5335 - 5339
  • [44] MAFN: multi-level attention fusion network for multimodal named entity recognition
    Xiaoying Zhou
    Yijia Zhang
    Zhuang Wang
    Mingyu Lu
    Xiaoxia Liu
    [J]. Multimedia Tools and Applications, 2024, 83 : 45047 - 45058
  • [45] Multi-level semantic fusion network for Chinese medical named entity recognition
    Shi, Jintong
    Sun, Mengxuan
    Sun, Zhengya
    Li, Mingda
    Gu, Yifan
    Zhang, Wensheng
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2022, 133
  • [46] DocBAN: An Efficient Biaffine Attention Network for Document-Level Named Entity Recognition
    Wu, Hao
    Li, Xianxian
    Yang, Danping
    Zhou, Aoxiang
    Wang, Peng
    Liu, Peng
    [J]. ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT III, ICIC 2024, 2024, 14877 : 65 - 76
  • [47] MAFN: multi-level attention fusion network for multimodal named entity recognition
    Zhou, Xiaoying
    Zhang, Yijia
    Wang, Zhuang
    Lu, Mingyu
    Liu, Xiaoxia
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (15) : 45047 - 45058
  • [48] Improving Named Entity Recognition in Vietnamese Texts by a Character-Level Deep Lifelong Learning Model
    Ngoc-Vu Nguyen
    Thi-Lan Nguyen
    Cam-Van Nguyen Thi
    Mai-Vu Tran
    Tri-Thanh Nguyen
    Quang-Thuy Ha
    [J]. VIETNAM JOURNAL OF COMPUTER SCIENCE, 2019, 6 (04) : 471 - 487
  • [49] Prompt-Based Word-Level Information Injection BERT for Chinese Named Entity Recognition
    He, Qiang
    Chen, Guowei
    Song, Wenchao
    Zhang, Pengzhou
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (05):
  • [50] Improving Bug Localization with Character-level Convolutional Neural Network and Recurrent Neural Network
    Xiao, Yan
    Keung, Jacky
    [J]. 2018 25TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE (APSEC 2018), 2018, : 703 - 704