Character level and word level embedding with bidirectional LSTM - Dynamic recurrent neural network for biomedical named entity recognition from literature

被引:23
|
作者
Gajendran, Sudhakaran [1 ]
Manjula, D. [1 ]
Sugumaran, Vijayan [2 ,3 ]
机构
[1] Anna Univ, Coll Engn Guindy, Dept Comp Sci & Engn, Chennai, Tamil Nadu, India
[2] Oakland Univ, Ctr Data Sci & Big Data Analyt, Rochester, MI 48063 USA
[3] Oakland Univ, Sch Business Adm, Dept Decis & Informat Sci, Rochester, MI 48063 USA
关键词
Biomedical named entity recognition; Embeddings; Deep neural networks; Bidirectional LSTM; Dynamic RNN; CRF; INFORMATION EXTRACTION; MACHINE; SYSTEM; TEXT;
D O I
10.1016/j.jbi.2020.103609
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Named Entity Recognition is the process of identifying different entities in a given context. Biomedical Named Entity Recognition (BNER) is the task of extracting chemical names from biomedical texts to support biomedical and translational research. The aim of the system is to extract useful chemical names from biomedical literature text without a lot of handcrafted engineering features. This approach introduces a novel neural network architecture with the composition of bidirectional long short-term memory (BLSTM), dynamic recurrent neural network (RNN) and conditional random field (CRF) that uses character level and word level embedding as the only features to identify the chemical entities. Using this approach we have achieved the F1 score of 89.98 on BioCreAtIvE II GM corpus and 90.84 on NCBI corpus by outperforming the existing systems. Our system is based on the deep neural architecture that uses both character and word level embedding which captures the morphological and orthographic information eliminating the need for handcrafted engineering features. The proposed system outperforms the existing systems without a lot of handcrafted engineering features. The embedding concept along with the bidirectional LSTM network proved to be an effective method to identify most of the chemical entities.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Character-level neural network for biomedical named entity recognition
    Gridach, Mourad
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2017, 70 : 85 - 91
  • [2] End-to-End Recurrent Neural Network Models for Vietnamese Named Entity Recognition: Word-Level Vs. Character-Level
    Thai-Hoang Pham
    Phuong Le-Hong
    [J]. COMPUTATIONAL LINGUISTICS, PACLING 2017, 2018, 781 : 219 - 232
  • [3] Bidirectional Recurrent Neural Network Approach for Arabic Named Entity Recognition
    Ali, Mohammed N. A.
    Tan, Guanzheng
    Hussain, Aamir
    [J]. FUTURE INTERNET, 2018, 10 (12):
  • [4] Chinese Named Entity Recognition with Character-Word Mixed Embedding
    Shijia, E.
    Xiang, Yang
    [J]. CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 2055 - 2058
  • [5] Disambiguation of biomedical acronyms based on a bidirectional recurrent neural network of character-level features
    Kai, Ren
    Na, Li
    Wei, Xiong
    Shi-Wen, Wang
    [J]. Journal of Engineering Science and Technology Review, 2019, 12 (06) : 105 - 112
  • [6] LSTM Recurrent Neural Networks for Cybersecurity Named Entity Recognition
    Gasmi, Houssem
    Bouras, Abdelaziz
    Laval, Jannik
    [J]. THIRTEENTH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING ADVANCES (ICSEA 2018), 2018, : 1 - 6
  • [7] Combinatorial feature embedding based on CNN and LSTM for biomedical named entity recognition
    Cho, Minsoo
    Ha, Jihwan
    Park, Chihyun
    Park, Sanghyun
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2020, 103
  • [8] Mongolian Named Entity Recognition with Bidirectional Recurrent Neural Networks
    Wang, Weihua
    Bao, Feilong
    Gao, Guanglai
    [J]. 2016 IEEE 28TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2016), 2016, : 495 - 500
  • [9] Disease named entity recognition from biomedical literature using a novel convolutional neural network
    Zhehuan Zhao
    Zhihao Yang
    Ling Luo
    Lei Wang
    Yin Zhang
    Hongfei Lin
    Jian Wang
    [J]. BMC Medical Genomics, 10
  • [10] Disease named entity recognition from biomedical literature using a novel convolutional neural network
    Zhao, Zhehuan
    Yang, Zhihao
    Luo, Ling
    Wang, Lei
    Zhang, Yin
    Lin, Hongfei
    Wang, Jian
    [J]. BMC MEDICAL GENOMICS, 2017, 10