Biomedical Named-Entity Recognition by Hierarchically Fusing BioBERT Representations and Deep Contextual-Level Word-Embedding

被引:14
|
作者
Naseem, Usman [1 ]
Musial, Katarzyna [1 ]
Eklund, Peter [2 ]
Prasad, Mukesh [1 ]
机构
[1] Univ Technol Sydney, Sch Comp Sci, Sydney, NSW, Australia
[2] Deakin Univ, Sch Informat Technol, Geelong, Vic, Australia
关键词
NER; Name Entity; BioBert; Deep Contextual Embedding; Twitter; Disorders;
D O I
10.1109/ijcnn48605.2020.9206808
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text mining in the biomedical domain is increasingly important as the volume of biomedical documents increases. Thanks to advances in natural language processing (NLP), extracting valuable information from the biomedical literature is gaining popularity among researchers, and deep learning has enabled the development of effective biomedical text mining models. However, directly applying advancements in NLP to biomedical sources often yields unsatisfactory results, due to a word distribution drift from the general language domain corpora to specific biomedical corpora, and this drift introduces linguistic ambiguities. To overcome these challenges, this paper presents a novel method for biomedical named entity-recognition (BioNER) through hierarchically fusing representations from BioBERT, which is trained on biomedical corpora and Deep contextual-level word embeddings to handle the linguistic challenges within biomedical literature. Proposed text representation is then fed to attention-based Bi-directional Long Short Term Memory (BiLSTM) with Conditional random field (CRF) for the BioNER task. The experimental analysis shows that our proposed end-to-end methodology outperforms existing state-of-the-art methods for the BioNER task.
引用
收藏
页数:8
相关论文
共 13 条
  • [1] Deep Learning with Word Embedding Improves Kazakh Named-Entity Recognition
    Haisa, Gulizada
    Altenbek, Gulila
    INFORMATION, 2022, 13 (04)
  • [2] Named Entity Recognition Using Word-Embedding Techniques for ArabicWeb16: An Empirical Study
    Al-Ghamdi, Sharefah
    Al-Duwais, Mashael
    Al-Khalifa, Hend
    Al-Salman, Abdulmalik
    2018 21ST SAUDI COMPUTER SOCIETY NATIONAL COMPUTER CONFERENCE (NCC), 2018,
  • [3] Deep learning with word embeddings improves biomedical named entity recognition
    Habibi, Maryam
    Weber, Leon
    Neves, Mariana
    Wiegandt, David Luis
    Leser, Ulf
    BIOINFORMATICS, 2017, 33 (14) : I37 - I48
  • [4] Incorporating domain knowledge in chemical and biomedical named entity recognition with word representations
    Tsendsuren Munkhdalai
    Meijing Li
    Khuyagbaatar Batsuren
    Hyeon Ah Park
    Nak Hyeon Choi
    Keun Ho Ryu
    Journal of Cheminformatics, 7
  • [5] Incorporating domain knowledge in chemical and biomedical named entity recognition with word representations
    Munkhdalai, Tsendsuren
    Li, Meijing
    Batsuren, Khuyagbaatar
    Park, Hyeon Ah
    Choi, Nak Hyeon
    Ryu, Keun Ho
    JOURNAL OF CHEMINFORMATICS, 2015, 7
  • [6] Biomedical named entity recognition using deep neural networks with contextual information
    Hyejin Cho
    Hyunju Lee
    BMC Bioinformatics, 20
  • [7] Biomedical named entity recognition using deep neural networks with contextual information
    Cho, Hyejin
    Lee, Hyunju
    BMC BIOINFORMATICS, 2019, 20 (01)
  • [8] Character level and word level embedding with bidirectional LSTM - Dynamic recurrent neural network for biomedical named entity recognition from literature
    Gajendran, Sudhakaran
    Manjula, D.
    Sugumaran, Vijayan
    JOURNAL OF BIOMEDICAL INFORMATICS, 2020, 112
  • [9] NERO: a biomedical named-entity (recognition) ontology with a large, annotated corpus reveals meaningful associations through text embedding
    Kanix Wang
    Robert Stevens
    Halima Alachram
    Yu Li
    Larisa Soldatova
    Ross King
    Sophia Ananiadou
    Annika M. Schoene
    Maolin Li
    Fenia Christopoulou
    José Luis Ambite
    Joel Matthew
    Sahil Garg
    Ulf Hermjakob
    Daniel Marcu
    Emily Sheng
    Tim Beißbarth
    Edgar Wingender
    Aram Galstyan
    Xin Gao
    Brendan Chambers
    Weidi Pan
    Bohdan B. Khomtchouk
    James A. Evans
    Andrey Rzhetsky
    npj Systems Biology and Applications, 7
  • [10] A Multichannel Biomedical Named Entity Recognition Model Based on Multitask Learning and Contextualized Word Representations
    Wei, Hao
    Gao, Mingyuan
    Zhou, Ai
    Chen, Fei
    Qu, Wen
    Zhang, Yijia
    Lu, Mingyu
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2020, 2020