Named entity recognition in electronic health records using transfer learning bootstrapped Neural Networks

Cited by: 58
Authors
Gligic, Luka [1]
Kormilitzin, Andrey [1]
Goldberg, Paul [1]
Nevado-Holgado, Alejo [1]
Affiliations
[1] Univ Oxford, Oxford, England
Funding
UK Medical Research Council
Keywords
Neural networks; NLP; Named entity recognition; Electronic health records; Transfer learning; LSTM; Patient smoking status; Medication information
DOI
10.1016/j.neunet.2019.08.032
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Neural networks (NNs) have become the state of the art in many machine learning applications, such as image and sound processing (LeCun et al., 2015) and natural language processing (Young et al., 2017; Linggard et al., 2012). However, the success of NNs remains dependent on the availability of large labelled datasets, which are scarce in domains such as electronic health records (EHRs). With scarce data, NNs are unlikely to extract the information hidden in EHR text with practical accuracy. In this study, we develop an approach that solves these problems for named entity recognition, obtaining an F1 score of 94.6 on the I2B2 2009 Medical Extraction Challenge (Uzuner et al., 2010), 4.3 points above the architecture that won the competition. To achieve this, we bootstrap our NN models through transfer learning: we pretrain word embeddings on a secondary task performed on a large pool of unannotated EHRs and use the resulting embeddings as the foundation of a range of NN architectures. Beyond the official I2B2 challenge, we further achieve an F1 score of 82.4 on extracting relationships between medical terms using attention-based seq2seq models bootstrapped in the same manner. Crown Copyright (C) 2019 Published by Elsevier Ltd. All rights reserved.
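The transfer step described in the abstract, reusing pretrained word embeddings as the starting point for a downstream tagger, can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `build_embedding_matrix`, the toy vectors, the lowercasing of lookup keys, and the random initialisation of unseen words are all assumptions made for illustration only.

```python
import random


def build_embedding_matrix(vocab, pretrained, dim, seed=0):
    """Return one vector per vocabulary word.

    Words found in `pretrained` reuse their pretrained vector (the transfer
    step); all other words get a small random vector (an assumed fallback).
    """
    rng = random.Random(seed)
    matrix = []
    for word in vocab:
        # Assumption: pretrained keys are lowercase, so look up case-insensitively.
        vector = pretrained.get(word.lower())
        if vector is None:
            # Word unseen during pretraining: small random initialisation.
            vector = [rng.uniform(-0.1, 0.1) for _ in range(dim)]
        elif len(vector) != dim:
            raise ValueError(f"expected dimension {dim} for {word!r}")
        matrix.append(list(vector))
    return matrix


# Hypothetical toy vectors standing in for embeddings pretrained on unannotated EHRs.
pretrained = {"aspirin": [0.2, -0.1, 0.7], "daily": [0.0, 0.5, -0.3]}
# Hypothetical vocabulary of the annotated downstream task.
vocab = ["<pad>", "Aspirin", "daily", "81mg"]

matrix = build_embedding_matrix(vocab, pretrained, dim=3)
```

In a full model, a matrix like this would typically initialise the embedding layer of the downstream network, which is then trained on the smaller annotated dataset.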
Pages: 132-139
Page count: 8
Related papers
50 in total
  • [1] Transfer learning for biomedical named entity recognition with neural networks
    Giorgi, John M.
    Bader, Gary D.
    BIOINFORMATICS, 2018, 34 (23) : 4087 - 4094
  • [2] Transfer Learning for Named-Entity Recognition with Neural Networks
    Lee, Ji Young
    Dernoncourt, Franck
    Szolovits, Peter
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 4470 - 4473
  • [3] Transfer Learning for Arabic Named Entity Recognition With Deep Neural Networks
    Al-Smadi, Mohammad
    Al-Zboon, Saad
    Jararweh, Yaser
    Juola, Patrick
    IEEE ACCESS, 2020, 8 : 37736 - 37745
  • [4] Named Entity Recognition for Chinese Electronic Medical Records Based on Multitask and Transfer Learning
    Guo, Wenming
    Lu, Junda
    Han, Fang
    IEEE ACCESS, 2022, 10 : 77375 - 77382
  • [5] Named Entity Recognition in Electronic Health Records: A Methodological Review
    Durango, Maria C.
    Torres-Silva, Ever A.
    Orozco-Duque, Andres
    HEALTHCARE INFORMATICS RESEARCH, 2023, 29 (04) : 286 - 300
  • [6] Clinical Named Entity Recognition From Chinese Electronic Health Records via Machine Learning Methods
    Zhang, Yu
    Wang, Xuwen
    Hou, Zhen
    Li, Jiao
    JMIR MEDICAL INFORMATICS, 2018, 6 (04) : 242 - 254
  • [7] Evaluation of clinical named entity recognition methods for Serbian electronic health records
    Kaplar, Aleksandar
    Stosovic, Milan
    Kaplar, Aleksandra
    Brkovic, Voin
    Naumovic, Radomir
    Kovacevic, Aleksandar
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2022, 164
  • [8] Named entity recognition for de-identifying Spanish electronic health records
    Moreno-Barea, Francisco J.
    López-García, Guillermo
    Mesa, Héctor
    Ribelles, Nuria
    Alba, Emilio
    Jerez, José M.
    Veredas, Francisco J.
    Computers in Biology and Medicine, 2025, 185
  • [9] A Named Entity Recognition System for Malayalam using Neural Networks
    Ajees, A. P.
    Idicula, Sumam Mary
    8TH INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING & COMMUNICATIONS (ICACC-2018), 2018, 143 : 962 - 969
  • [10] Neural negated entity recognition in Spanish electronic health records
    Santiso, Sara
    Perez, Alicia
    Casillas, Arantza
    Oronoz, Maite
    JOURNAL OF BIOMEDICAL INFORMATICS, 2020, 105