Named entity recognition in electronic health records using transfer learning bootstrapped Neural Networks

被引:58
|
作者
Gligic, Luka [1 ]
Kormilitzin, Andrey [1 ]
Goldberg, Paul [1 ]
Nevado-Holgado, Alejo [1 ]
机构
[1] Univ Oxford, Oxford, England
基金
英国医学研究理事会;
关键词
Neural Networks; NLP; Named entity recognition; Electronic health records; Transfer learning; LSTM; PATIENT SMOKING STATUS; MEDICATION INFORMATION;
D O I
10.1016/j.neunet.2019.08.032
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural networks (NNs) have become the state of the art in many machine learning applications, such as image, sound (LeCun et al., 2015) and natural language processing (Young et al., 2017; Linggard et al., 2012). However, the success of NNs remains dependent on the availability of large labelled datasets, such as in the case of electronic health records (EHRs). With scarce data, NNs are unlikely to be able to extract this hidden information with practical accuracy. In this study, we develop an approach that solves these problems for named entity recognition, obtaining 94.6 F1 score in I2B2 2009 Medical Extraction Challenge (Uzuner et al., 2010), 4.3 above the architecture that won the competition. To achieve this, we bootstrap our NN models through transfer learning by pretraining word embeddings on a secondary task performed on a large pool of unannotated EHRs and using the output embeddings as a foundation of a range of NN architectures. Beyond the official I2B2 challenge, we further achieve 82.4 F1 on extracting relationships between medical terms using attention-based seq2seq models bootstrapped in the same manner. Crown Copyright (C) 2019 Published by Elsevier Ltd. All rights reserved.
引用
收藏
页码:132 / 139
页数:8
相关论文
共 50 条
  • [21] A Multiclass Classification Method Based on Deep Learning for Named Entity Recognition in Electronic Medical Records
    Dong, Xishuang
    Qian, Lijun
    Guan, Yi
    Huang, Lei
    Yu, Qiubin
    Yang, Jinfeng
    2016 NEW YORK SCIENTIFIC DATA SUMMIT (NYSDS), 2016,
  • [22] Data Masking for Chinese Electronic Medical Records with Named Entity Recognition
    He, Tianyu
    Xu, Xiaolong
    Hu, Zhichen
    Zhao, Qingzhan
    Dai, Jianguo
    Dai, Fei
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2023, 36 (03): : 3657 - 3673
  • [23] Named Entity Recognition and Event Extraction in Chinese Electronic Medical Records
    Ma, Cheng
    Huang, Wenkang
    CCKS 2021 - EVALUATION TRACK, 2022, 1553 : 133 - 138
  • [24] Chinese electronic medical record named entity recognition algorithm based on transfer learning
    Li, Yi
    Liu, Jianyi
    Zhang, Ru
    BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2019, 125 : 19 - 20
  • [25] Named Entity Recognition in Chinese Electronic Medical Records Based on CRF
    Liu, Kaixin
    Hu, Qingcheng
    Liu, Jianwei
    Xing, Chunxiao
    2017 14TH WEB INFORMATION SYSTEMS AND APPLICATIONS CONFERENCE (WISA 2017), 2017, : 105 - 110
  • [26] A Hybrid Model for Named Entity Recognition on Chinese Electronic Medical Records
    Wang, Yu
    Sun, Yining
    Ma, Zuchang
    Gao, Lisheng
    Xu, Yang
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2021, 20 (02)
  • [27] Bootstrapped Text-level Named Entity Recognition for Literature
    Brooke, Julian
    Baldwin, Timothy
    Hammond, Adam
    PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2016), VOL 2, 2016, : 344 - 350
  • [28] Named Entity Recognition in Semi Structured Documents Using Neural Tensor Networks
    Shehzad, Khurram
    Ul-Hasan, Adnan
    Malik, Muhammad Imran
    Shafait, Faisal
    DOCUMENT ANALYSIS SYSTEMS, 2020, 12116 : 398 - 409
  • [29] Biomedical named entity recognition using deep neural networks with contextual information
    Hyejin Cho
    Hyunju Lee
    BMC Bioinformatics, 20
  • [30] Biomedical named entity recognition using deep neural networks with contextual information
    Cho, Hyejin
    Lee, Hyunju
    BMC BIOINFORMATICS, 2019, 20 (01)