Named entity recognition in electronic health records using transfer learning bootstrapped Neural Networks

被引:58
|
作者
Gligic, Luka [1 ]
Kormilitzin, Andrey [1 ]
Goldberg, Paul [1 ]
Nevado-Holgado, Alejo [1 ]
机构
[1] Univ Oxford, Oxford, England
基金
英国医学研究理事会;
关键词
Neural Networks; NLP; Named entity recognition; Electronic health records; Transfer learning; LSTM; PATIENT SMOKING STATUS; MEDICATION INFORMATION;
D O I
10.1016/j.neunet.2019.08.032
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural networks (NNs) have become the state of the art in many machine learning applications, such as image, sound (LeCun et al., 2015) and natural language processing (Young et al., 2017; Linggard et al., 2012). However, the success of NNs remains dependent on the availability of large labelled datasets, such as in the case of electronic health records (EHRs). With scarce data, NNs are unlikely to be able to extract this hidden information with practical accuracy. In this study, we develop an approach that solves these problems for named entity recognition, obtaining 94.6 F1 score in I2B2 2009 Medical Extraction Challenge (Uzuner et al., 2010), 4.3 above the architecture that won the competition. To achieve this, we bootstrap our NN models through transfer learning by pretraining word embeddings on a secondary task performed on a large pool of unannotated EHRs and using the output embeddings as a foundation of a range of NN architectures. Beyond the official I2B2 challenge, we further achieve 82.4 F1 on extracting relationships between medical terms using attention-based seq2seq models bootstrapped in the same manner. Crown Copyright (C) 2019 Published by Elsevier Ltd. All rights reserved.
引用
收藏
页码:132 / 139
页数:8
相关论文
共 50 条
  • [41] Ensemble Transfer Learning on Augmented Domain Resources for Oncological Named Entity Recognition in Chinese Clinical Records
    Zhou, Meifeng
    Tan, Jindian
    Yang, Song
    Wang, Haixia
    Wang, Lin
    Xiao, Zhifeng
    IEEE ACCESS, 2023, 11 : 80416 - 80428
  • [42] A weakly supervised method for named entity recognition of Chinese electronic medical records
    Li, Meng
    Gao, Chunrong
    Zhang, Kuang
    Zhou, Huajian
    Ying, Jing
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2023, 61 (10) : 2733 - 2743
  • [43] Coreference aware representation learning for neural named entity recognition
    Dai, Zeyu
    Fei, Hongliang
    Li, Ping
    IJCAI International Joint Conference on Artificial Intelligence, 2019, 2019-August : 4946 - 4953
  • [44] Transfer learning for Turkish named entity recognition on noisy text
    Kagan Akkaya, Emre
    Can, Burcu
    NATURAL LANGUAGE ENGINEERING, 2021, 27 (01) : 35 - 64
  • [45] Transfer Learning for Named Entity Recognition in Financial and Biomedical Documents
    Francis, Sumam
    Van Landeghem, Jordy
    Moens, Marie-Francine
    INFORMATION, 2019, 10 (08)
  • [46] Improving Clinical Named-Entity Recognition with Transfer Learning
    Zhang, Edmond
    Thurier, Quentin
    Boyle, Luke
    CONNECTING THE SYSTEM TO ENHANCE THE PRACTITIONER AND CONSUMER EXPERIENCE IN HEALTHCARE, 2018, 252 : 182 - 187
  • [47] Hierarchical shared transfer learning for biomedical named entity recognition
    Zhaoying Chai
    Han Jin
    Shenghui Shi
    Siyan Zhan
    Lin Zhuo
    Yu Yang
    BMC Bioinformatics, 23
  • [48] GHHT at CALCS 2018: Named Entity Recognition for Dialectal Arabic Using Neural Networks
    Attia, Mohammed
    Samih, Younes
    Maier, Wolfgang
    COMPUTATIONAL APPROACHES TO LINGUISTIC CODE-SWITCHING, 2018, : 98 - 102
  • [49] Deep learning for named entity recognition on Chinese electronic medical records: Combining deep transfer learning with multitask bi-directional LSTM RNN
    Dong, Xishuang
    Chowdhury, Shanta
    Qian, Lijun
    Li, Xiangfang
    Guan, Yi
    Yang, Jinfeng
    Yu, Qiubin
    PLOS ONE, 2019, 14 (05):
  • [50] Weakly Supervised Named Entity Recognition for Carbon Storage Using Deep Neural Networks
    Londono, Rene Gomez
    Wlodarczyk, Sylvain
    Arman, Molood
    Bugiotti, Francesca
    Seghouani, Nacera Bennacer
    DISCOVERY SCIENCE (DS 2022), 2022, 13601 : 227 - 242