Named entity recognition in electronic health records using transfer learning bootstrapped Neural Networks

被引:58
|
作者
Gligic, Luka [1 ]
Kormilitzin, Andrey [1 ]
Goldberg, Paul [1 ]
Nevado-Holgado, Alejo [1 ]
机构
[1] Univ Oxford, Oxford, England
基金
英国医学研究理事会;
关键词
Neural Networks; NLP; Named entity recognition; Electronic health records; Transfer learning; LSTM; PATIENT SMOKING STATUS; MEDICATION INFORMATION;
D O I
10.1016/j.neunet.2019.08.032
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural networks (NNs) have become the state of the art in many machine learning applications, such as image, sound (LeCun et al., 2015) and natural language processing (Young et al., 2017; Linggard et al., 2012). However, the success of NNs remains dependent on the availability of large labelled datasets, such as in the case of electronic health records (EHRs). With scarce data, NNs are unlikely to be able to extract this hidden information with practical accuracy. In this study, we develop an approach that solves these problems for named entity recognition, obtaining 94.6 F1 score in I2B2 2009 Medical Extraction Challenge (Uzuner et al., 2010), 4.3 above the architecture that won the competition. To achieve this, we bootstrap our NN models through transfer learning by pretraining word embeddings on a secondary task performed on a large pool of unannotated EHRs and using the output embeddings as a foundation of a range of NN architectures. Beyond the official I2B2 challenge, we further achieve 82.4 F1 on extracting relationships between medical terms using attention-based seq2seq models bootstrapped in the same manner. Crown Copyright (C) 2019 Published by Elsevier Ltd. All rights reserved.
引用
收藏
页码:132 / 139
页数:8
相关论文
共 50 条
  • [31] Transfer Bi-directional LSTM RNN for Named Entity Recognition in Chinese Electronic Medical Records
    Dong, Xishuang
    Chowdhury, Shanta
    Qian, Lijun
    Guan, Yi
    Yang, Jinfeng
    Yu, Qiubin
    2017 IEEE 19TH INTERNATIONAL CONFERENCE ON E-HEALTH NETWORKING, APPLICATIONS AND SERVICES (HEALTHCOM), 2017,
  • [32] A Named Entity Recognition Approach for Electronic Medical Records Using BERT Semantic Enhancement and BiLSTM
    Lai, Xuewei
    Jie, Qingqing
    INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2023, 19 (01)
  • [33] Clinical Named Entity Recognition from Chinese Electronic Medical Records Based on Deep Learning Pretraining
    Gong, Lejun
    Zhang, Zhifei
    Chen, Shiqi
    JOURNAL OF HEALTHCARE ENGINEERING, 2020, 2020
  • [34] Mongolian Named Entity Recognition with Bidirectional Recurrent Neural Networks
    Wang, Weihua
    Bao, Feilong
    Gao, Guanglai
    2016 IEEE 28TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2016), 2016, : 495 - 500
  • [35] Can Synthetic Text Help Clinical Named Entity Recognition? A Study of Electronic Health Records in French
    Hiebel, Nicolas
    Ferret, Olivier
    Fort, Karen
    Neveol, Aurelie
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 2320 - 2338
  • [36] Named entity recognition over electronic health records through a combined dictionary-based approach
    Pomares Quimbaya, Alexandra
    Sierra Munera, Alejandro
    Gonzalez Rivera, Rafael Andres
    Daza Rodriguez, Julian Camilo
    Munoz Velandia, Oscar Mauricio
    Garcia Pena, Angel Alberto
    Labbe, Cyril
    INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS/INTERNATIONAL CONFERENCE ON PROJECT MANAGEMENT/INTERNATIONAL CONFERENCE ON HEALTH AND SOCIAL CARE INFORMATION SYSTEMS AND TECHNOLOGIES, CENTERIS/PROJMAN / HCIST 2016, 2016, 100 : 55 - 61
  • [37] LSTM Recurrent Neural Networks for Cybersecurity Named Entity Recognition
    Gasmi, Houssem
    Bouras, Abdelaziz
    Laval, Jannik
    THIRTEENTH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING ADVANCES (ICSEA 2018), 2018, : 1 - 6
  • [38] Coreference Aware Representation Learning for Neural Named Entity Recognition
    Dai, Zeyu
    Fei, Hongliang
    Li, Ping
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 4946 - 4953
  • [39] Combined Attention Mechanism for Named Entity Recognition in Chinese Electronic Medical Records
    Li, Luqi
    Hou, Li
    2019 IEEE INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI), 2019, : 476 - 477
  • [40] A weakly supervised method for named entity recognition of Chinese electronic medical records
    Meng Li
    Chunrong Gao
    Kuang Zhang
    Huajian Zhou
    Jing Ying
    Medical & Biological Engineering & Computing, 2023, 61 : 2733 - 2743