Information Extraction from Electronic Medical Records Using Multitask Recurrent Neural Network with Contextual Word Embedding

被引:25
|
作者
Yang, Jianliang [1 ]
Liu, Yuenan [1 ]
Qian, Minghui [1 ]
Guan, Chenghua [2 ]
Yuan, Xiangfei [2 ]
机构
[1] Renmin Univ China, Sch Informat Resource Management, 59 Zhongguancun Ave, Beijing 100872, Peoples R China
[2] Beijing Normal Univ, Sch Econ & Resource Management, 19 Xinjiekou Outer St, Beijing 100875, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2019年 / 9卷 / 18期
关键词
clinical named entity recognition; information extraction; multitask model; long short-term memory; conditional random field;
D O I
10.3390/app9183658
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Clinical named entity recognition is an essential task for humans to analyze large-scale electronic medical records efficiently. Traditional rule-based solutions need considerable human effort to build rules and dictionaries; machine learning-based solutions need laborious feature engineering. For the moment, deep learning solutions like Long Short-term Memory with Conditional Random Field (LSTM-CRF) achieved considerable performance in many datasets. In this paper, we developed a multitask attention-based bidirectional LSTM-CRF (Att-biLSTM-CRF) model with pretrained Embeddings from Language Models (ELMo) in order to achieve better performance. In the multitask system, an additional task named entity discovery was designed to enhance the model's perception of unknown entities. Experiments were conducted on the 2010 Informatics for Integrating Biology & the Bedside/Veterans Affairs (I2B2/VA) dataset. Experimental results show that our model outperforms the state-of-the-art solution both on the single model and ensemble model. Our work proposes an approach to improve the recall in the clinical named entity recognition task based on the multitask mechanism.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Contextual embedding bootstrapped neural network for medical information extraction of coronary artery disease records
    Xingxing Cen
    Junyi Yuan
    Changqing Pan
    Qinhua Tang
    Qunsheng Ma
    [J]. Medical & Biological Engineering & Computing, 2021, 59 : 1111 - 1121
  • [2] Contextual embedding bootstrapped neural network for medical information extraction of coronary artery disease records
    Cen, Xingxing
    Yuan, Junyi
    Pan, Changqing
    Tang, Qinhua
    Ma, Qunsheng
    [J]. MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2021, 59 (05) : 1111 - 1121
  • [3] Natural Language Inference for Arabic using Recurrent Neural Network and Word Embedding
    Bensghaier, Mabrouka
    Bakari, Wided
    Neji, Mahmoud
    [J]. 2022 IEEE/ACS 19TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2022,
  • [4] Impact analysis of keyword extraction using contextual word embedding
    Khan, Muhammad Qasim
    Shahid, Abdul
    Uddin, M. Irfan
    Roman, Muhammad
    Alharbi, Abdullah
    Alosaimi, Wael
    Almalki, Jameel
    Alshahrani, Saeed M.
    [J]. PEERJ COMPUTER SCIENCE, 2022, 8
  • [5] WORD EMBEDDING FOR RECURRENT NEURAL NETWORK BASED TTS SYNTHESIS
    Wang, Peilu
    Qian, Yao
    Soong, Frank K.
    He, Lei
    Zhao, Hai
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4879 - 4883
  • [6] A Word Embedding Based Approach for Focused Web Crawling Using the Recurrent Neural Network
    Dhanith, P. R. Joe
    Surendiran, B.
    Raja, S. P.
    [J]. INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2021, 6 (06): : 122 - 132
  • [7] Learning to Recognize Protected Health Information in Electronic Health Records with Recurrent Neural Network
    Li, Kun
    Chai, Yumei
    Zhao, Hongling
    Nan, Xiaofei
    Zhao, Yueshu
    [J]. NATURAL LANGUAGE UNDERSTANDING AND INTELLIGENT APPLICATIONS (NLPCC 2016), 2016, 10102 : 575 - 582
  • [8] Information extraction of Chinese medical electronic records via evolutionary neural architecture search
    Zhang, Tian
    Li, Nan
    Zhou, Yuee
    Cai, Wei
    Ma, Lianbo
    [J]. 2023 23RD IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS, ICDMW 2023, 2023, : 396 - 405
  • [9] Information Extraction for Intestinal Cancer Electronic Medical Records
    Wang, Sufen
    Pang, Minmin
    Pan, Changqing
    Yuan, Junyi
    Xu, Bo
    Du, Ming
    Zhang, Hong
    [J]. IEEE ACCESS, 2020, 8 : 125923 - 125934
  • [10] Word Embedding Bootstrapped Deep Active Learning Method to Information Extraction on Chinese Electronic Medical Record
    Ma Q.
    Cen X.
    Yuan J.
    Hou X.
    [J]. Journal of Shanghai Jiaotong University (Science), 2021, 26 (04) : 494 - 502