A two-stage deep learning approach for extracting entities and relationships from medical texts

Cited by: 37
Authors
Suarez-Paniagua, Victor [1]
Rivera Zavala, Renzo M. [1]
Segura-Bedmar, Isabel [1]
Martinez, Paloma [1]
Affiliations
[1] Carlos III Univ Madrid, Comp Sci Dept, Madrid 28911, Spain
Keywords
Named entity recognition; Relation extraction; Deep learning; Health documents; PROTEINS
DOI
10.1016/j.jbi.2019.103285
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
This work presents a two-stage deep learning system for Named Entity Recognition (NER) and Relation Extraction (RE) from medical texts. These tasks are a crucial step in many natural language understanding applications in the biomedical domain. Automatic medical coding of electronic medical records, automated summarization of patient records, automatic cohort identification for clinical studies, text simplification of health documents for patients, early detection of adverse drug reactions, and automatic identification of risk factors are only a few examples of the opportunities that text analysis can offer in the clinical domain. In this work, our efforts are primarily directed towards improving the pharmacovigilance process through the automatic detection of drug-drug interactions (DDI) from texts. Moreover, we deal with the semantic analysis of texts containing health information for patients. Our two-stage approach is based on Deep Learning architectures. Concretely, NER is performed by combining a bidirectional Long Short-Term Memory (Bi-LSTM) network and a Conditional Random Field (CRF), while RE applies a Convolutional Neural Network (CNN). Since our approach uses very few language resources (only pre-trained word embeddings) and does not exploit any domain resources (such as dictionaries or ontologies), it can easily be extended to support other languages and clinical applications that require the exploitation of semantic information (concepts and relationships) from texts. In recent years, the task of DDI extraction has received great attention from the BioNLP community. However, the problem has traditionally been evaluated as two separate subtasks: drug name recognition and extraction of DDIs. To the best of our knowledge, this is the first work that provides an evaluation of the whole pipeline.
Moreover, our system obtains state-of-the-art results on the eHealth-KD challenge, which was part of the Workshop on Semantic Analysis at SEPLN (TASS-2018).
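The hand-off between the two stages described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the NER stage emits BIO tags (a common convention that the abstract does not state), hard-codes those tags instead of running a Bi-LSTM-CRF, and takes the relation classifier as a hypothetical callable where the paper uses a CNN. The names `decode_bio`, `candidate_pairs`, `extract_relations` and the label `"none"` are illustrative only.

```python
from itertools import combinations
from typing import Callable, List, Tuple

# (entity type, start token index, end token index exclusive)
Entity = Tuple[str, int, int]


def decode_bio(tags: List[str]) -> List[Entity]:
    """Turn a BIO tag sequence (assumed stage-1 output) into entity spans."""
    entities: List[Entity] = []
    start, etype = None, None
    for i, tag in enumerate(tags + ["O"]):  # sentinel "O" flushes the last span
        continues = tag.startswith("I-") and etype == tag[2:]
        if start is not None and not continues:
            entities.append((etype, start, i))
            start, etype = None, None
        # "B-" always opens a span; a stray "I-" opens one if none is active
        if tag.startswith("B-") or (tag.startswith("I-") and start is None):
            start, etype = i, tag[2:]
    return entities


def candidate_pairs(entities: List[Entity]) -> List[Tuple[Entity, Entity]]:
    """Every unordered pair of entities in a sentence is a stage-2 candidate."""
    return list(combinations(entities, 2))


def extract_relations(
    tokens: List[str],
    tags: List[str],
    classify: Callable[[List[str], Entity, Entity], str],
) -> List[Tuple[Entity, Entity, str]]:
    """Run stage 2 on the spans produced by stage 1, dropping 'none' pairs."""
    relations = []
    for e1, e2 in candidate_pairs(decode_bio(tags)):
        label = classify(tokens, e1, e2)  # placeholder for a trained classifier
        if label != "none":
            relations.append((e1, e2, label))
    return relations


# Toy sentence with hard-coded tags standing in for a trained tagger.
tokens = ["DrugA", "may", "interact", "with", "DrugB", "."]
tags = ["B-DRUG", "O", "O", "O", "B-DRUG", "O"]
# Dummy classifier that labels every candidate pair as an interaction.
relations = extract_relations(tokens, tags, lambda toks, a, b: "interaction")
```

Because the second stage only sees the spans produced by the first, any entity missed by the tagger cannot take part in a relation, which is why evaluating the whole pipeline differs from evaluating the two subtasks separately.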
Pages: 12