Unsupervised entity and relation extraction from clinical records in Italian

被引:33
|
作者
Alicante, Anita [1 ]
Corazza, Anna [1 ]
Isgro, Francesco [1 ]
Silvestri, Stefano [1 ]
机构
[1] Univ Naples Federico II, DIETI, Dipartimento Ingn Elettr & Tecnol Informaz, Via Claudio 21, I-80125 Naples, Italy
关键词
Unsupervised learning; Relation clustering; Entity extraction; Medical information extraction; Entities relation discovery;
D O I
10.1016/j.compbiomed.2016.01.014
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
This paper proposes and discusses the use of text mining techniques for the extraction of information from clinical records written in Italian. However, as it is very difficult and expensive to obtain annotated material for languages different from English, we only consider unsupervised approaches, where no annotated training set is necessary. We therefore propose a complete system that is structured in two steps. In the first one domain entities are extracted from the clinical records by means of a metathesaurus and standard natural language processing tools. The second step attempts to discover relations between the entity pairs extracted from the whole set of clinical records. For this last step we investigate the performance of unsupervised methods such as clustering in the space of entity pairs, represented by an ad hoc feature vector. The resulting clusters are then automatically labelled by using the most significant features. The system has been tested on a fairly large data set of clinical records in Italian, investigating the variation in the performance adopting different similarity measures in the feature space. The results of our experiments show that the unsupervised approach proposed is promising and well suited for a semi-automatic labelling of the extracted relations. (C) 2016 Elsevier Ltd. All rights reserved.
引用
收藏
页码:263 / 275
页数:13
相关论文
共 50 条
  • [1] Unsupervised Information Extraction from Italian Clinical Records
    Alicante, Anita
    Corazza, Anna
    Isgro, Francesco
    Silvestri, Stefano
    [J]. INNOVATION IN MEDICINE AND HEALTHCARE 2014, 2014, 207 : 340 - 349
  • [2] Research of unsupervised entity relation extraction
    Liu, Yun
    Li, Mingxin
    Liu, Hui
    Cheng, Junjun
    Fu, Yanping
    [J]. Journal of Computers (Taiwan), 2019, 30 (01): : 31 - 41
  • [4] Structured Learning for Temporal Relation Extraction from Clinical Records
    Leeuwenberg, Artuur
    Moens, Marie-Francine
    [J]. 15TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2017), VOL 1: LONG PAPERS, 2017, : 1150 - 1158
  • [5] Unsupervised Relation Extraction from Web Documents
    Eichler, Kathrin
    Hemsen, Holmer
    Neumann, Guenter
    [J]. SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 1674 - 1679
  • [6] Modeling of joint extraction of entity relationships in clinical electronic medical records
    Yang, Yousen
    Tong, Jijun
    Zhou, Qingli
    [J]. Computers in Biology and Medicine, 2024, 182
  • [7] Entity Relation Extraction Based on Entity Indicators
    Qin, Yongbin
    Yang, Weizhe
    Wang, Kai
    Huang, Ruizhang
    Tian, Feng
    Ao, Shaolin
    Chen, Yanping
    [J]. SYMMETRY-BASEL, 2021, 13 (04):
  • [8] Unsupervised named-entity extraction from the Web: An experimental study
    Etzioni, O
    Cafarella, M
    Downey, D
    Popescu, AM
    Shaked, T
    Soderland, S
    Weld, DS
    Yates, A
    [J]. ARTIFICIAL INTELLIGENCE, 2005, 165 (01) : 91 - 134
  • [9] Entity relation extraction from electronic medical records based on improved annotation rules and BiLSTM-CRF
    Chen, Tingyin
    Hu, Yongmei
    [J]. ANNALS OF TRANSLATIONAL MEDICINE, 2021, 9 (18)
  • [10] Review of entity relation extraction
    Tuo, Meimei
    Yang, Wenzhong
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 44 (05) : 7391 - 7405