Evaluation of semantic similarity metrics applied to the automatic retrieval of medical documents: An UMLS approach

被引:20
|
作者
Alonso, Israel [1 ]
Contreras, David [1 ]
机构
[1] Comillas Pontifical Univ, Dept Telemat & Comp Sci, C Alberto Aguilera 25, Madrid 28015, Spain
关键词
Semantic similarity; Information retrieval; Electronic Health Record; UMLS; ELECTRONIC HEALTH RECORDS; QUERY EXPANSION; INFORMATION-CONTENT; BIOMEDICAL DOMAIN; LANGUAGE SYSTEM; KNOWLEDGE; RELATEDNESS; TEXT;
D O I
10.1016/j.eswa.2015.09.028
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One promise of current information retrieval systems is the capability to identify risk groups for certain diseases and pathologies based on the automatic analysis of vast amounts of Electronic Medical Records repositories. However, the complexity and the degree of specialization of the language used by the experts in this context, make this task both challenging and complex. In this work, we introduce a novel experimental study to evaluate the performance of the two semantic similarity metrics (Path and Intrinsic IC-Path, both widely accepted in the literature) in a real-life information retrieval situation. In order to achieve this goal and due to the lack of methodologies for this context in the literature, we propose a straightforward information retrieval system for the biomedical field based on the UMLS Metathesaurus and on semantic similarity metrics. In contrast with previous studies which focus on testbeds with limited and controlled sets of concepts, we use a large amount of information (101,712 medical documents extracted from TREC Medical Records Track 2011). Our results show that in real-life cases, both metrics display similar performance, Path (F-Measure = 0.430) e Intrinsic IC-Path (F-Measure = 0.427). Thereby we suggest that the use of Intrinsic IC-Path is not justified in real scenarios. (C) 2015 Elsevier Ltd. All rights reserved.
引用
收藏
页码:386 / 399
页数:14
相关论文
共 50 条
  • [1] Semantic Structuring of and Information Extraction from Medical Documents Using the UMLS
    Denecke, K.
    METHODS OF INFORMATION IN MEDICINE, 2008, 47 (05) : 425 - 434
  • [2] Combination of semantic word similarity metrics in video retrieval
    Memar, Sara
    Affendey, Lilly Suriani
    Mustapha, Norwati
    Doraisamy, Shyamala C.
    International Review on Computers and Software, 2011, 6 (03) : 299 - 305
  • [3] Semantic Retrieval Approach for Web Documents
    Harb, Hany M.
    Fouad, Khaled M.
    Nagdy, Nagdy M.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2011, 2 (09) : 67 - 76
  • [4] Exploiting the semantic graph for the representation and retrieval of medical documents
    Zhao, Qing
    Kang, Yangyang
    Li, Jianqiang
    Wang, Dan
    COMPUTERS IN BIOLOGY AND MEDICINE, 2018, 101 : 39 - 50
  • [5] A semantic space approach for automatic summarization of documents
    Kaszas, Valer
    Tundik, Mate Akos
    Szaszak, Gyorgy
    2018 9TH IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFOCOMMUNICATIONS (COGINFOCOM), 2018, : 153 - 157
  • [6] Ontology-Based Automatic Annotation: An Approach for Efficient Retrieval of Semantic Results of Web Documents
    Tulasi, R. Lakshmi
    Rao, Meda Sreenivasa
    Ankita, K.
    Hgoudar, R.
    PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND INFORMATICS, ICCII 2016, 2017, 507 : 331 - 339
  • [7] A semantic fusion approach between medical images and reports using UMLS
    Racoceanu, Daniel
    Lacoste, Caroline
    Teodorescu, Roxana
    Vuillemenot, Nicolas
    INFORMATION RETRIEVAL TECHNOLOGY, PROCEEDINGS, 2006, 4182 : 460 - 475
  • [8] Combination of Visual and Textual Similarity Retrieval from Medical Documents
    Eggel, Ivan
    Mueller, Henning
    MEDICAL INFORMATICS IN A UNITED AND HEALTHY EUROPE, 2009, 150 : 841 - 845
  • [9] AUTOMATIC INFORMATION RETRIEVAL APPLIED TO MEDICAL RECORDS
    FERNET, P
    PRESSE MEDICALE, 1971, 79 (23): : 1045 - &
  • [10] Automatic annotation generation of medical documents for effective medical information retrieval
    School of Computing Science and Engineering, VIT University, Vellore, Tamil Nadu
    632014, India
    Int. J. Reasoning based Intell. Syst., 3-4 (305-314):