Factors affecting the effectiveness of biomedical document indexing and retrieval based on terminologies

被引:9
|
作者
Duy Dinh [1 ]
Tamine, Lynda [1 ]
Boubekeur, Fatiha [2 ]
机构
[1] Univ Toulouse 3, Inst Rech Informat Toulouse, F-31062 Toulouse, France
[2] Mouloud Mammeri Univ, Dept Comp Sci, Tizi Ouzou 15000, Algeria
关键词
Multi-terminology indexing; Voting techniques; Document/query expansion; Concept extraction; Biomedical retrieval; QUERY EXPANSION; INFORMATION-RETRIEVAL; TEXT; DICTIONARY; GENE;
D O I
10.1016/j.artmed.2012.08.006
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Objective: The aim of this work is to evaluate a set of indexing and retrieval strategies based on the integration of several biomedical terminologies on the available TREC Genomics collections for an ad hoc information retrieval (IR) task. Materials and methods: We propose a multi-terminology based concept extraction approach to selecting best concepts from free text by means of voting techniques. We instantiate this general approach on four terminologies (MeSH, SNOMED, ICD-10 and GO). We particularly focus on the effect of integrating terminologies into a biomedical IR process, and the utility of using voting techniques for combining the extracted concepts from each document in order to provide a list of unique concepts. Results: Experimental studies conducted on the TREC Genomics collections show that our multi-terminology IR approach based on voting techniques are statistically significant compared to the baseline. For example, tested on the 2005 TREC Genomics collection, our multi-terminology based IR approach provides an improvement rate of +6.98% in terms of MAP (mean average precision) (p<0.05) compared to the baseline. In addition, our experimental results show that document expansion using preferred terms in combination with query expansion using terms from top ranked expanded documents improve the biomedical IR effectiveness. Conclusion: We have evaluated several voting models for combining concepts issued from multiple terminologies. Through this study, we presented many factors affecting the effectiveness of biomedical IR system including term weighting, query expansion, and document expansion models. The appropriate combination of those factors could be useful to improve the IR performance. (C) 2012 Elsevier B.V. All rights reserved.
引用
收藏
页码:155 / 167
页数:13
相关论文
共 50 条
  • [41] Biomedical image indexing and retrieval descriptors: A comparative study
    Deep, Gagan
    Kaur, Lakhwinder
    Gupta, Savita
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL MODELLING AND SECURITY (CMS 2016), 2016, 85 : 954 - 961
  • [42] REFERENCE RETRIEVAL TOOLS - BIOMEDICAL ABSTRACTING + INDEXING SERVICES
    ORR, RH
    PINGS, VM
    LEEDS, AA
    FEDERATION PROCEEDINGS, 1964, 23 (5P1) : 1164 - &
  • [43] A Context Sensitive Document Indexing Approach for Information Retrieval
    Vanishree, M.
    Sudha, R.
    2014 INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND SIGNAL PROCESSING (ICCSP), 2014,
  • [44] Framework for document retrieval using latent semantic indexing
    Phadnis, Neelam
    Gadge, Jayant
    International Journal of Computers and Applications, 2014, 94 (14) : 37 - 41
  • [45] Hierarchical indexing and flexible element retrieval for structured document
    Cui, H
    Wen, JR
    Chua, TS
    ADVANCES IN INFORMATION RETRIEVAL, 2003, 2633 : 73 - 87
  • [46] The AMTEx approach in the medical document indexing and retrieval application
    Hliaoutakis, Angelos
    Zervanou, Kaliope
    Petrakis, Euripides G. M.
    DATA & KNOWLEDGE ENGINEERING, 2009, 68 (03) : 380 - 392
  • [47] Semantic Indexing and Document Retrieval for Personalized Language Modeling
    Stas, Jan
    Hladek, Daniel
    Juhar, Jozef
    PROCEEDINGS OF 2017 INTERNATIONAL SYMPOSIUM ELMAR, 2017, : 157 - 161
  • [48] Impact of term-indexing for arabic document retrieval
    LINA FRE CNRS 2729, Université de Nantes, 2 rue la Houssinière, 44322 Nantes Cedex 03, France
    不详
    Lect. Notes Comput. Sci., 2008, (380-383):
  • [49] Impact of term-indexing for Arabic document retrieval
    Boulaknadel, Siham
    NATURAL LANGUAGE AND INFORMATION SYSTEMS, PROCEEDINGS, 2008, 5039 : 380 - 383
  • [50] XML Document Retrieval by Developing an Effective Indexing Technique
    Posonia, A. Mary
    Jyothi, V. L.
    2014 SIXTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING, 2014, : 120 - 123