An empirical study of gene synonym query expansion in biomedical information retrieval

Cited by: 16
Authors
Lu, Yue [1]
Fang, Hui [2]
Zhai, Chengxiang [1]
Affiliations
[1] Univ Illinois, Dept Comp Sci, Urbana, IL 61801 USA
[2] Univ Delaware, Newark, DE 19716 USA
Source
Information Retrieval | 2009 / Vol. 12 / No. 1
Funding
U.S. National Science Foundation
Keywords
Biomedical information retrieval; Synonym query expansion; Language modeling
DOI
10.1007/s10791-008-9075-7
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Due to the heavy use of gene synonyms in biomedical text, people have tried many query expansion techniques using synonyms in order to improve performance in biomedical information retrieval. However, mixed results have been reported. The main challenge is that it is not trivial to assign appropriate weights to the added gene synonyms in the expanded query; underweighting synonyms would not bring much benefit, while overweighting some unreliable synonyms can hurt performance significantly. So far, there has been no systematic evaluation of various synonym query expansion strategies for biomedical text. In this work, we propose two different strategies to extend a standard language modeling approach for gene synonym query expansion and conduct a systematic evaluation of these methods on all the available TREC biomedical text collections for ad hoc document retrieval. Our experimental results show that synonym expansion can significantly improve retrieval accuracy. However, different query types require different synonym expansion methods, and appropriate weighting of gene names and synonym terms is critical for improving performance.
Pages: 51-68
Number of pages: 18
Related papers
50 in total
  • [21] An information retrieval model based on query expansion
    Huang, Mingxuan
    Zhang, Shichao
    Yan, Xiaowei
    Huang, Faliang
    RECENT ADVANCE OF CHINESE COMPUTING TECHNOLOGIES, 2007, : 217 - 221
  • [23] Using Ontologies for Query Expansion in Image Retrieval in the Biomedical Domain
    Mata, Jacinto
    Crespo, Mariano
    Mana, Manuel J.
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2011, (47): : 39 - 46
  • [24] Simple weighting techniques for query expansion in biomedical document retrieval
    Song, Young-In
    Han, Kyoung-Soo
    Park, So-Young
    Kim, Sang-Bum
    Rim, Hae-Chang
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2007, E90D (11) : 1873 - 1876
  • [25] Exploring noise control strategies for UMLS-based query expansion in health and biomedical information retrieval
    Wu H.
    Li J.
    Kang Y.
    Zhong T.
    Journal of Ambient Intelligence and Humanized Computing, 2024, 15 (3) : 1825 - 1836
  • [26] Information Retrieval Experiment on Subjective Words Query Expansion
    Sodanil, Maleerat
    Ketmaneechairat, Hathairat
    2013 INTERNATIONAL CONFERENCE OF INFORMATION AND COMMUNICATION TECHNOLOGY (ICOICT), 2013, : 161 - 165
  • [28] An Empirical Study of Word Sense Disambiguation for Biomedical Information Retrieval System
    Rais, Mohammed
    Lachkar, Abdelmonaime
    BIOINFORMATICS AND BIOMEDICAL ENGINEERING, IWBBIO 2018, PT I, 2018, 10813 : 314 - 326
  • [29] Query expansion based on clustering and personalized information retrieval
    Khalifi, Hamid
    Cherif, Walid
    El Qadi, Abderrahim
    Ghanou, Youssef
    PROGRESS IN ARTIFICIAL INTELLIGENCE, 2019, 8 (02) : 241 - 251
  • [30] Query Expansion for Mixed-Script Information Retrieval
    Gupta, Parth
    Bali, Kalika
    Banchs, Rafael E.
    Choudhury, Monojit
    Rosso, Paolo
    SIGIR'14: PROCEEDINGS OF THE 37TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2014, : 677 - 686