An empirical study of gene synonym query expansion in biomedical information retrieval

Cited by: 16
Authors
Lu, Yue [1]
Fang, Hui [2]
Zhai, Chengxiang [1]
Affiliations
[1] Univ Illinois, Dept Comp Sci, Urbana, IL 61801 USA
[2] Univ Delaware, Newark, DE 19716 USA
Source
INFORMATION RETRIEVAL | 2009, Vol. 12, No. 1
Funding
National Science Foundation (USA);
Keywords
Biomedical information retrieval; Synonym query expansion; Language modeling;
DOI
10.1007/s10791-008-9075-7
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Due to the heavy use of gene synonyms in biomedical text, people have tried many query expansion techniques using synonyms in order to improve performance in biomedical information retrieval. However, mixed results have been reported. The main challenge is that it is not trivial to assign appropriate weights to the added gene synonyms in the expanded query; underweighting synonyms would not bring much benefit, while overweighting some unreliable synonyms can hurt performance significantly. So far, there has been no systematic evaluation of various synonym query expansion strategies for biomedical text. In this work, we propose two different strategies to extend a standard language modeling approach for gene synonym query expansion and conduct a systematic evaluation of these methods on all the available TREC biomedical text collections for ad hoc document retrieval. Our experimental results show that synonym expansion can significantly improve the retrieval accuracy. However, different query types require different synonym expansion methods, and appropriate weighting of gene names and synonym terms is critical for improving performance.
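The weighting trade-off described in the abstract can be illustrated with a small sketch. The record does not specify the paper's two strategies, so the code below is not the authors' method: it assumes a generic linear interpolation between the original query model and a uniform synonym model, scored with Dirichlet-smoothed document language models. The names `expanded_query_model`, `score`, `rank`, the parameters `alpha` and `mu`, and the three toy documents are all hypothetical.

```python
import math
from collections import Counter

# Toy collection (hypothetical data, for illustration only).
DOCS = {
    "d1": ["p53", "mutation", "in", "tumor", "cells"],
    "d2": ["tp53", "mutation", "in", "tumor", "cells"],
    "d3": ["insulin", "signaling", "in", "liver", "cells"],
}


def expanded_query_model(query_terms, synonyms, alpha):
    """Assumed scheme: (1 - alpha) * original query model + alpha * uniform synonym model."""
    if not synonyms:
        alpha = 0.0  # nothing to expand with, keep the original query model
    counts = Counter(query_terms)
    total = sum(counts.values())
    model = {w: (1.0 - alpha) * c / total for w, c in counts.items()}
    for s in synonyms:
        model[s] = model.get(s, 0.0) + alpha / len(synonyms)
    return model


def score(query_model, doc_terms, coll_counts, coll_len, mu):
    """Query-model-weighted log-likelihood under a Dirichlet-smoothed document model."""
    doc_counts = Counter(doc_terms)
    vocab_size = len(coll_counts)
    total = 0.0
    for w, weight in query_model.items():
        # Add-one smoothing on the collection model so unseen terms avoid log(0).
        p_coll = (coll_counts.get(w, 0) + 1.0) / (coll_len + vocab_size)
        p_doc = (doc_counts[w] + mu * p_coll) / (len(doc_terms) + mu)
        total += weight * math.log(p_doc)
    return total


def rank(query_terms, synonyms, alpha, docs=DOCS, mu=10.0):
    """Return document ids sorted by descending score."""
    coll_counts = Counter(w for terms in docs.values() for w in terms)
    coll_len = sum(coll_counts.values())
    model = expanded_query_model(query_terms, synonyms, alpha)
    return sorted(
        docs,
        key=lambda d: score(model, docs[d], coll_counts, coll_len, mu),
        reverse=True,
    )


if __name__ == "__main__":
    for alpha in (0.0, 0.2, 0.6):
        print(alpha, rank(["p53", "mutation"], ["tp53"], alpha))
```

In this toy setting, a small `alpha` keeps the exact-match document ranked first while still lifting the synonym-only document above the unrelated one; a large `alpha` lets the synonym dominate the original gene name, which is the kind of overweighting risk the abstract warns about.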
Pages: 51-68
Page count: 18