An empirical study of gene synonym query expansion in biomedical information retrieval

被引:16
|
作者
Lu, Yue [1 ]
Fang, Hui [2 ]
Zhai, Chengxiang [1 ]
机构
[1] Univ Illinois, Dept Comp Sci, Urbana, IL 61801 USA
[2] Univ Delaware, Newark, DE 19716 USA
来源
INFORMATION RETRIEVAL | 2009年 / 12卷 / 01期
基金
美国国家科学基金会;
关键词
Biomedical information retrieval; Synonym query expansion; Language modeling;
D O I
10.1007/s10791-008-9075-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Due to the heavy use of gene synonyms in biomedical text, people have tried many query expansion techniques using synonyms in order to improve performance in biomedical information retrieval. However, mixed results have been reported. The main challenge is that it is not trivial to assign appropriate weights to the added gene synonyms in the expanded query; under-weighting of synonyms would not bring much benefit, while overweighting some unreliable synonyms can hurt performance significantly. So far, there has been no systematic evaluation of various synonym query expansion strategies for biomedical text. In this work, we propose two different strategies to extend a standard language modeling approach for gene synonym query expansion and conduct a systematic evaluation of these methods on all the available TREC biomedical text collections for ad hoc document retrieval. Our experiment results show that synonym expansion can significantly improve the retrieval accuracy. However, different query types require different synonym expansion methods, and appropriate weighting of gene names and synonym terms is critical for improving performance.
引用
收藏
页码:51 / 68
页数:18
相关论文
共 50 条
  • [31] Neural text generation for query expansion in information retrieval
    Claveau, Vincent
    2021 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT 2021), 2021, : 202 - 209
  • [32] Introducing query expansion methods for collaborative information retrieval
    Hust, A
    READING AND LEARNING: ADAPTIVE CONTENT RECOGNITION, 2004, 2956 : 252 - 280
  • [33] Clustering Algorithms for Query Expansion Based Information Retrieval
    Khennak, Ilyes
    Drias, Habiba
    Kechid, Amine
    Moulai, Hadjer
    COMPUTATIONAL COLLECTIVE INTELLIGENCE, PT II, 2019, 11684 : 261 - 272
  • [34] An empirical study of query expansion and cluster-based retrieval in language modeling approach
    Na, Seung-Hoon
    Kang, In-Su
    Roh, Ji-Eun
    Lee, Jong-Hyeok
    INFORMATION PROCESSING & MANAGEMENT, 2007, 43 (02) : 302 - 314
  • [35] An empirical study of query expansion and cluster-based retrieval in language modeling approach
    Na, SH
    Kang, IS
    Roh, JE
    Lee, JH
    INFORMATION RETRIEVAL TECHNOLOGY, PROCEEDINGS, 2005, 3689 : 274 - 287
  • [36] Multi-field query expansion is effective for biomedical dataset retrieval
    Bouadjenek, Mohamed Reda
    Verspoor, Karin
    DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2017,
  • [37] Boosting Biomedical Information Retrieval Performance through Citation Graph: An Empirical Study
    Yin, Xiaoshi
    Huang, Xiangji
    Hu, Qinmin
    Li, Zhoujun
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2009, 5476 : 949 - +
  • [38] Information-retrieval algorithm based on query expansion and classification
    Yue, Wen
    Chen, Zhi-Ping
    Lin, Ya-Ping
    Xitong Fangzhen Xuebao / Journal of System Simulation, 2006, 18 (07): : 1926 - 1929
  • [39] Query Expansion and Verification with Large Language Model for Information Retrieval
    Zhang, Wenjing
    Liu, Zhaoxiang
    Wang, Kai
    Lian, Shiguo
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT IV, ICIC 2024, 2024, 14878 : 341 - 351
  • [40] An Information Retrieval System for Malayalam Using Query Expansion Technique
    Babu, Arjun
    Sindhu, L.
    2015 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2015, : 1559 - 1564