Improving Biomedical Information Retrieval with Neural Retrievers

Authors
Luo, Man [1]
Mitra, Arindam [2]
Gokhale, Tejas [1]
Baral, Chitta [1]
Affiliations
[1] Arizona State Univ, Tempe, AZ 85287 USA
[2] Microsoft, Redmond, WA USA
Funding
National Science Foundation (USA)
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Information retrieval (IR) is essential in search engines and dialogue systems as well as in natural language processing tasks such as open-domain question answering. IR serves an important function in the biomedical domain, where content and sources of scientific knowledge may evolve rapidly. Although neural retrievers have surpassed traditional IR approaches such as TF-IDF and BM25 in standard open-domain question answering tasks, they are still found lacking in the biomedical domain. In this paper, we seek to improve IR using neural retrievers (NR) in the biomedical domain, and we pursue this goal with a three-pronged approach. First, to tackle the relative lack of data in the biomedical domain, we propose a template-based question generation method that can be leveraged to train neural retriever models. Second, we develop two novel pre-training tasks that are closely aligned to the downstream task of information retrieval. Third, we introduce the "Poly-DPR" model, which encodes each context into multiple context vectors. Extensive experiments and analysis on the BioASQ challenge suggest that our proposed method leads to large gains over existing neural approaches and beats BM25 in the small-corpus setting. We show that BM25 and our method can complement each other, and that a simple hybrid model leads to further gains in the large-corpus setting.
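The two retrieval ideas named in the abstract, scoring a context through several context vectors and combining a neural retriever with BM25, can be illustrated with a minimal sketch. The abstract does not say how the multiple vectors are compared with the query or how the hybrid model combines scores, so the max-over-dot-products rule, the min-max normalization, the `weight` parameter, and all function names and toy values below are assumptions made for illustration, not the authors' method.

```python
from typing import Dict, List, Sequence

Vector = Sequence[float]


def dot(u: Vector, v: Vector) -> float:
    """Plain dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def multi_vector_score(query_vec: Vector, context_vecs: List[Vector]) -> float:
    # Assumption: a context is scored by its best-matching vector.
    return max(dot(query_vec, c) for c in context_vecs)


def min_max(scores: Dict[str, float]) -> Dict[str, float]:
    """Rescale scores to [0, 1] so two scoring scales become comparable."""
    lo, hi = min(scores.values()), max(scores.values())
    if hi == lo:
        return {doc: 0.0 for doc in scores}
    return {doc: (s - lo) / (hi - lo) for doc, s in scores.items()}


def hybrid_rank(neural: Dict[str, float],
                lexical: Dict[str, float],
                weight: float = 0.5) -> List[str]:
    # Assumption: the hybrid is a weighted sum of normalized scores.
    n, lex = min_max(neural), min_max(lexical)
    combined = {doc: weight * n[doc] + (1 - weight) * lex[doc] for doc in neural}
    return sorted(combined, key=lambda doc: combined[doc], reverse=True)


# Toy data: one query vector, three contexts with two vectors each.
query = [1.0, 0.0]
contexts = {
    "doc_a": [[0.9, 0.1], [0.0, 1.0]],
    "doc_b": [[0.2, 0.8], [0.4, 0.4]],
    "doc_c": [[0.1, 0.1], [0.0, 0.2]],
}
neural_scores = {doc: multi_vector_score(query, vecs) for doc, vecs in contexts.items()}

# Made-up lexical scores standing in for BM25 output.
bm25_scores = {"doc_a": 2.0, "doc_b": 5.0, "doc_c": 1.0}

ranking = hybrid_rank(neural_scores, bm25_scores)
print(ranking)
```

In this toy setup the neural scores favor doc_a while the lexical scores favor doc_b, so the combined ranking depends on `weight`; this is only meant to show how two complementary signals could be merged.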
Pages: 11038-11046
Page count: 9