Query2doc: Query Expansion with Large Language Models

被引:0
|
作者
Wang, Liang [1 ]
Yang, Nan [1 ]
Wei, Furu [1 ]
机构
[1] Microsoft Res, Beijing, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper introduces a simple yet effective query expansion approach, denoted as query2doc, to improve both sparse and dense retrieval systems. The proposed method first generates pseudo-documents by few-shot prompting large language models (LLMs), and then expands the query with generated pseudo-documents. LLMs are trained on web-scale text corpora and are adept at knowledge memorization. The pseudo-documents from LLMs often contain highly relevant information that can aid in query disambiguation and guide the retrievers. Experimental results demonstrate that query2doc boosts the performance of BM25 by 3% to 15% on ad-hoc IR datasets, such as MS-MARCO and TREC DL, without any model fine-tuning. Furthermore, our method also benefits state-of-the-art dense retrievers in terms of both in-domain and out-of-domain results.
引用
收藏
页码:9414 / 9423
页数:10
相关论文
共 50 条
  • [31] Exploiting Underrepresented Query Aspects for Automatic Query Expansion
    Crabtree, Daniel
    Andreae, Peter
    Gao, Xiaoying
    KDD-2007 PROCEEDINGS OF THE THIRTEENTH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2007, : 191 - 200
  • [32] Query Bootstrapping: A Visual Mining Based Query Expansion
    Kasamwattanarote, Siriwat
    Uchida, Yusuke
    Satoh, Shin'ichi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2016, E99D (02): : 454 - 466
  • [33] Understanding query aspects with applications to interactive query expansion
    Crabtree, Daniel
    Andreae, Peter
    Gao, Xiaoying
    PROCEEDINGS OF THE IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE: WI 2007, 2007, : 691 - 695
  • [34] Query difficulty, robustness, and selective application of query expansion
    Amati, G
    Carpineto, C
    Romano, G
    ADVANCES IN INFORMATION RETRIEVAL, PROCEEDINGS, 2004, 2997 : 127 - 137
  • [35] Conceptual query expansion
    Grootjen, FA
    van der Weide, TP
    DATA & KNOWLEDGE ENGINEERING, 2006, 56 (02) : 174 - 193
  • [36] Conceptual query expansion
    Hoeber, O
    Yang, XD
    Yao, YY
    ADVANCES IN WEB INTELLIGENCE, PROCEEDINGS, 2005, 3528 : 190 - 196
  • [37] Context-aware query expansion method using Language Models and Latent Semantic Analyses
    El Ghali, Btihal
    El Qadi, Abderrahim
    KNOWLEDGE AND INFORMATION SYSTEMS, 2017, 50 (03) : 751 - 762
  • [38] Query expansion and medline
    Srinivasan, Padmini
    Information Processing and Management, 1996, 32 (04): : 431 - 443
  • [39] Query expansion and MEDLINE
    Srinivasan, P
    INFORMATION PROCESSING & MANAGEMENT, 1996, 32 (04) : 431 - 443
  • [40] Context-aware query expansion method using Language Models and Latent Semantic Analyses
    Btihal El Ghali
    Abderrahim El Qadi
    Knowledge and Information Systems, 2017, 50 : 751 - 762