Query2doc: Query Expansion with Large Language Models

被引:0
|
作者
Wang, Liang [1 ]
Yang, Nan [1 ]
Wei, Furu [1 ]
机构
[1] Microsoft Res, Beijing, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper introduces a simple yet effective query expansion approach, denoted as query2doc, to improve both sparse and dense retrieval systems. The proposed method first generates pseudo-documents by few-shot prompting large language models (LLMs), and then expands the query with generated pseudo-documents. LLMs are trained on web-scale text corpora and are adept at knowledge memorization. The pseudo-documents from LLMs often contain highly relevant information that can aid in query disambiguation and guide the retrievers. Experimental results demonstrate that query2doc boosts the performance of BM25 by 3% to 15% on ad-hoc IR datasets, such as MS-MARCO and TREC DL, without any model fine-tuning. Furthermore, our method also benefits state-of-the-art dense retrievers in terms of both in-domain and out-of-domain results.
引用
收藏
页码:9414 / 9423
页数:10
相关论文
共 50 条
  • [21] Quantum Language Model-based Query Expansion
    Li, Qiuchi
    Melucci, Massimo
    Tiwari, Prayag
    PROCEEDINGS OF THE 2018 ACM SIGIR INTERNATIONAL CONFERENCE ON THEORY OF INFORMATION RETRIEVAL (ICTIR'18), 2018, : 183 - 186
  • [22] Query Expansion for Language Modeling Using Sentence Similarities
    Ganguly, Debasis
    Leveling, Johannes
    Jones, Gareth J. F.
    MULTIDISCIPLINARY INFORMATION RETRIEVAL, 2011, 6653 : 62 - 77
  • [23] Data Annotation Models and Annotation Query Language
    Bhatnagar, Neerja
    Juliano, Benjoe A.
    Renner, Renee S.
    PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 19, 2007, 19 : 440 - +
  • [24] A GRAPHICAL QUERY LANGUAGE FOR SEMANTIC DATA MODELS
    SCHNEIDER, M
    TREPIED, C
    PROCEEDINGS OF THE TENTH INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS, 1989, : 153 - 164
  • [25] EditQL: A Textual Query Language for Evolving Models
    Pietron, Jakob
    Jutz, Benedikt
    Raschke, Alexander
    Tichy, Matthias
    27TH INTERNATIONAL ACM/IEEE CONFERENCE ON MODEL DRIVEN ENGINEERING LANGUAGES AND SYSTEMS, MODELS, 2024, : 37 - 48
  • [26] Collaborative Language Models for Localized Query Prediction
    Fang, Yi
    Al Bawab, Ziad
    Crespo, Jean-Francois
    ACM TRANSACTIONS ON INTERACTIVE INTELLIGENT SYSTEMS, 2014, 4 (02)
  • [27] A graph query language and its query processing
    Sheng, L
    Özsoyoglu, ZM
    Özsoyoglu, G
    15TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 1999, : 572 - 581
  • [28] Query Reformulation Behavior in an Interactive Query Expansion Environment
    Carola Carstens
    Dorothea Mildner
    Datenbank-Spektrum, 2011, 11 (3) : 161 - 172
  • [29] Impact of query structure and query expansion on retrieval performance
    Univ of Tampere, Tampere, Finland
    SIGIR Forum, (130-137):
  • [30] Refining Query Expansion Terms using Query Context
    Crimp, Reuben
    Trotman, Andrew
    ADCS'18: PROCEEDINGS OF THE 23RD AUSTRALASIAN DOCUMENT COMPUTING SYMPOSIUM, 2018,