CEQE to SQET: A study of contextualized embeddings for query expansion

Cited by: 0
Authors
Shahrzad Naseri
Jeffrey Dalton
Andrew Yates
James Allan
Affiliations
[1] University of Massachusetts Amherst
[2] University of Glasgow
[3] University of Amsterdam
Source
Information Retrieval Journal, 2022, 25 (02)
Keywords
Query expansion; Contextualized language models; Embeddings
DOI
Not available
Abstract
In this work, we study recent advances in context-sensitive language models for the task of query expansion. We study the behavior of existing and new approaches for lexical word-based expansion in both unsupervised and supervised contexts. For unsupervised models, we study the behavior of the Contextualized Embeddings for Query Expansion (CEQE) model. We introduce a new model, Supervised Contextualized Query Expansion with Transformers (SQET), that performs expansion as a supervised classification task and leverages context in pseudo-relevant results. We study the behavior of these expansion approaches for the tasks of ad-hoc document and passage retrieval. We conduct experiments combining expansion with probabilistic retrieval models as well as neural document ranking models. We evaluate expansion effectiveness on three standard TREC collections: Robust, Complex Answer Retrieval, and Deep Learning. We analyze extrinsic retrieval effectiveness and the intrinsic ability to rank expansion terms, and we perform a qualitative analysis of the differences between the methods. We find that CEQE statistically significantly outperforms static embeddings across all three datasets for Recall@1000. Moreover, CEQE outperforms static embedding-based expansion methods on multiple collections (by up to 18% on Robust and 31% on Deep Learning in average precision) and also improves over proven probabilistic pseudo-relevance feedback (PRF) models. SQET outperforms CEQE by 6% in P@20 on the intrinsic term ranking evaluation and is approximately as effective in retrieval performance. Models incorporating neural and CEQE-based expansion scores achieve gains of up to 5% in P@20 and 2% in AP on Robust over the state-of-the-art transformer-based re-ranking model, Birch.
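The abstract reports results in terms of P@20, Recall@1000, and average precision (AP). As a reading aid, the following is a minimal sketch of the standard textbook definitions of these rank-based metrics for a single query. It is not the authors' evaluation code; the function names and the toy document IDs are assumptions made for illustration only.

```python
from typing import Sequence, Set


def precision_at_k(ranking: Sequence[str], relevant: Set[str], k: int) -> float:
    """Fraction of the top-k ranked items that are relevant."""
    if k <= 0:
        raise ValueError("k must be positive")
    hits = sum(1 for item in ranking[:k] if item in relevant)
    return hits / k


def recall_at_k(ranking: Sequence[str], relevant: Set[str], k: int) -> float:
    """Fraction of all relevant items that appear in the top-k."""
    if not relevant:
        return 0.0
    hits = sum(1 for item in ranking[:k] if item in relevant)
    return hits / len(relevant)


def average_precision(ranking: Sequence[str], relevant: Set[str]) -> float:
    """Mean of the precision values at each rank where a relevant item occurs,
    normalized by the total number of relevant items."""
    if not relevant:
        return 0.0
    hits = 0
    total = 0.0
    for rank, item in enumerate(ranking, start=1):
        if item in relevant:
            hits += 1
            total += hits / rank
    return total / len(relevant)


# Toy example (hypothetical document IDs, not data from the paper).
ranking = ["d3", "d1", "d7", "d2", "d9"]
relevant = {"d1", "d2", "d5"}

print(precision_at_k(ranking, relevant, 2))  # 0.5
print(recall_at_k(ranking, relevant, 5))
print(average_precision(ranking, relevant))
```

In the toy example, one of the top two documents is relevant, so P@2 is 0.5; two of the three relevant documents are retrieved, so Recall@5 is 2/3; and AP is (1/2 + 2/4) / 3 = 1/3. Collection-level scores are typically obtained by averaging such per-query values over all queries.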
Pages: 184-208
Page count: 24
Related papers
  • [1] CEQE to SQET: A study of contextualized embeddings for query expansion
    Naseri, Shahrzad
    Dalton, Jeffrey
    Yates, Andrew
    Allan, James
    INFORMATION RETRIEVAL JOURNAL, 2022, 25 (02): : 184 - 208
  • [2] Contextualized Query Embeddings for Conversational Search
    Lin, Sheng-Chieh
    Yang, Jheng-Hong
    Lin, Jimmy
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 1004 - 1015
  • [3] Query Expansion Using Word Embeddings
    Kuzi, Saar
    Shtok, Anna
    Kurland, Oren
    CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016, : 1929 - 1932
  • [4] Personalized Query Expansion with Contextual Word Embeddings
    Bassani, Elias
    Tonellotto, Nicola
    Pasi, Gabriella
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2024, 42 (02)
  • [5] Query Expansion with Locally-Trained Word Embeddings
    Diaz, Fernando
    Mitra, Bhaskar
    Craswell, Nick
    PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, 2016, : 367 - 377
  • [6] Contextualized query expansion via unsupervised chunk selection for text retrieval
    Zheng, Zhi
    Hui, Kai
    He, Ben
    Han, Xianpei
    Sun, Le
    Yates, Andrew
    INFORMATION PROCESSING & MANAGEMENT, 2021, 58 (05)
  • [7] Query Expansion With Local Conceptual Word Embeddings in Microblog Retrieval
    Wang, Yashen
    Huang, Heyan
    Feng, Chong
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2021, 33 (04) : 1737 - 1749
  • [8] Learning Concept Embeddings for Query Expansion by Quantum Entropy Minimization
    Sordoni, Alessandro
    Bengio, Yoshua
    Nie, Jian-Yun
    PROCEEDINGS OF THE TWENTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2014, : 1586 - 1592
  • [9] Visually Analyzing Contextualized Embeddings
    Berger, Matthew
    2020 IEEE VISUALIZATION CONFERENCE - SHORT PAPERS (VIS 2020), 2020, : 276 - 280
  • [10] BERT-QE: Contextualized Query Expansion for Document Re-ranking
    Zheng, Zhi
    Hui, Kai
    He, Ben
    Han, Xianpei
    Sun, Le
    Yates, Andrew
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 4718 - 4728