A Relative Information Gain-based Query Performance Prediction Framework with Generated Query Variants

被引:4
|
作者
Datta, Suchana [1 ]
Ganguly, Debasis [2 ]
Mitra, Mandar [3 ]
Greene, Derek [1 ]
机构
[1] Univ Coll Dublin, Belfield, Dublin D04 V1W8, Ireland
[2] Univ Glasgow, Glasgow G12 8QQ, Lanark, Scotland
[3] Indian Stat Inst, 203 Barrackpore Trunk Rd, Kolkata 700108, W Bengal, India
基金
爱尔兰科学基金会;
关键词
Query performance prediction; neural model retrieval scores; query variant generation; MODELS;
D O I
10.1145/3545112
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Query performance prediction (QPP) methods, which aim to predict the performance of a query, often rely on evidences in the form of different characteristic patterns in the distribution of Retrieval Status Values (RSVs). However, for neural IR models, it is usually observed that the RSVs are often less reliable for QPP because they are bounded within short intervals, different from the situation for statistical models. To address this limitation, we propose a model-agnostic QPP framework that gathers additional evidences by leveraging information from the characteristic patterns of RSV distributions computed over a set of automatically generated query variants, relative to that of the current query. Specifically, the idea behind our proposed method-Weighted Relative Information Gain (WRIG), is that a substantial relative decrease or increase in the standard deviation of the RSVs of the query variants is likely to be a relative indicator of how easy or difficult the original query is. To cater for the absence of human-annotated query variants in real-world scenarios, we further propose an automatic query variant generation method. This can produce variants in a controlled manner by substituting terms from the original query with new ones sampled from a weighted distribution, constructed either via a relevance model or with the help of an embedded representation of query terms. Our experiments on the TREC-Robust, ClueWeb09B, and MS MARCO datasets show thatWRIG, by the use of this relative changes in QPP estimate, leads to significantly better results than a state-of-the-art baseline method that leverages information from (manually created) query variants by the application of additive smoothing [64]. The results also show that our approach can improve the QPP effectiveness of neural retrieval approaches in particular.
引用
收藏
页数:31
相关论文
共 50 条
  • [1] Information Retrieval Based on Pseudo Prediction Query Performance
    Gong, Yu-Xi
    Zhang, Min-Xia
    Luo, Rong
    [J]. PROCEEDINGS OF THE FIRST INTERNATIONAL WORKSHOP ON EDUCATION TECHNOLOGY AND COMPUTER SCIENCE, VOL II, 2009, : 283 - 286
  • [2] Information Needs, Queries, and Query Performance Prediction
    Zendel, Oleg
    Shtok, Anna
    Rabier, Fiana
    Kurland, Oren
    Culpepper, J. Shane
    [J]. PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19), 2019, : 395 - 404
  • [3] An Extended Query Performance Prediction Framework Utilizing Passage-Level Information
    Roitman, Haggai
    [J]. PROCEEDINGS OF THE 2018 ACM SIGIR INTERNATIONAL CONFERENCE ON THEORY OF INFORMATION RETRIEVAL (ICTIR'18), 2018, : 35 - 42
  • [4] Query Performance Prediction using Passage Information
    Roitman, Haggai
    [J]. ACM/SIGIR PROCEEDINGS 2018, 2018, : 893 - 896
  • [5] Query performance prediction for information retrieval based on covering topic score
    Lang, Hao
    Wang, Bin
    Jones, Gareth
    Li, Jin-Tao
    Ding, Fan
    Liu, Yi-Xuan
    [J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2008, 23 (04) : 590 - 601
  • [6] Query Performance Prediction for Information Retrieval Based on Covering Topic Score
    Hao Lang
    Bin Wang
    Gareth Jones
    Jin-Tao Li
    Fan Ding
    Yi-Xuan Liu
    [J]. Journal of Computer Science and Technology, 2008, 23 : 590 - 601
  • [7] Query Performance Prediction for Information Retrieval Based on Covering Topic Score
    郎皓
    王斌
    Gareth Jones
    李锦涛
    丁凡
    刘宜轩
    [J]. Journal of Computer Science & Technology, 2008, (04) : 590 - 601
  • [8] Query performance prediction
    He, Ben
    Ounis, Iadh
    [J]. INFORMATION SYSTEMS, 2006, 31 (07) : 585 - 594
  • [9] A Geometric Framework for Query Performance Prediction in Conversational Search
    Faggioli, Guglielmo
    Ferro, Nicola
    Muntean, Cristina Ioana
    Perego, Raffaele
    Tonellotto, Nicola
    [J]. PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 1355 - 1365
  • [10] Query Performance Prediction and Classification for Information Search Systems
    Zhang, Zhongmin
    Chen, Jiawei
    Wu, Shengli
    [J]. WEB AND BIG DATA (APWEB-WAIM 2018), PT I, 2018, 10987 : 277 - 285