A Relative Information Gain-based Query Performance Prediction Framework with Generated Query Variants

被引:4
|
作者
Datta, Suchana [1 ]
Ganguly, Debasis [2 ]
Mitra, Mandar [3 ]
Greene, Derek [1 ]
机构
[1] Univ Coll Dublin, Belfield, Dublin D04 V1W8, Ireland
[2] Univ Glasgow, Glasgow G12 8QQ, Lanark, Scotland
[3] Indian Stat Inst, 203 Barrackpore Trunk Rd, Kolkata 700108, W Bengal, India
基金
爱尔兰科学基金会;
关键词
Query performance prediction; neural model retrieval scores; query variant generation; MODELS;
D O I
10.1145/3545112
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Query performance prediction (QPP) methods, which aim to predict the performance of a query, often rely on evidences in the form of different characteristic patterns in the distribution of Retrieval Status Values (RSVs). However, for neural IR models, it is usually observed that the RSVs are often less reliable for QPP because they are bounded within short intervals, different from the situation for statistical models. To address this limitation, we propose a model-agnostic QPP framework that gathers additional evidences by leveraging information from the characteristic patterns of RSV distributions computed over a set of automatically generated query variants, relative to that of the current query. Specifically, the idea behind our proposed method-Weighted Relative Information Gain (WRIG), is that a substantial relative decrease or increase in the standard deviation of the RSVs of the query variants is likely to be a relative indicator of how easy or difficult the original query is. To cater for the absence of human-annotated query variants in real-world scenarios, we further propose an automatic query variant generation method. This can produce variants in a controlled manner by substituting terms from the original query with new ones sampled from a weighted distribution, constructed either via a relevance model or with the help of an embedded representation of query terms. Our experiments on the TREC-Robust, ClueWeb09B, and MS MARCO datasets show thatWRIG, by the use of this relative changes in QPP estimate, leads to significantly better results than a state-of-the-art baseline method that leverages information from (manually created) query variants by the application of additive smoothing [64]. The results also show that our approach can improve the QPP effectiveness of neural retrieval approaches in particular.
引用
收藏
页数:31
相关论文
共 50 条
  • [41] The Combination and Evaluation of Query Performance Prediction Methods
    Hauff, Claudi
    Azzopardi, Leif
    Hienstra, Djoerd
    [J]. ADVANCES IN INFORMATION RETRIEVAL, PROCEEDINGS, 2009, 5478 : 301 - +
  • [42] Keyphrase extraction through query performance prediction
    Ercan, Gonenc
    Cicekli, Ilyas
    [J]. JOURNAL OF INFORMATION SCIENCE, 2012, 38 (05) : 476 - 488
  • [43] An Analysis of Variations in the Effectiveness of Query Performance Prediction
    Ganguly, Debasis
    Datta, Suchana
    Mitra, Mandar
    Greene, Derek
    [J]. ADVANCES IN INFORMATION RETRIEVAL, PT I, 2022, 13185 : 215 - 229
  • [44] An information gain-based approach for evaluating protein structure models
    Postic, Guillaume
    Janel, Nathalie
    Tuffery, Pierre
    Moroy, Gautier
    [J]. COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2020, 18 : 2228 - 2236
  • [45] DBQA: A Comprehensive Query Performance Analyzer with Django Framework
    Simon, Judy
    Kapileswar, N.
    Datchinamoorthi, M.
    Devi, Keerthana G.
    Muthukumar, S.
    [J]. 2024 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATION AND APPLIED INFORMATICS, ACCAI 2024, 2024,
  • [46] A new framework based on features modeling and ensemble learning to predict query performance
    Zaghloul, Mohamed
    Salem, Mofreh
    Ali-Eldin, Amr
    [J]. PLOS ONE, 2021, 16 (10):
  • [47] Probability-based prediction query algorithm
    Yan, Yushuang
    Pei, Qingqi
    Wang, Xiang
    Wang, Yong
    [J]. AD HOC NETWORKS, 2017, 60 : 52 - 65
  • [48] Information gain-based metric for recognizing transitions in human activities
    Sadri, Amin
    Ren, Yongli
    Salim, Flora D.
    [J]. PERVASIVE AND MOBILE COMPUTING, 2017, 38 : 92 - 109
  • [49] A dynamic query scheduling framework for distributed and evolving information systems
    Liu, L
    Pu, CT
    [J]. PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, 1997, : 474 - 481
  • [50] Robust Standard Deviation Estimation for Query Performance Prediction
    Roitman, Haggai
    Erera, Shai
    Weiner, Bar
    [J]. ICTIR'17: PROCEEDINGS OF THE 2017 ACM SIGIR INTERNATIONAL CONFERENCE THEORY OF INFORMATION RETRIEVAL, 2017, : 245 - 248