Approximated Doubly Robust Search Relevance Estimation

被引:2
|
作者
Zou, Lixin [1 ]
Hao, Changying [1 ]
Cai, Hengyi [1 ]
Wang, Shuaiqiang [1 ]
Cheng, Suqi [1 ]
Cheng, Zhicong [1 ]
Ye, Wenwen [1 ]
Gu, Simiu [1 ]
Yin, Dawei [1 ]
机构
[1] Baidu Inc, Beijing, Peoples R China
关键词
Doubly Robust; Search Relevance;
D O I
10.1145/3511808.3557145
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Extracting query-document relevance from the sparse, biased click-through log is among the most fundamental tasks in the web search system. Prior art mainly learns a relevance judgment model with semantic features of the query and document and ignores directly counterfactual relevance evaluation from the clicking log. Though the learned semantic matching models can provide relevance signals for tail queries as long as the semantic feature is available. However, such a paradigm lacks the capability to introspectively adjust the biased relevance estimation whenever it conflicts with massive implicit user feedback. The counterfactual evaluation methods, on the contrary, ensure unbiased relevance estimation with sufficient click information. However, they suffer from the sparse or even missing clicks caused by the long-tailed query distribution. In this paper, we propose to unify the counterfactual evaluating and learning approaches for unbiased relevance estimation on search queries with various popularities. Specifically, we theoretically develop a doubly robust estimator with low bias and variance, which intentionally combines the benefits of existing relevance evaluating and learning approaches. We further instantiate the proposed unbiased relevance estimation framework in Baidu search, with comprehensive practical solutions designed regarding the data pipeline for click behavior tracking and online relevance estimation with an approximated deep neural network. Finally, we present extensive empirical evaluations to verify the effectiveness of our proposed framework, finding that it is robust in practice and manages to improve online ranking performance substantially.
引用
收藏
页码:3756 / 3765
页数:10
相关论文
共 50 条
  • [41] Improved Doubly Robust Estimation in Learning Optimal Individualized Treatment Rules
    Pan, Yinghao
    Zhao, Ying-Qi
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2021, 116 (533) : 283 - 294
  • [42] Doubly Robust Estimation of Treatment Effects from Observational Multilevel Data
    Hall, Courtney E.
    Steiner, Peter M.
    Kim, Jee-Seon
    [J]. Quantitative Psychology Research, 2015, 140 : 321 - 340
  • [43] Doubly Robust Interval Estimation for Optimal Policy Evaluation in Online Learning
    Shen, Ye
    Cai, Hengrui
    Song, Rui
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2024,
  • [44] Doubly Robust Semiparametric Estimation for Multi-group Causal Comparisons
    Anqi Yin
    Ao Yuan
    Ming T. Tan
    [J]. Statistics in Biosciences, 2024, 16 : 45 - 68
  • [45] Efficient and doubly robust estimation in covariate-missing data problems
    Zhang, Biao
    [J]. JOURNAL OF STATISTICS & MANAGEMENT SYSTEMS, 2015, 18 (03): : 213 - 250
  • [46] Improved Doubly Robust Estimation in Marginal Mean Models for Dynamic Regimes
    Sun, Hao
    Ertefaie, Ashkan
    Lu, Xin
    Johnson, Brent A.
    [J]. JOURNAL OF CAUSAL INFERENCE, 2020, 8 (01) : 300 - 314
  • [47] Doubly Robust and Efficient Estimation of Marginal Structural Models for the Hazard Function
    Zheng, Wenjing
    Petersen, Maya
    van der Laan, Mark J.
    [J]. INTERNATIONAL JOURNAL OF BIOSTATISTICS, 2016, 12 (01): : 233 - 252
  • [48] Data-Adaptive Bias-Reduced Doubly Robust Estimation
    Vermeulen, Karel
    Vansteelandt, Stijn
    [J]. INTERNATIONAL JOURNAL OF BIOSTATISTICS, 2016, 12 (01): : 253 - 282
  • [49] Nonparametric doubly robust estimation of causal effect on networks in observational studies
    Liu, Jie
    Ye, Fangjuan
    Yang, Yang
    [J]. STAT, 2023, 12 (01):
  • [50] Doubly robust estimation of multivariate fractional outcome means with multivalued treatments
    Negi, Akanksha
    Jeffrey, Wooldridge M.
    [J]. ECONOMETRIC REVIEWS, 2024, 43 (2-4) : 175 - 196