Approximated Doubly Robust Search Relevance Estimation

被引:2
|
作者
Zou, Lixin [1 ]
Hao, Changying [1 ]
Cai, Hengyi [1 ]
Wang, Shuaiqiang [1 ]
Cheng, Suqi [1 ]
Cheng, Zhicong [1 ]
Ye, Wenwen [1 ]
Gu, Simiu [1 ]
Yin, Dawei [1 ]
机构
[1] Baidu Inc, Beijing, Peoples R China
关键词
Doubly Robust; Search Relevance;
D O I
10.1145/3511808.3557145
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Extracting query-document relevance from the sparse, biased click-through log is among the most fundamental tasks in the web search system. Prior art mainly learns a relevance judgment model with semantic features of the query and document and ignores directly counterfactual relevance evaluation from the clicking log. Though the learned semantic matching models can provide relevance signals for tail queries as long as the semantic feature is available. However, such a paradigm lacks the capability to introspectively adjust the biased relevance estimation whenever it conflicts with massive implicit user feedback. The counterfactual evaluation methods, on the contrary, ensure unbiased relevance estimation with sufficient click information. However, they suffer from the sparse or even missing clicks caused by the long-tailed query distribution. In this paper, we propose to unify the counterfactual evaluating and learning approaches for unbiased relevance estimation on search queries with various popularities. Specifically, we theoretically develop a doubly robust estimator with low bias and variance, which intentionally combines the benefits of existing relevance evaluating and learning approaches. We further instantiate the proposed unbiased relevance estimation framework in Baidu search, with comprehensive practical solutions designed regarding the data pipeline for click behavior tracking and online relevance estimation with an approximated deep neural network. Finally, we present extensive empirical evaluations to verify the effectiveness of our proposed framework, finding that it is robust in practice and manages to improve online ranking performance substantially.
引用
收藏
页码:3756 / 3765
页数:10
相关论文
共 50 条
  • [1] Doubly robust estimation of attributable fractions
    Sjolander, Arvid
    Vansteelandt, Stijn
    [J]. BIOSTATISTICS, 2011, 12 (01) : 112 - 121
  • [2] Doubly Robust Estimation of Causal Effects
    Funk, Michele Jonsson
    Westreich, Daniel
    Wiesen, Chris
    Stuermer, Til
    Brookhart, M. Alan
    Davidian, Marie
    [J]. AMERICAN JOURNAL OF EPIDEMIOLOGY, 2011, 173 (07) : 761 - 767
  • [3] On doubly robust estimation of the hazard difference
    Dukes, Oliver
    Martinussen, Torben
    Tchetgen, Eric J. Tchetgen
    Vansteelandt, Stijn
    [J]. BIOMETRICS, 2019, 75 (01) : 100 - 109
  • [4] Doubly robust estimation of the generalized impact fraction
    Taguri, Masataka
    Matsuyama, Yutaka
    Ohashi, Yasuo
    Harada, Akiko
    Ueshima, Hirotsugu
    [J]. BIOSTATISTICS, 2012, 13 (03) : 455 - 467
  • [5] Semiparametric Bayesian doubly robust causal estimation
    Luo, Yu
    Graham, Daniel J.
    McCoy, Emma J.
    [J]. JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2023, 225 : 171 - 187
  • [6] Doubly Robust Estimation of Optimal Dosing Strategies
    Schulz, Juliana
    Moodie, Erica E. M.
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2021, 116 (533) : 256 - 268
  • [7] Bias-Reduced Doubly Robust Estimation
    Vermeulen, Karel
    Vansteelandt, Stijn
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2015, 110 (511) : 1024 - 1036
  • [8] Doubly robust estimation in generalized linear models
    Orsini, Nicola
    Bellocco, Rino
    Sjolander, Arvid
    [J]. STATA JOURNAL, 2013, 13 (01): : 185 - 205
  • [9] Approximate Bayesian Inference for Doubly Robust Estimation
    Graham, Daniel J.
    McCoy, Emma J.
    Stephens, David A.
    [J]. BAYESIAN ANALYSIS, 2016, 11 (01): : 47 - 69
  • [10] Relaxed doubly robust estimation in causal inference
    Xu, Tinghui
    Zhao, Jiwei
    [J]. STATISTICAL THEORY AND RELATED FIELDS, 2024, 8 (01) : 69 - 79