Learning to Rank with Selection Bias in Personal Search

Cited by: 155
Authors
Wang, Xuanhui [1]
Bendersky, Michael [1]
Metzler, Donald [1]
Najork, Marc [1]
Affiliations
[1] Google Inc, Mountain View, CA 94043 USA
Keywords
Personal Search; Selection Bias; Learning-to-Rank
DOI
10.1145/2911451.2911537
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Click-through data has proven to be a critical resource for improving search ranking quality. Though a large amount of click data can be easily collected by search engines, various biases make it difficult to fully leverage this type of data. In the past, many click models have been proposed and successfully used to estimate the relevance for individual query-document pairs in the context of web search. These click models typically require a large quantity of clicks for each individual pair and this makes them difficult to apply in systems where click data is highly sparse due to personalized corpora and information needs, e.g., personal search. In this paper, we study the problem of how to leverage sparse click data in personal search and introduce a novel selection bias problem and address it in the learning-to-rank framework. This paper proposes a few bias estimation methods, including a novel query-dependent one that captures queries with similar results and can successfully deal with sparse data. We empirically demonstrate that learning-to-rank that accounts for query-dependent selection bias yields significant improvements in search effectiveness through online experiments with one of the world's largest personal search engines.
Pages: 115-124
Page count: 10