Learning to Rank with Selection Bias in Personal Search

Cited by: 155

Authors:

Wang, Xuanhui [1]

Bendersky, Michael [1]

Metzler, Donald [1]

Najork, Marc [1]

Affiliations:

[1] Google Inc, Mountain View, CA 94043 USA

Source:

SIGIR'16: PROCEEDINGS OF THE 39TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL | 2016

Keywords:

Personal Search; Selection Bias; Learning-to-Rank;

DOI:

10.1145/2911451.2911537

Chinese Library Classification:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

Click-through data has proven to be a critical resource for improving search ranking quality. Though a large amount of click data can be easily collected by search engines, various biases make it difficult to fully leverage this type of data. In the past, many click models have been proposed and successfully used to estimate the relevance for individual query-document pairs in the context of web search. These click models typically require a large quantity of clicks for each individual pair and this makes them difficult to apply in systems where click data is highly sparse due to personalized corpora and information needs, e.g., personal search. In this paper, we study the problem of how to leverage sparse click data in personal search and introduce a novel selection bias problem and address it in the learning-to-rank framework. This paper proposes a few bias estimation methods, including a novel query-dependent one that captures queries with similar results and can successfully deal with sparse data. We empirically demonstrate that learning-to-rank that accounts for query-dependent selection bias yields significant improvements in search effectiveness through online experiments with one of the world's largest personal search engines.
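The abstract does not spell out how the estimated bias enters the ranking loss. As a minimal sketch, assuming the bias is a per-position click propensity and that it is corrected by inverse propensity weighting (a common choice for click-based learning-to-rank, not confirmed by this record), the idea could look like the following. The names `ipw_weights`, `weighted_pairwise_logistic_loss`, and the propensity values are hypothetical and chosen only for illustration.

```python
import math

def ipw_weights(click_positions, propensity_by_position):
    """Return one inverse-propensity weight per clicked position.

    propensity_by_position maps a rank position to the assumed
    probability that a result shown there is examined at all.
    """
    return [1.0 / propensity_by_position[p] for p in click_positions]

def weighted_pairwise_logistic_loss(score_clicked, scores_unclicked, weight):
    """Pairwise logistic loss for one clicked result, scaled by its weight.

    Each unclicked result in the same query forms a pair with the
    clicked one; the loss is small when the clicked result scores higher.
    """
    return weight * sum(
        math.log1p(math.exp(-(score_clicked - s))) for s in scores_unclicked
    )

# Hypothetical propensities: lower positions are examined less often.
propensity = {1: 1.0, 2: 0.5, 3: 0.25}
weights = ipw_weights([1, 3], propensity)
loss = weighted_pairwise_logistic_loss(0.0, [0.0], weights[1])
```

A click at a rarely examined position receives a larger weight, so it counts for more during training than a click at the top position. A query-dependent variant, as the abstract describes, would presumably replace the single `propensity` table with one estimated per group of similar queries.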


Pages: 115 - 124

Page count: 10

50 items in total

[31] Learning to rank query suggestions for adhoc and diversity search
Santos, Rodrygo L. T.
Macdonald, Craig
Ounis, Iadh
INFORMATION RETRIEVAL, 2013, 16 (04) : 429 - 451
[32] Learning to rank code examples for code search engines
Niu, Haoran
Keivanloo, Iman
Zou, Ying
EMPIRICAL SOFTWARE ENGINEERING, 2017, 22 (01) : 259 - 291
[33] Learning to Rank Personalized Search Results in Professional Networks
Ha-Thuc, Viet
Sinha, Shakti
SIGIR'16: PROCEEDINGS OF THE 39TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2016, : 461 - 462
[34] Improving Medical Search Tasks Using Learning to Rank
Alsulmi, Mohammad
Carterette, Ben
2018 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY (CIBCB), 2018, : 47 - 54
[35] A Large Scale Search Dataset for Unbiased Learning to Rank
Zou, Lixin
Mao, Haitao
Chu, Xiaokai
Tang, Jiliang
Wang, Shuaiqiang
Ye, Wenwen
Yin, Dawei
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[36] Learning to rank code examples for code search engines
Haoran Niu
Iman Keivanloo
Ying Zou
Empirical Software Engineering, 2017, 22 : 259 - 291
[37] Learning to Rank for Consumer Health Search: A Semantic Approach
Soldaini, Luca
Goharian, Nazli
ADVANCES IN INFORMATION RETRIEVAL, ECIR 2017, 2017, 10193 : 640 - 646
[38] On Application of Learning to Rank for E-Commerce Search
Santu, Shubhra Kanti Karmaker
Sondhi, Parikshit
Zhai, ChengXiang
SIGIR'17: PROCEEDINGS OF THE 40TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2017, : 475 - 484
[39] Image Search Reranking with Transductive Learning to Rank Framework
Zhang, Jing
Jing, Peiguang
Ji, Zhong
Su, Yuting
INFORMATION COMPUTING AND APPLICATIONS, PT II, 2011, 244 : 529 - 536
[40] Learning with Sparse and Biased Feedback for Personal Search
Bendersky, Michael
Wang, Xuanhui
Najork, Marc
Metzler, Donald
PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 5219 - 5223