Learning to Rank with Selection Bias in Personal Search

被引:155
|
作者
Wang, Xuanhui [1 ]
Bendersky, Michael [1 ]
Metzler, Donald [1 ]
Najork, Marc [1 ]
机构
[1] Google Inc, Mountain View, CA 94043 USA
关键词
Personal Search; Selection Bias; Learning-to-Rank;
D O I
10.1145/2911451.2911537
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Click-through data has proven to be a critical resource for improving search ranking quality. Though a large amount of click data can be easily collected by search engines, various biases make it difficult to fully leverage this type of data. In the past, many click models have been proposed and successfully used to estimate the relevance for individual query-document pairs in the context of web search. These click models typically require a large quantity of clicks for each individual pair and this makes them difficult to apply in systems where click data is highly sparse due to personalized corpora and information needs, e.g., personal search. In this paper, we study the problem of how to leverage sparse click data in personal search and introduce a novel selection bias problem and address it in the learning-to-rank framework. This paper proposes a few bias estimation methods, including a novel query-dependent one that captures queries with similar results and can successfully deal with sparse data. We empirically demonstrate that learning-to-rank that accounts for query-dependent selection bias yields significant improvements in search effectiveness through online experiments with one of the world's largest personal search engines.
引用
收藏
页码:115 / 124
页数:10
相关论文
共 50 条
  • [41] Sampling Bias Due to Near-Duplicates in Learning to Rank
    Froebe, Maik
    Bevendorff, Janek
    Reimer, Jan Heinrich
    Potthast, Martin
    Hagen, Matthias
    PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020, : 1997 - 2000
  • [42] ADVERSE SELECTION, LEARNING, AND COMPETITIVE SEARCH
    Mayr-Dorn, Karin
    INTERNATIONAL ECONOMIC REVIEW, 2023, 64 (01) : 129 - 153
  • [43] Care to Share? Learning to Rank Personal Photos for Public Sharing
    Guy, Ido
    Nus, Alexander
    Pelleg, Dan
    Szpektor, Idan
    WSDM'18: PROCEEDINGS OF THE ELEVENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2018, : 207 - 215
  • [44] Automatic selection of learning bias for active sampling
    dos Santos, Davi P.
    de Carvalho, Andre C. P. L. F.
    PROCEEDINGS OF 2016 5TH BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS 2016), 2016, : 55 - 60
  • [45] Feature Selection Using Tabu Search with Learning Memory: Learning Tabu Search
    Mousin, Lucien
    Jourdan, Laetitia
    Marmion, Marie-Eleonore Kessaci
    Dhaenens, Clarisse
    LEARNING AND INTELLIGENT OPTIMIZATION (LION 10), 2016, 10079 : 141 - 156
  • [46] Positive Unlabeled Learning with a Sequential Selection Bias
    Gerych, Walter
    Hartvigsen, Tom
    Buquicchio, Luke
    Alajaji, Abdulaziz
    Chandrasekaran, Kavin
    Mansoor, Hamid
    Rundensteiner, Elke
    Agu, Emmanuel
    PROCEEDINGS OF THE 2022 SIAM INTERNATIONAL CONFERENCE ON DATA MINING, SDM, 2022, : 19 - 27
  • [47] Selection Bias in News Coverage: Learning it, Fighting it
    Bourgeois, Dylan
    Rappaz, Jeremie
    Aberer, Karl
    COMPANION PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2018 (WWW 2018), 2018, : 535 - 543
  • [48] Meta learning application in rank aggregation feature selection
    Smetannikov, Ivan
    Deyneka, Alexander
    Filchenkov, Andrey
    2016 3RD INTERNATIONAL CONFERENCE ON SOFT COMPUTING & MACHINE INTELLIGENCE (ISCMI 2016), 2016, : 120 - 123
  • [49] Feature Selection for Analogy-Based Learning to Rank
    Fahandar, Mohsen Ahmadi
    Huellermeier, Eyke
    DISCOVERY SCIENCE (DS 2019), 2019, 11828 : 279 - 289
  • [50] Drug Selection via Joint Push and Learning to Rank
    He, Yicheng
    Liu, Junfeng
    Ning, Xia
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2020, 17 (01) : 110 - 123