Unbiased Top-k Learning to Rank with Causal Likelihood Decomposition

被引:0
|
作者
Zhao, Haiyuan [1 ]
Xu, Jun [2 ,4 ]
Zhang, Xiao [2 ]
Cai, Guohao [3 ]
Dong, Zhenhua [3 ]
Wen, Ji-Rong [2 ]
机构
[1] Renmin Univ China, Sch Informat, Beijing, Peoples R China
[2] Renmin Univ China, Gaoling Sch Artificial Intelligence, Beijing, Peoples R China
[3] Noahs Ark Lab Huawei, Shenzhen, Peoples R China
[4] Minist Educ, Engn Res Ctr Next Generat Intelligent Search & Re, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
unbiased learning to rank; position bias; sample selection bias; SAMPLE SELECTION; MODELS; BIAS;
D O I
10.1145/3624918.3625340
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Unbiased learning to rank methods have been proposed to address biases in search ranking. These biases, known as position bias and sample selection bias, often occur simultaneously in real applications. Existing approaches either tackle these biases separately or treat them as identical, leading to incomplete elimination of both biases. This paper employs a causal graph approach to investigate the mechanisms and interplay between position bias and sample selection bias. The analysis reveals that position bias is a common confounder bias, while sample selection bias falls under the category of collider bias. These biases collectively introduce a cascading process that leads to biased clicks. Based on our analysis, we propose Causal Likelihood Decomposition (CLD), a unified method that effectively mitigates both biases in top-k learning to rank. CLD removes position bias by leveraging propensity scores and then decomposes the likelihood of selection biased data into sample selection bias term and relevance term. By maximizing the overall log-likelihood function, we obtain an unbiased ranking model from the relevance term. We also extend CLD to pairwise neural ranking. Extensive experiments demonstrate that CLD and its pairwise neural extension outperform baseline methods by effectively mitigating both position bias and sample selection bias. The robustness of CLD is further validated through empirical studies considering variations in bias severity and click noise.
引用
收藏
页码:129 / 138
页数:10
相关论文
共 50 条
  • [1] Policy-Aware Unbiased Learning to Rank for Top-k Rankings
    Oosterhuis, Harrie
    de Rijke, Maarten
    [J]. PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020, : 489 - 498
  • [2] Online Learning to Rank with Top-k Feedback
    Chaudhuri, Sougata
    Tewari, Ambuj
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2017, 18
  • [3] Top-k Learning to Rank: Labeling, Ranking and Evaluation
    Niu, Shuzi
    Guo, Jiafeng
    Lan, Yanyan
    Cheng, Xueqi
    [J]. SIGIR 2012: PROCEEDINGS OF THE 35TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2012, : 751 - 760
  • [4] Active Learning for Top-K Rank Aggregation from Noisy Comparisons
    Mohajer, Soheil
    Suh, Changho
    Elmandy, Adel
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [5] Differentiable Top-k Classification Learning
    Petersen, Felix
    Kuehne, Hilde
    Borgelt, Christian
    Deussen, Oliver
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [6] A Top-k Learning to Rank Approach to Cross-Project Software Defect Prediction
    Wang, Feng
    Huang, Jinxiao
    Ma, Yutao
    [J]. 2018 25TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE (APSEC 2018), 2018, : 335 - 344
  • [7] Deep Metric Learning Based on Rank-sensitive Optimization of Top-k Precision
    Muramoto, Naoki
    Yu, Hai-Tao
    [J]. CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 2161 - 2164
  • [8] Learning with Average Top-k Loss
    Fan, Yanbo
    Lyu, Siwei
    Ying, Yiming
    Hu, Bao-Gang
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [9] WavingSketch: an unbiased and generic sketch for finding top-k items in data streams
    Liu, Zirui
    Dong, Fenghao
    Liu, Chengwu
    Deng, Xiangwei
    Yang, Tong
    Zhao, Yikai
    Li, Jizhou
    Cui, Bin
    Zhang, Gong
    [J]. VLDB JOURNAL, 2024, 33 (05): : 1697 - 1722
  • [10] WavingSketch An Unbiased and Generic Sketch for Finding Top-k Items in Data Streams
    Li, Jizhou
    Li, Zikun
    Xu, Yifei
    Jiang, Shiqi
    Yang, Tong
    Cui, Bin
    Dai, Yafei
    Zhang, Gong
    [J]. KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 1574 - 1584