Unbiased Top-k Learning to Rank with Causal Likelihood Decomposition

被引:0
|
作者
Zhao, Haiyuan [1 ]
Xu, Jun [2 ,4 ]
Zhang, Xiao [2 ]
Cai, Guohao [3 ]
Dong, Zhenhua [3 ]
Wen, Ji-Rong [2 ]
机构
[1] Renmin Univ China, Sch Informat, Beijing, Peoples R China
[2] Renmin Univ China, Gaoling Sch Artificial Intelligence, Beijing, Peoples R China
[3] Noahs Ark Lab Huawei, Shenzhen, Peoples R China
[4] Minist Educ, Engn Res Ctr Next Generat Intelligent Search & Re, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
unbiased learning to rank; position bias; sample selection bias; SAMPLE SELECTION; MODELS; BIAS;
D O I
10.1145/3624918.3625340
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Unbiased learning to rank methods have been proposed to address biases in search ranking. These biases, known as position bias and sample selection bias, often occur simultaneously in real applications. Existing approaches either tackle these biases separately or treat them as identical, leading to incomplete elimination of both biases. This paper employs a causal graph approach to investigate the mechanisms and interplay between position bias and sample selection bias. The analysis reveals that position bias is a common confounder bias, while sample selection bias falls under the category of collider bias. These biases collectively introduce a cascading process that leads to biased clicks. Based on our analysis, we propose Causal Likelihood Decomposition (CLD), a unified method that effectively mitigates both biases in top-k learning to rank. CLD removes position bias by leveraging propensity scores and then decomposes the likelihood of selection biased data into sample selection bias term and relevance term. By maximizing the overall log-likelihood function, we obtain an unbiased ranking model from the relevance term. We also extend CLD to pairwise neural ranking. Extensive experiments demonstrate that CLD and its pairwise neural extension outperform baseline methods by effectively mitigating both position bias and sample selection bias. The robustness of CLD is further validated through empirical studies considering variations in bias severity and click noise.
引用
收藏
页码:129 / 138
页数:10
相关论文
共 50 条
  • [21] Transfer learning from rating prediction to Top-k recommendation
    Ye, Fan
    Lu, Xiaobo
    Li, Hongwei
    Chen, Zhenyu
    [J]. PLOS ONE, 2024, 19 (03):
  • [22] Location-aware online learning for top-k recommendation
    Palovics, Robert
    Szalai, Peter
    Pap, Julia
    Frigo, Erzsebet
    Kocsis, Levente
    Benczur, Andras A.
    [J]. PERVASIVE AND MOBILE COMPUTING, 2017, 38 : 490 - 504
  • [23] Top-K Oracle: A New Way to Present Top-K Tuples for Uncertain Data
    Song, Chunyao
    Li, Zheng
    Ge, Tingjian
    [J]. 2013 IEEE 29TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2013, : 146 - 157
  • [24] Unbiased Likelihood Backpropagation Learning
    Sekino, Masashi
    Nitta, Katsumi
    [J]. NEURAL INFORMATION PROCESSING, PART I, 2008, 4984 : 446 - 455
  • [25] Probabilistic Metric Learning with Adaptive Margin for Top-K Recommendation
    Ma, Chen
    Ma, Liheng
    Zhang, Yingxue
    Tang, Ruiming
    Liu, Xue
    Coates, Mark
    [J]. KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 1036 - 1044
  • [26] Learning to Recommend Top-k Items in Online Social Networks
    Xing, Xing
    Zhang, Weishi
    Jia, Zhichun
    Zhang, Xiuguo
    [J]. PROCEEDINGS OF THE 2012 WORLD CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGIES, 2012, : 1171 - 1176
  • [27] Reducing Communication for Split Learning by Randomized Top-k Sparsification
    Zheng, Fei
    Chen, Chaochao
    Lyu, Lingjuan
    Yao, Binhui
    [J]. PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 4665 - 4673
  • [28] Top-k spatial joins
    Zhu, ML
    Papadias, D
    Zhang, J
    Lee, DL
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (04) : 567 - 579
  • [29] Top-k φ Correlation Computation
    Xiong, Hui
    Zhou, Wenjun
    Brodie, Mark
    Ma, Sheng
    [J]. INFORMS JOURNAL ON COMPUTING, 2008, 20 (04) : 539 - 552
  • [30] Parameterized top-K algorithms
    Chen, Jianer
    Kanj, Iyad A.
    Meng, Jie
    Xia, Ge
    Zhang, Fenghui
    [J]. THEORETICAL COMPUTER SCIENCE, 2013, 470 : 105 - 119