Unbiased Top-k Learning to Rank with Causal Likelihood Decomposition

被引：0

作者：

Zhao, Haiyuan ^{[1
]}

Xu, Jun ^{[2
,4
]}

Zhang, Xiao ^{[2
]}

Cai, Guohao ^{[3
]}

Dong, Zhenhua ^{[3
]}

Wen, Ji-Rong ^{[2
]}

机构：

[1] Renmin Univ China, Sch Informat, Beijing, Peoples R China

[2] Renmin Univ China, Gaoling Sch Artificial Intelligence, Beijing, Peoples R China

[3] Noahs Ark Lab Huawei, Shenzhen, Peoples R China

[4] Minist Educ, Engn Res Ctr Next Generat Intelligent Search & Re, Beijing, Peoples R China

来源：

ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL IN THE ASIA PACIFIC REGION, SIGIR-AP 2023 | 2023年

基金：

中国国家自然科学基金;

关键词：

unbiased learning to rank; position bias; sample selection bias; SAMPLE SELECTION; MODELS; BIAS;

D O I：

10.1145/3624918.3625340

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Unbiased learning to rank methods have been proposed to address biases in search ranking. These biases, known as position bias and sample selection bias, often occur simultaneously in real applications. Existing approaches either tackle these biases separately or treat them as identical, leading to incomplete elimination of both biases. This paper employs a causal graph approach to investigate the mechanisms and interplay between position bias and sample selection bias. The analysis reveals that position bias is a common confounder bias, while sample selection bias falls under the category of collider bias. These biases collectively introduce a cascading process that leads to biased clicks. Based on our analysis, we propose Causal Likelihood Decomposition (CLD), a unified method that effectively mitigates both biases in top-k learning to rank. CLD removes position bias by leveraging propensity scores and then decomposes the likelihood of selection biased data into sample selection bias term and relevance term. By maximizing the overall log-likelihood function, we obtain an unbiased ranking model from the relevance term. We also extend CLD to pairwise neural ranking. Extensive experiments demonstrate that CLD and its pairwise neural extension outperform baseline methods by effectively mitigating both position bias and sample selection bias. The robustness of CLD is further validated through empirical studies considering variations in bias severity and click noise.

引用

页码：129 / 138

页数：10

共 50 条

[21] Transfer learning from rating prediction to Top-k recommendation
Ye, Fan
Lu, Xiaobo
Li, Hongwei
Chen, Zhenyu
[J]. PLOS ONE, 2024, 19 (03):
[22] Location-aware online learning for top-k recommendation
Palovics, Robert
Szalai, Peter
Pap, Julia
Frigo, Erzsebet
Kocsis, Levente
Benczur, Andras A.
[J]. PERVASIVE AND MOBILE COMPUTING, 2017, 38 : 490 - 504
[23] Top-K Oracle: A New Way to Present Top-K Tuples for Uncertain Data
Song, Chunyao
Li, Zheng
Ge, Tingjian
[J]. 2013 IEEE 29TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2013, : 146 - 157
[24] Unbiased Likelihood Backpropagation Learning
Sekino, Masashi
Nitta, Katsumi
[J]. NEURAL INFORMATION PROCESSING, PART I, 2008, 4984 : 446 - 455
[25] Probabilistic Metric Learning with Adaptive Margin for Top-K Recommendation
Ma, Chen
Ma, Liheng
Zhang, Yingxue
Tang, Ruiming
Liu, Xue
Coates, Mark
[J]. KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 1036 - 1044
[26] Learning to Recommend Top-k Items in Online Social Networks
Xing, Xing
Zhang, Weishi
Jia, Zhichun
Zhang, Xiuguo
[J]. PROCEEDINGS OF THE 2012 WORLD CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGIES, 2012, : 1171 - 1176
[27] Reducing Communication for Split Learning by Randomized Top-k Sparsification
Zheng, Fei
Chen, Chaochao
Lyu, Lingjuan
Yao, Binhui
[J]. PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 4665 - 4673
[28] Top-k spatial joins
Zhu, ML
Papadias, D
Zhang, J
Lee, DL
[J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (04) : 567 - 579
[29] Top-k φ Correlation Computation
Xiong, Hui
Zhou, Wenjun
Brodie, Mark
Ma, Sheng
[J]. INFORMS JOURNAL ON COMPUTING, 2008, 20 (04) : 539 - 552
[30] Parameterized top-K algorithms
Chen, Jianer
Kanj, Iyad A.
Meng, Jie
Xia, Ge
Zhang, Fenghui
[J]. THEORETICAL COMPUTER SCIENCE, 2013, 470 : 105 - 119

← 1 2 3 4 5 →