Training Recommenders Over Large Item Corpus With Importance Sampling

被引：0

作者：

Lian, Defu ^{[1
]}

Gao, Zhenguo ^{[2
]}

Song, Xia ^{[2
]}

Li, Yucheng ^{[1
]}

Liu, Qi ^{[1
]}

Chen, Enhong ^{[1
]}

机构：

[1] Univ Sci & Technol China, Sch Comp Sci & Technol, Hefei 230052, Anhui, Peoples R China

[2] Shanghai Jiao Tong Univ, Sch Math Sci, Shanghai 200240, Peoples R China

来源：

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING | 2024年 / 36卷 / 12期

基金：

中国国家自然科学基金;

关键词：

Personalized ranking; cluster-based sampling; implicit feedback; item recommendation;

D O I：

10.1109/TKDE.2023.3344657

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

By predicting a personalized ranking on a set of items, item recommendation helps users determine the information they need. While optimizing a ranking-focused loss is more in line with the objectives of item recommendation, previous studies have indicated that current sampling-based ranking methods don't always surpass non-sampling ones. This is because it is either inefficient to sample a pool of representative negatives for better generalization or challenging to gauge their contributions to ranking-focused losses accurately. To this end, we propose a novel weighted ranking loss, which weights each negative with the softmax probability based on model's predictive score. Our theoretical analysis suggests that optimizing this loss boosts the normalized discounted cumulative gain. Furthermore, it appears that this loss acts as an approximate analytic solution for adversarial training of personalized ranking. To improve optimization efficiency, we approximate the weighted ranking loss with self-normalized importance sampling and show that the loss has good generalization properties. To improve generalization, we further develop efficient cluster-based negative samplers based on clustering over item vectors, to decrease approximation error caused by the divergence between the proposal and the target distribution. Comprehensive evaluations on real-world datasets show that our methods remarkably outperform leading item recommendation algorithms.

引用

页码：9433 / 9447

页数：15

共 50 条

[31] Training Variational Autoencoders with Discrete Latent Variables Using Importance Sampling
Bartler, Alexander
Wiewel, Felix
Mauch, Lukas
Yang, Bin
2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
[32] Impact of morphological analysis and a large training corpus on the performances of Arabic diacritization
Chennoufi, Amine
Mazroui, Azzeddine
INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2016, 19 (02) : 269 - 280
[33] Improving Model Training by Periodic Sampling over Weight Distributions
Tripathi, Samarth
Liu, Jiayi
Dhar, Sauptik
Kurup, Unmesh
Shah, Mohak
2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 112 - 122
[34] Importance Sampling for Turbo Codes over Slow Rayleigh Fading Channels
Sakai, Takakazu
Shibata, Koji
IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2012, E95A (05) : 982 - 985
[35] SEQUENTIAL IMPORTANCE SAMPLING FOR ESTIMATING EXPECTATIONS OVER THE SPACE OF PERFECT MATCHINGS
Alimohammadi, Yeganeh
Diaconis, Persi
Roghani, Mohammad
Saberi, Amin
ANNALS OF APPLIED PROBABILITY, 2023, 33 (02): : 799 - 833
[36] Efficient sampling of training set in large and noisy multimedia data
Wang, Surong
Dash, Manoranjan
Chia, Liang-Tien
Xu, Min
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2007, 3 (03)
[37] Large deviations and importance sampling for a tandem network with slow-down
Paul Dupuis
Kevin Leder
Hui Wang
Queueing Systems, 2007, 57 : 71 - 83
[38] Large deviations and importance sampling for modulated Poisson-Markov processes
Macci, C
BOLLETTINO DELLA UNIONE MATEMATICA ITALIANA, 2000, 3A : 117 - 120
[39] Large deviations and importance sampling for a tandem network with slow-down
Dupuis, Paul
Leder, Kevin
Wang, Hui
QUEUEING SYSTEMS, 2007, 57 (2-3) : 71 - 83
[40] Importance sampling for simulation of large and moderate deviation probabilities of tests and estimators
Ermakov, M. S.
THEORY OF PROBABILITY AND ITS APPLICATIONS, 2007, 51 (02) : 279 - 290

← 1 2 3 4 5 →