Training Recommenders Over Large Item Corpus With Importance Sampling

被引:0
|
作者
Lian, Defu [1 ]
Gao, Zhenguo [2 ]
Song, Xia [2 ]
Li, Yucheng [1 ]
Liu, Qi [1 ]
Chen, Enhong [1 ]
机构
[1] Univ Sci & Technol China, Sch Comp Sci & Technol, Hefei 230052, Anhui, Peoples R China
[2] Shanghai Jiao Tong Univ, Sch Math Sci, Shanghai 200240, Peoples R China
基金
中国国家自然科学基金;
关键词
Personalized ranking; cluster-based sampling; implicit feedback; item recommendation;
D O I
10.1109/TKDE.2023.3344657
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
By predicting a personalized ranking on a set of items, item recommendation helps users determine the information they need. While optimizing a ranking-focused loss is more in line with the objectives of item recommendation, previous studies have indicated that current sampling-based ranking methods don't always surpass non-sampling ones. This is because it is either inefficient to sample a pool of representative negatives for better generalization or challenging to gauge their contributions to ranking-focused losses accurately. To this end, we propose a novel weighted ranking loss, which weights each negative with the softmax probability based on model's predictive score. Our theoretical analysis suggests that optimizing this loss boosts the normalized discounted cumulative gain. Furthermore, it appears that this loss acts as an approximate analytic solution for adversarial training of personalized ranking. To improve optimization efficiency, we approximate the weighted ranking loss with self-normalized importance sampling and show that the loss has good generalization properties. To improve generalization, we further develop efficient cluster-based negative samplers based on clustering over item vectors, to decrease approximation error caused by the divergence between the proposal and the target distribution. Comprehensive evaluations on real-world datasets show that our methods remarkably outperform leading item recommendation algorithms.
引用
收藏
页码:9433 / 9447
页数:15
相关论文
共 50 条
  • [31] Training Variational Autoencoders with Discrete Latent Variables Using Importance Sampling
    Bartler, Alexander
    Wiewel, Felix
    Mauch, Lukas
    Yang, Bin
    2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
  • [32] Impact of morphological analysis and a large training corpus on the performances of Arabic diacritization
    Chennoufi, Amine
    Mazroui, Azzeddine
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2016, 19 (02) : 269 - 280
  • [33] Improving Model Training by Periodic Sampling over Weight Distributions
    Tripathi, Samarth
    Liu, Jiayi
    Dhar, Sauptik
    Kurup, Unmesh
    Shah, Mohak
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 112 - 122
  • [34] Importance Sampling for Turbo Codes over Slow Rayleigh Fading Channels
    Sakai, Takakazu
    Shibata, Koji
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2012, E95A (05) : 982 - 985
  • [35] SEQUENTIAL IMPORTANCE SAMPLING FOR ESTIMATING EXPECTATIONS OVER THE SPACE OF PERFECT MATCHINGS
    Alimohammadi, Yeganeh
    Diaconis, Persi
    Roghani, Mohammad
    Saberi, Amin
    ANNALS OF APPLIED PROBABILITY, 2023, 33 (02): : 799 - 833
  • [36] Efficient sampling of training set in large and noisy multimedia data
    Wang, Surong
    Dash, Manoranjan
    Chia, Liang-Tien
    Xu, Min
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2007, 3 (03)
  • [37] Large deviations and importance sampling for a tandem network with slow-down
    Paul Dupuis
    Kevin Leder
    Hui Wang
    Queueing Systems, 2007, 57 : 71 - 83
  • [38] Large deviations and importance sampling for modulated Poisson-Markov processes
    Macci, C
    BOLLETTINO DELLA UNIONE MATEMATICA ITALIANA, 2000, 3A : 117 - 120
  • [39] Large deviations and importance sampling for a tandem network with slow-down
    Dupuis, Paul
    Leder, Kevin
    Wang, Hui
    QUEUEING SYSTEMS, 2007, 57 (2-3) : 71 - 83
  • [40] Importance sampling for simulation of large and moderate deviation probabilities of tests and estimators
    Ermakov, M. S.
    THEORY OF PROBABILITY AND ITS APPLICATIONS, 2007, 51 (02) : 279 - 290