Contrastive State Augmentations for Reinforcement Learning-Based Recommender Systems

被引:7
|
作者
Ren, Zhaochun [1 ]
Huang, Na [1 ]
Wang, Yidan [1 ]
Ren, Pengjie [1 ]
Ma, Jun [1 ]
Lei, Jiahuan [2 ]
Shi, Xinlei [2 ]
Luo, Hengliang [2 ]
Jose, Joemon [3 ]
Xin, Xin [1 ]
机构
[1] Shandong Univ, Qingdao, Peoples R China
[2] Meituan, Beijing, Peoples R China
[3] Univ Glasgow, Glasgow, Lanark, Scotland
关键词
Recommender system; Reinforcement learning; Contrastive learning; Data augmentation; Sequential recommendation; NETWORKS;
D O I
10.1145/3539618.3591656
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Learning reinforcement learning (RL)-based recommenders from historical user-item interaction sequences is vital to generate high-reward recommendations and improve long-term cumulative benefits. However, existing RL recommendation methods encounter difficulties (i) to estimate the value functions for states which are not contained in the offline training data, and (ii) to learn effective state representations from user implicit feedback due to the lack of contrastive signals. In this work, we propose contrastive state augmentations (CSA) for the training of RL-based recommender systems. To tackle the first issue, we propose four state augmentation strategies to enlarge the state space of the offline data. The proposed method improves the generalization capability of the recommender by making the RL agent visit the local state regions and ensuring the learned value functions are similar between the original and augmented states. For the second issue, we propose introducing contrastive signals between augmented states and the state randomly sampled from other sessions to improve the state representation learning further. To verify the effectiveness of the proposed CSA, we conduct extensive experiments on two publicly accessible datasets and one dataset collected from a real-life e-commerce platform. We also conduct experiments on a simulated environment as the online evaluation setting. Experimental results demonstrate that CSA can effectively improve recommendation performance.
引用
收藏
页码:922 / 931
页数:10
相关论文
共 50 条
  • [1] REVEAL 2022: Reinforcement Learning-Based Recommender Systems at Scale
    Li, Ying
    Basilico, Justin
    Raimond, Yves
    Dimakopoulou, Maria
    Liaw, Richard
    Bailey, Paige
    PROCEEDINGS OF THE 16TH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2022, 2022, : 684 - 685
  • [2] Reinforcement Learning-based Recommender Systems with Large Language Models for State Reward and Action Modeling
    Wang, Jie
    Karatzoglou, Alexandros
    Arapakis, Ioannis
    Jose, Joemon M.
    PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 375 - 385
  • [3] Adversarial Attacks and Detection on Reinforcement Learning-Based Interactive Recommender Systems
    Cao, Yuanjiang
    Chen, Xiaocong
    Yao, Lina
    Wang, Xianzhi
    Zhang, Wei Emma
    PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020, : 1669 - 1672
  • [4] PARL: Poisoning Attacks Against Reinforcement Learning-based Recommender Systems
    Du, Linkang
    Yuan, Quan
    Chen, Min
    Sun, Mingyang
    Cheng, Peng
    Chen, Jiming
    Zhang, Zhikun
    PROCEEDINGS OF THE 19TH ACM ASIA CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, ACM ASIACCS 2024, 2024, : 1331 - 1344
  • [5] Reinforcement Learning based Recommender Systems: A Survey
    Afsar, M. Mehdi
    Crump, Trafford
    Far, Behrouz
    ACM COMPUTING SURVEYS, 2023, 55 (07)
  • [6] Hyperparameter Learning for Deep Learning-Based Recommender Systems
    Wu, Di
    Sun, Bo
    Shang, Mingsheng
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2023, 16 (04) : 2699 - 2712
  • [7] Graph Contrastive Learning with Augmentations
    You, Yuning
    Chen, Tianlong
    Sui, Yongduo
    Chen, Ting
    Wang, Zhangyang
    Shen, Yang
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [8] Contrastive Learning With Stronger Augmentations
    Wang, Xiao
    Qi, Guo-Jun
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (05) : 5549 - 5560
  • [9] State-of-the-Art Survey on Deep Learning-Based Recommender Systems for E-Learning
    Salau, Latifat
    Hamada, Mohamed
    Prasad, Rajesh
    Hassan, Mohammed
    Mahendran, Anand
    Watanobe, Yutaka
    APPLIED SCIENCES-BASEL, 2022, 12 (23):
  • [10] Multiobjective Evaluation of Reinforcement Learning Based Recommender Systems
    Grishanov, Alexey
    Ianinat, Anastasia
    Vorontsov, Konstantin
    PROCEEDINGS OF THE 16TH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2022, 2022, : 622 - 627