Contrastive State Augmentations for Reinforcement Learning-Based Recommender Systems

Cited by: 7
Authors
Ren, Zhaochun [1]
Huang, Na [1]
Wang, Yidan [1]
Ren, Pengjie [1]
Ma, Jun [1]
Lei, Jiahuan [2]
Shi, Xinlei [2]
Luo, Hengliang [2]
Jose, Joemon [3]
Xin, Xin [1]
Affiliations
[1] Shandong University, Qingdao, People's Republic of China
[2] Meituan, Beijing, People's Republic of China
[3] University of Glasgow, Glasgow, Lanark, Scotland
Keywords
Recommender system; Reinforcement learning; Contrastive learning; Data augmentation; Sequential recommendation; Networks
DOI
10.1145/3539618.3591656
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Learning reinforcement learning (RL)-based recommenders from historical user-item interaction sequences is vital for generating high-reward recommendations and improving long-term cumulative benefits. However, existing RL recommendation methods have difficulty (i) estimating value functions for states that are not contained in the offline training data, and (ii) learning effective state representations from implicit user feedback, owing to the lack of contrastive signals. In this work, we propose contrastive state augmentations (CSA) for training RL-based recommender systems. To tackle the first issue, we propose four state augmentation strategies that enlarge the state space of the offline data. The proposed method improves the generalization capability of the recommender by making the RL agent visit local state regions and by ensuring that the learned value functions are similar between the original and augmented states. For the second issue, we introduce contrastive signals between augmented states and states randomly sampled from other sessions to further improve state representation learning. To verify the effectiveness of CSA, we conduct extensive experiments on two publicly accessible datasets and one dataset collected from a real-life e-commerce platform. We also conduct experiments in a simulated environment as the online evaluation setting. Experimental results demonstrate that CSA can effectively improve recommendation performance.
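The two ideas in the abstract (value consistency between original and augmented states, and a contrastive signal against states from other sessions) can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not name its four augmentation strategies, so the masking and cropping functions, the mean-embedding encoder, the linear Q-head, the InfoNCE-style loss, and every name and constant below are assumptions chosen only for illustration.

```python
import math
import random

PAD = 0  # hypothetical mask/padding id, not taken from the paper


def mask_items(state, ratio, rng):
    """Assumed augmentation: replace each item with PAD with probability `ratio`."""
    return [PAD if rng.random() < ratio else item for item in state]


def crop_items(state, length, rng):
    """Assumed augmentation: keep a random contiguous window of the sequence."""
    length = min(length, len(state))
    start = rng.randrange(len(state) - length + 1)
    return state[start:start + length]


def encode(state, emb, dim):
    """Toy state encoder: mean of item embeddings, ignoring PAD."""
    items = [i for i in state if i != PAD]
    if not items:
        return [0.0] * dim
    return [sum(emb[i][d] for i in items) / len(items) for d in range(dim)]


def q_values(rep, weights):
    """Toy linear Q-head: one score per candidate item (action)."""
    return [sum(w * x for w, x in zip(row, rep)) for row in weights]


def consistency_loss(q_orig, q_aug):
    """Mean squared gap between Q-values of the original and augmented state."""
    return sum((a - b) ** 2 for a, b in zip(q_orig, q_aug)) / len(q_orig)


def cosine(u, v):
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def contrastive_loss(anchor, positive, negatives, temperature=0.5):
    """InfoNCE-style loss: pull two views together, push other sessions away."""
    pos = math.exp(cosine(anchor, positive) / temperature)
    neg = sum(math.exp(cosine(anchor, n) / temperature) for n in negatives)
    return -math.log(pos / (pos + neg))


rng = random.Random(0)
DIM, N_ITEMS = 4, 10
emb = {i: [rng.uniform(-1, 1) for _ in range(DIM)] for i in range(1, N_ITEMS + 1)}
weights = [[rng.uniform(-1, 1) for _ in range(DIM)] for _ in range(N_ITEMS)]

state = [3, 7, 2, 9, 5]           # items interacted with in one session
other_session = [1, 4, 8, 6, 10]  # a state sampled from another session

masked = mask_items(state, 0.3, rng)
cropped = crop_items(state, 3, rng)

rep, rep_m, rep_c = (encode(s, emb, DIM) for s in (state, masked, cropped))
rep_neg = encode(other_session, emb, DIM)

# Term 1: Q-values should stay close between the original and augmented state.
value_term = consistency_loss(q_values(rep, weights), q_values(rep_m, weights))
# Term 2: two views of the same state are positives; the other session is a negative.
contrast_term = contrastive_loss(rep_m, rep_c, [rep_neg])
print(value_term, contrast_term)
```

In a real training loop, both terms would presumably be added to the RL objective with tunable weights and minimized by gradient descent over a learned sequence encoder; the fixed random embeddings here only make the sketch runnable.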
Pages: 922-931
Number of pages: 10