Contrastive State Augmentations for Reinforcement Learning-Based Recommender Systems

Cited by: 7
Authors
Ren, Zhaochun [1]
Huang, Na [1]
Wang, Yidan [1]
Ren, Pengjie [1]
Ma, Jun [1]
Lei, Jiahuan [2]
Shi, Xinlei [2]
Luo, Hengliang [2]
Jose, Joemon [3]
Xin, Xin [1]
Affiliations
[1] Shandong Univ, Qingdao, Peoples R China
[2] Meituan, Beijing, Peoples R China
[3] Univ Glasgow, Glasgow, Lanark, Scotland
Keywords
Recommender system; Reinforcement learning; Contrastive learning; Data augmentation; Sequential recommendation; NETWORKS;
DOI
10.1145/3539618.3591656
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Learning reinforcement learning (RL)-based recommenders from historical user-item interaction sequences is vital for generating high-reward recommendations and improving long-term cumulative benefits. However, existing RL recommendation methods have difficulty (i) estimating the value functions of states that are not contained in the offline training data, and (ii) learning effective state representations from implicit user feedback, owing to the lack of contrastive signals. In this work, we propose contrastive state augmentations (CSA) for training RL-based recommender systems. To tackle the first issue, we propose four state augmentation strategies that enlarge the state space of the offline data. The proposed method improves the generalization capability of the recommender by making the RL agent visit the local state regions and ensuring that the learned value functions are similar between the original and augmented states. For the second issue, we introduce contrastive signals between augmented states and states randomly sampled from other sessions to further improve state representation learning. To verify the effectiveness of CSA, we conduct extensive experiments on two publicly accessible datasets and one dataset collected from a real-life e-commerce platform. We also conduct experiments in a simulated environment as the online evaluation setting. Experimental results demonstrate that CSA can effectively improve recommendation performance.
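
The two ideas in the abstract (augmenting states, then contrasting augmented states against states from other sessions) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the abstract does not name the four augmentation strategies, so `mask_items` is a hypothetical stand-in, and `contrastive_loss` is a generic InfoNCE-style loss rather than the paper's actual objective. The value-function consistency term is not covered here.

```python
import math
import random


def mask_items(session, mask_prob, pad_id=0, rng=random):
    """Hypothetical augmentation (not taken from the paper): replace each
    item ID in a session with pad_id with probability mask_prob."""
    return [pad_id if rng.random() < mask_prob else item for item in session]


def cosine(u, v):
    """Cosine similarity between two non-zero vectors of equal length."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def contrastive_loss(anchor, positive, negatives, temperature=0.5):
    """Generic InfoNCE-style loss (an assumed form, not the paper's):
    small when the anchor is closer to the positive (an augmented view of
    the same state) than to the negatives (states from other sessions)."""
    pos = math.exp(cosine(anchor, positive) / temperature)
    neg = sum(math.exp(cosine(anchor, n) / temperature) for n in negatives)
    return -math.log(pos / (pos + neg))
```

In use, a state encoder (not shown) would map the original and masked sessions to vectors, and `contrastive_loss` would be called with the original state's vector as `anchor`, the augmented state's vector as `positive`, and vectors of states sampled from other sessions as `negatives`.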
Pages: 922-931
Page count: 10