Contrastive State Augmentations for Reinforcement Learning-Based Recommender Systems

被引:7
|
作者
Ren, Zhaochun [1 ]
Huang, Na [1 ]
Wang, Yidan [1 ]
Ren, Pengjie [1 ]
Ma, Jun [1 ]
Lei, Jiahuan [2 ]
Shi, Xinlei [2 ]
Luo, Hengliang [2 ]
Jose, Joemon [3 ]
Xin, Xin [1 ]
机构
[1] Shandong Univ, Qingdao, Peoples R China
[2] Meituan, Beijing, Peoples R China
[3] Univ Glasgow, Glasgow, Lanark, Scotland
关键词
Recommender system; Reinforcement learning; Contrastive learning; Data augmentation; Sequential recommendation; NETWORKS;
D O I
10.1145/3539618.3591656
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Learning reinforcement learning (RL)-based recommenders from historical user-item interaction sequences is vital to generate high-reward recommendations and improve long-term cumulative benefits. However, existing RL recommendation methods encounter difficulties (i) to estimate the value functions for states which are not contained in the offline training data, and (ii) to learn effective state representations from user implicit feedback due to the lack of contrastive signals. In this work, we propose contrastive state augmentations (CSA) for the training of RL-based recommender systems. To tackle the first issue, we propose four state augmentation strategies to enlarge the state space of the offline data. The proposed method improves the generalization capability of the recommender by making the RL agent visit the local state regions and ensuring the learned value functions are similar between the original and augmented states. For the second issue, we propose introducing contrastive signals between augmented states and the state randomly sampled from other sessions to improve the state representation learning further. To verify the effectiveness of the proposed CSA, we conduct extensive experiments on two publicly accessible datasets and one dataset collected from a real-life e-commerce platform. We also conduct experiments on a simulated environment as the online evaluation setting. Experimental results demonstrate that CSA can effectively improve recommendation performance.
引用
收藏
页码:922 / 931
页数:10
相关论文
共 50 条
  • [21] Contrastive Initial State Buffer for Reinforcement Learning
    Messikommer, Nico
    Song, Yunlong
    Scaramuzza, Davide
    2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2024, 2024, : 2866 - 2872
  • [22] Augmentations in Hypergraph Contrastive Learning: Fabricated and Generative
    Wei, Tianxin
    You, Yuning
    Chen, Tianlong
    Shen, Yang
    He, Jingrui
    Wang, Zhangyang
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [23] Toward Auto-Learning Hyperparameters for Deep Learning-Based Recommender Systems
    Sun, Bo
    Wu, Di
    Shang, Mingsheng
    He, Yi
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2022, PT II, 2022, : 323 - 331
  • [24] Ethereum Phishing Scams Detection Based on Graph Contrastive Learning with Augmentations
    Chen, Yongxin
    Hou, Wenhan
    Zhang, Xin
    Li, Ru
    PROCEEDINGS OF THE 2024 27 TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024, : 2047 - 2052
  • [25] PyRecGym: A Reinforcement Learning Gym for Recommender Systems
    Shi, Bichen
    Ozsoy, Makbule Gulcin
    Hurley, Neil
    Smyth, Barry
    Tragos, Elias Z.
    Geraci, James
    Lawlor, Aonghus
    RECSYS 2019: 13TH ACM CONFERENCE ON RECOMMENDER SYSTEMS, 2019, : 491 - 495
  • [26] User Tampering in Reinforcement Learning Recommender Systems
    Kasirzadeh, Atoosa
    Evans, Charles
    PROCEEDINGS OF THE 2023 AAAI/ACM CONFERENCE ON AI, ETHICS, AND SOCIETY, AIES 2023, 2023, : 58 - 69
  • [27] Adversarial Robustness of Deep Reinforcement Learning Based Dynamic Recommender Systems
    Wang, Siyu
    Cao, Yuanjiang
    Chen, Xiaocong
    Yao, Lina
    Wang, Xianzhi
    Sheng, Quan Z.
    FRONTIERS IN BIG DATA, 2022, 5
  • [28] Analysis of Augmentations for Contrastive ECG Representation Learning
    Soltanieh, Sahar
    Etemad, Ali
    Hashemi, Javad
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [29] A machine learning-based hybrid recommender framework for smart medical systems
    Wei, Jianhua
    Yan, Honglin
    Shao, Xiaoli
    Zhao, Lili
    Han, Lin
    Yan, Peng
    Wang, Shengyu
    PEERJ COMPUTER SCIENCE, 2024, 10
  • [30] Deep Learning-Based Sequential Recommender Systems: Concepts, Algorithms, and Evaluations
    Fang, Hui
    Guo, Guibing
    Zhang, Danning
    Shu, Yiheng
    WEB ENGINEERING (ICWE 2019), 2019, 11496 : 574 - 577