Contrastive State Augmentations for Reinforcement Learning-Based Recommender Systems

被引：7

作者：

Ren, Zhaochun ^{[1
]}

Huang, Na ^{[1
]}

Wang, Yidan ^{[1
]}

Ren, Pengjie ^{[1
]}

Ma, Jun ^{[1
]}

Lei, Jiahuan ^{[2
]}

Shi, Xinlei ^{[2
]}

Luo, Hengliang ^{[2
]}

Jose, Joemon ^{[3
]}

Xin, Xin ^{[1
]}

机构：

[1] Shandong Univ, Qingdao, Peoples R China

[2] Meituan, Beijing, Peoples R China

[3] Univ Glasgow, Glasgow, Lanark, Scotland

来源：

PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023 | 2023年

关键词：

Recommender system; Reinforcement learning; Contrastive learning; Data augmentation; Sequential recommendation; NETWORKS;

D O I：

10.1145/3539618.3591656

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Learning reinforcement learning (RL)-based recommenders from historical user-item interaction sequences is vital to generate high-reward recommendations and improve long-term cumulative benefits. However, existing RL recommendation methods encounter difficulties (i) to estimate the value functions for states which are not contained in the offline training data, and (ii) to learn effective state representations from user implicit feedback due to the lack of contrastive signals. In this work, we propose contrastive state augmentations (CSA) for the training of RL-based recommender systems. To tackle the first issue, we propose four state augmentation strategies to enlarge the state space of the offline data. The proposed method improves the generalization capability of the recommender by making the RL agent visit the local state regions and ensuring the learned value functions are similar between the original and augmented states. For the second issue, we propose introducing contrastive signals between augmented states and the state randomly sampled from other sessions to improve the state representation learning further. To verify the effectiveness of the proposed CSA, we conduct extensive experiments on two publicly accessible datasets and one dataset collected from a real-life e-commerce platform. We also conduct experiments on a simulated environment as the online evaluation setting. Experimental results demonstrate that CSA can effectively improve recommendation performance.

引用

页码：922 / 931

页数：10

共 50 条

[1] REVEAL 2022: Reinforcement Learning-Based Recommender Systems at Scale
Li, Ying
Basilico, Justin
Raimond, Yves
Dimakopoulou, Maria
Liaw, Richard
Bailey, Paige
PROCEEDINGS OF THE 16TH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2022, 2022, : 684 - 685
[2] Reinforcement Learning-based Recommender Systems with Large Language Models for State Reward and Action Modeling
Wang, Jie
Karatzoglou, Alexandros
Arapakis, Ioannis
Jose, Joemon M.
PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 375 - 385
[3] Adversarial Attacks and Detection on Reinforcement Learning-Based Interactive Recommender Systems
Cao, Yuanjiang
Chen, Xiaocong
Yao, Lina
Wang, Xianzhi
Zhang, Wei Emma
PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020, : 1669 - 1672
[4] PARL: Poisoning Attacks Against Reinforcement Learning-based Recommender Systems
Du, Linkang
Yuan, Quan
Chen, Min
Sun, Mingyang
Cheng, Peng
Chen, Jiming
Zhang, Zhikun
PROCEEDINGS OF THE 19TH ACM ASIA CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, ACM ASIACCS 2024, 2024, : 1331 - 1344
[5] Reinforcement Learning based Recommender Systems: A Survey
Afsar, M. Mehdi
Crump, Trafford
Far, Behrouz
ACM COMPUTING SURVEYS, 2023, 55 (07)
[6] Hyperparameter Learning for Deep Learning-Based Recommender Systems
Wu, Di
Sun, Bo
Shang, Mingsheng
IEEE TRANSACTIONS ON SERVICES COMPUTING, 2023, 16 (04) : 2699 - 2712
[7] Graph Contrastive Learning with Augmentations
You, Yuning
Chen, Tianlong
Sui, Yongduo
Chen, Ting
Wang, Zhangyang
Shen, Yang
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
[8] Contrastive Learning With Stronger Augmentations
Wang, Xiao
Qi, Guo-Jun
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (05) : 5549 - 5560
[9] State-of-the-Art Survey on Deep Learning-Based Recommender Systems for E-Learning
Salau, Latifat
Hamada, Mohamed
Prasad, Rajesh
Hassan, Mohammed
Mahendran, Anand
Watanobe, Yutaka
APPLIED SCIENCES-BASEL, 2022, 12 (23):
[10] Multiobjective Evaluation of Reinforcement Learning Based Recommender Systems
Grishanov, Alexey
Ianinat, Anastasia
Vorontsov, Konstantin
PROCEEDINGS OF THE 16TH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2022, 2022, : 622 - 627

← 1 2 3 4 5 →