Large-Scale Interactive Recommendation With Tree-Structured Reinforcement Learning

被引：4

作者：

Chen, Haokun ^{[1
,2
]}

Zhu, Chenxu ^{[1
]}

Tang, Ruiming ^{[3
]}

Zhang, Weinan ^{[1
]}

He, Xiuqiang ^{[3
]}

Yu, Yong ^{[1
]}

机构：

[1] Shanghai Jiao Tong Univ, Shanghai 200240, Peoples R China

[2] Alibaba Grp, Hangzhou 310052, Peoples R China

[3] Huawei Noahs Ark Lab, Shenzhen 518129, Peoples R China

来源：

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING | 2023年 / 35卷 / 04期

基金：

中国国家自然科学基金;

关键词：

Reinforcement learning; Predictive models; interactive recommender system; large-scale recommendation; cold-start; ALGORITHMS;

D O I：

10.1109/TKDE.2021.3137310

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Although reinforcement learning (RL) techniques are regarded as promising solutions for interactive recommender systems (IRS), such solutions still face three main challenges, namely, i) time inefficiency when handling large discrete action space in IRS, ii) inability to deal with the cold-start scenarios in IRS, iii) data inefficiency during training the RL-based methods. To tackle these challenges, we propose a generic tree-structured RL framework taking both policy-based and value-based approaches into consideration. We propose to construct a balanced tree over representations of the items, such that picking an item is formulated as seeking a suitable path from the root to a leaf node in the balanced tree, which dramatically reduces the time complexity of item recommendation. Further, for cold-start scenarios where prior information of the items is unavailable, we initialize a random balanced tree as the starting point and then refine the tree structure based on the learned item representations. Besides, we also incorporate a user modeling component to explicitly model the environment, which can be utilized in the training phase to improve data efficiency. Extensive experiments on two real-world datasets are conducted and demonstrate that our framework can achieve superior recommendation performance and provide time and data efficiency improvement over state-of-the-art methods in both warm-start and cold-start IRS scenarios.

引用

页码：4018 / 4032

页数：15

共 50 条

[31] Algorithms or Actions? A Study in Large-Scale Reinforcement Learning
Tavares, Anderson Rocha
Anbalagan, Sivasubramanian
Marcolino, Leandro Soriano
Chaimowicz, Luiz
PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 2717 - 2723
[32] Deep Reinforcement Learning for Large-Scale Epidemic Control
Libin, Pieter J. K.
Moonens, Arno
Verstraeten, Timothy
Perez-Sanjines, Fabian
Hens, Niel
Lemey, Philippe
Nowe, Ann
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: APPLIED DATA SCIENCE AND DEMO TRACK, ECML PKDD 2020, PT V, 2021, 12461 : 155 - 170
[33] A Tree-Structured Representation for Book Author and Its Recommendation Using Multilayer SOM
Lu, Lu
Zhang, Haijun
2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
[34] Tree-structured analysis of treatment effects with large observational data
Kang, Joseph
Su, Xiaogang
Hitsman, Brian
Liu, Kiang
Lloyd-Jones, Donald
JOURNAL OF APPLIED STATISTICS, 2012, 39 (03) : 513 - 529
[35] A Tree-Structured Database Machine for Large Relational Database Systems
孟力明
徐晓飞
常会友
陈光熙
胡铭曾
李生
Journal of Computer Science and Technology, 1987, (04) : 265 - 275
[36] LEARNING A TREE-STRUCTURED ISING MODEL IN ORDER TO MAKE PREDICTIONS
Bresler, Guy
Karzand, Mina
ANNALS OF STATISTICS, 2020, 48 (02): : 713 - 737
[37] Tree-structured Bayesian network learning with application to scene classification
Wang, Z. F.
Wang, Z. H.
Xie, W. J.
ELECTRONICS LETTERS, 2011, 47 (09) : 540 - 541
[38] TIFE: Tree-Structured Interactive Feature Enhancement for Multivariate Time Series Forecasting
Wang, Yixiang
Li, Haonan
Zhang, Zhenguo
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT VI, ICIC 2024, 2024, 14880 : 134 - 145
[39] Tree-structured multi-layer fuzzy cognitive maps for modelling large scale, complex problems
Mateou, N. H.
Andreou, A. S.
INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE FOR MODELLING, CONTROL & AUTOMATION JOINTLY WITH INTERNATIONAL CONFERENCE ON INTELLIGENT AGENTS, WEB TECHNOLOGIES & INTERNET COMMERCE, VOL 2, PROCEEDINGS, 2006, : 131 - +
[40] Tree2Vector: Learning a Vectorial Representation for Tree-Structured Data
Zhang, Haijun
Wang, Shuang
Xu, Xiaofei
Chow, Tommy W. S.
Wu, Q. M. Jonathan
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (11) : 5304 - 5318

← 1 2 3 4 5 →