Large-Scale Interactive Recommendation With Tree-Structured Reinforcement Learning

被引：4

作者：

Chen, Haokun ^{[1
,2
]}

Zhu, Chenxu ^{[1
]}

Tang, Ruiming ^{[3
]}

Zhang, Weinan ^{[1
]}

He, Xiuqiang ^{[3
]}

Yu, Yong ^{[1
]}

机构：

[1] Shanghai Jiao Tong Univ, Shanghai 200240, Peoples R China

[2] Alibaba Grp, Hangzhou 310052, Peoples R China

[3] Huawei Noahs Ark Lab, Shenzhen 518129, Peoples R China

来源：

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING | 2023年 / 35卷 / 04期

基金：

中国国家自然科学基金;

关键词：

Reinforcement learning; Predictive models; interactive recommender system; large-scale recommendation; cold-start; ALGORITHMS;

D O I：

10.1109/TKDE.2021.3137310

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Although reinforcement learning (RL) techniques are regarded as promising solutions for interactive recommender systems (IRS), such solutions still face three main challenges, namely, i) time inefficiency when handling large discrete action space in IRS, ii) inability to deal with the cold-start scenarios in IRS, iii) data inefficiency during training the RL-based methods. To tackle these challenges, we propose a generic tree-structured RL framework taking both policy-based and value-based approaches into consideration. We propose to construct a balanced tree over representations of the items, such that picking an item is formulated as seeking a suitable path from the root to a leaf node in the balanced tree, which dramatically reduces the time complexity of item recommendation. Further, for cold-start scenarios where prior information of the items is unavailable, we initialize a random balanced tree as the starting point and then refine the tree structure based on the learned item representations. Besides, we also incorporate a user modeling component to explicitly model the environment, which can be utilized in the training phase to improve data efficiency. Extensive experiments on two real-world datasets are conducted and demonstrate that our framework can achieve superior recommendation performance and provide time and data efficiency improvement over state-of-the-art methods in both warm-start and cold-start IRS scenarios.

引用

页码：4018 / 4032

页数：15

共 50 条

[11] Tree-Structured CRF Models for Interactive Image Labeling
Mensink, Thomas
Verbeek, Jakob
Csurka, Gabriela
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (02) : 476 - 489
[12] Online Formation of Large Tree-Structured Team
Ding, Cheng
Xia, Fan
Gopakumar
Qian, Weining
Zhou, Aoying
DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2017), 2017, 10179 : 118 - 132
[13] Tree-Structured Policy Based Progressive Reinforcement Learning for Temporally Language Grounding in Video
Wu, Jie
Li, Guanbin
Liu, Si
Lin, Liang
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12386 - 12393
[14] Research on Automated Reinforcement Learning: based on Tree-structured Parzen Estimators and Median Pruning
Wang, Zhaolei
Wang, Ludi
Liang, Qi
Luo, Wuyi
Gong, Qinghai
Li, Shanshan
PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 5461 - 5465
[15] Statistical Tests for Large Tree-Structured Data
Bharath, Karthik
Kambadur, Prabhanjan
Dey, Dipak K.
Rao, Arvind
Baladandayuthapani, Veerabhadran
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2017, 112 (520) : 1733 - 1743
[16] The Simulation Technique for Large-Scale Tree Structured Interconnects
Gorshkov, K.
2016 2ND INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING, APPLICATIONS AND MANUFACTURING (ICIEAM), 2016,
[17] Learning Program Representations with a Tree-Structured Transformer
Wang, Wenhan
Zhang, Kechi
Li, Ge
Liu, Shangqing
Li, Anran
Jin, Zhi
Liu, Yang
2023 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ANALYSIS, EVOLUTION AND REENGINEERING, SANER, 2023, : 248 - 259
[18] Learning Tree-Structured Data in the Model Space
Dong, Ya-dong
Lv, Sheng-fei
2016 INTERNATIONAL CONFERENCE ON INFORMATION SYSTEM AND ARTIFICIAL INTELLIGENCE (ISAI 2016), 2016, : 258 - 266
[19] Tree-structured supervised learning and the genetics of hypertension
Huang, J
Lin, A
Narasimhan, B
Quertermous, T
Hsiung, CA
Ho, LT
Grove, JS
Olivier, M
Ranade, K
Risch, NJ
Shen, RA
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (29) : 10529 - 10534
[20] Interactive tree-structured regression via principal Hessian directions
Li, KC
Lue, HH
Chen, CH
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2000, 95 (450) : 547 - 560

← 1 2 3 4 5 →