Large-Scale Interactive Recommendation With Tree-Structured Reinforcement Learning

被引:4
|
作者
Chen, Haokun [1 ,2 ]
Zhu, Chenxu [1 ]
Tang, Ruiming [3 ]
Zhang, Weinan [1 ]
He, Xiuqiang [3 ]
Yu, Yong [1 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai 200240, Peoples R China
[2] Alibaba Grp, Hangzhou 310052, Peoples R China
[3] Huawei Noahs Ark Lab, Shenzhen 518129, Peoples R China
基金
中国国家自然科学基金;
关键词
Reinforcement learning; Predictive models; interactive recommender system; large-scale recommendation; cold-start; ALGORITHMS;
D O I
10.1109/TKDE.2021.3137310
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Although reinforcement learning (RL) techniques are regarded as promising solutions for interactive recommender systems (IRS), such solutions still face three main challenges, namely, i) time inefficiency when handling large discrete action space in IRS, ii) inability to deal with the cold-start scenarios in IRS, iii) data inefficiency during training the RL-based methods. To tackle these challenges, we propose a generic tree-structured RL framework taking both policy-based and value-based approaches into consideration. We propose to construct a balanced tree over representations of the items, such that picking an item is formulated as seeking a suitable path from the root to a leaf node in the balanced tree, which dramatically reduces the time complexity of item recommendation. Further, for cold-start scenarios where prior information of the items is unavailable, we initialize a random balanced tree as the starting point and then refine the tree structure based on the learned item representations. Besides, we also incorporate a user modeling component to explicitly model the environment, which can be utilized in the training phase to improve data efficiency. Extensive experiments on two real-world datasets are conducted and demonstrate that our framework can achieve superior recommendation performance and provide time and data efficiency improvement over state-of-the-art methods in both warm-start and cold-start IRS scenarios.
引用
收藏
页码:4018 / 4032
页数:15
相关论文
共 50 条
  • [31] Algorithms or Actions? A Study in Large-Scale Reinforcement Learning
    Tavares, Anderson Rocha
    Anbalagan, Sivasubramanian
    Marcolino, Leandro Soriano
    Chaimowicz, Luiz
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 2717 - 2723
  • [32] Deep Reinforcement Learning for Large-Scale Epidemic Control
    Libin, Pieter J. K.
    Moonens, Arno
    Verstraeten, Timothy
    Perez-Sanjines, Fabian
    Hens, Niel
    Lemey, Philippe
    Nowe, Ann
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: APPLIED DATA SCIENCE AND DEMO TRACK, ECML PKDD 2020, PT V, 2021, 12461 : 155 - 170
  • [33] A Tree-Structured Representation for Book Author and Its Recommendation Using Multilayer SOM
    Lu, Lu
    Zhang, Haijun
    2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
  • [34] Tree-structured analysis of treatment effects with large observational data
    Kang, Joseph
    Su, Xiaogang
    Hitsman, Brian
    Liu, Kiang
    Lloyd-Jones, Donald
    JOURNAL OF APPLIED STATISTICS, 2012, 39 (03) : 513 - 529
  • [35] A Tree-Structured Database Machine for Large Relational Database Systems
    孟力明
    徐晓飞
    常会友
    陈光熙
    胡铭曾
    李生
    Journal of Computer Science and Technology, 1987, (04) : 265 - 275
  • [36] LEARNING A TREE-STRUCTURED ISING MODEL IN ORDER TO MAKE PREDICTIONS
    Bresler, Guy
    Karzand, Mina
    ANNALS OF STATISTICS, 2020, 48 (02): : 713 - 737
  • [37] Tree-structured Bayesian network learning with application to scene classification
    Wang, Z. F.
    Wang, Z. H.
    Xie, W. J.
    ELECTRONICS LETTERS, 2011, 47 (09) : 540 - 541
  • [38] TIFE: Tree-Structured Interactive Feature Enhancement for Multivariate Time Series Forecasting
    Wang, Yixiang
    Li, Haonan
    Zhang, Zhenguo
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT VI, ICIC 2024, 2024, 14880 : 134 - 145
  • [39] Tree-structured multi-layer fuzzy cognitive maps for modelling large scale, complex problems
    Mateou, N. H.
    Andreou, A. S.
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE FOR MODELLING, CONTROL & AUTOMATION JOINTLY WITH INTERNATIONAL CONFERENCE ON INTELLIGENT AGENTS, WEB TECHNOLOGIES & INTERNET COMMERCE, VOL 2, PROCEEDINGS, 2006, : 131 - +
  • [40] Tree2Vector: Learning a Vectorial Representation for Tree-Structured Data
    Zhang, Haijun
    Wang, Shuang
    Xu, Xiaofei
    Chow, Tommy W. S.
    Wu, Q. M. Jonathan
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (11) : 5304 - 5318