Large-Scale Interactive Recommendation With Tree-Structured Reinforcement Learning

被引:4
|
作者
Chen, Haokun [1 ,2 ]
Zhu, Chenxu [1 ]
Tang, Ruiming [3 ]
Zhang, Weinan [1 ]
He, Xiuqiang [3 ]
Yu, Yong [1 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai 200240, Peoples R China
[2] Alibaba Grp, Hangzhou 310052, Peoples R China
[3] Huawei Noahs Ark Lab, Shenzhen 518129, Peoples R China
基金
中国国家自然科学基金;
关键词
Reinforcement learning; Predictive models; interactive recommender system; large-scale recommendation; cold-start; ALGORITHMS;
D O I
10.1109/TKDE.2021.3137310
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Although reinforcement learning (RL) techniques are regarded as promising solutions for interactive recommender systems (IRS), such solutions still face three main challenges, namely, i) time inefficiency when handling large discrete action space in IRS, ii) inability to deal with the cold-start scenarios in IRS, iii) data inefficiency during training the RL-based methods. To tackle these challenges, we propose a generic tree-structured RL framework taking both policy-based and value-based approaches into consideration. We propose to construct a balanced tree over representations of the items, such that picking an item is formulated as seeking a suitable path from the root to a leaf node in the balanced tree, which dramatically reduces the time complexity of item recommendation. Further, for cold-start scenarios where prior information of the items is unavailable, we initialize a random balanced tree as the starting point and then refine the tree structure based on the learned item representations. Besides, we also incorporate a user modeling component to explicitly model the environment, which can be utilized in the training phase to improve data efficiency. Extensive experiments on two real-world datasets are conducted and demonstrate that our framework can achieve superior recommendation performance and provide time and data efficiency improvement over state-of-the-art methods in both warm-start and cold-start IRS scenarios.
引用
收藏
页码:4018 / 4032
页数:15
相关论文
共 50 条
  • [41] Tree-structured Curriculum Learning based on Semantic Similarity of Text
    Han, Sanggyu
    Myaeng, Sung-Hyon
    2017 16TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2017, : 971 - 976
  • [42] Improving Uncertainty Quantification of Variance Networks by Tree-Structured Learning
    Ma, Wenxuan
    Yan, Xing
    Zhang, Kun
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, : 1 - 15
  • [43] Visual tracking with tree-structured appearance model for online learning
    Lv, Yun-Qiu
    Liu, Kai
    Cheng, Fei
    Li, Wei
    IET IMAGE PROCESSING, 2019, 13 (12) : 2106 - 2115
  • [44] Latent Structured Perceptrons for Large-Scale Learning with Hidden Information
    Sun, Xu
    Matsuzaki, Takuya
    Li, Wenjie
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2013, 25 (09) : 2063 - 2075
  • [45] Reinforcement learning in a large-scale photonic recurrent neural network
    Bueno, J.
    Maktoobi, S.
    Froehly, L.
    Fischer, I.
    Jacquot, M.
    Larger, L.
    Brunner, D.
    OPTICA, 2018, 5 (06): : 756 - 760
  • [46] Structured sparsity learning for large-scale fuzzy cognitive maps
    Ding Fengqian
    Luo Chao
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 105
  • [47] Large-scale power inspection: A deep reinforcement learning approach
    Guan, Qingshu
    Zhang, Xiangquan
    Xie, Minghui
    Nie, Jianglong
    Cao, Hui
    Chen, Zhao
    He, Zhouqiang
    FRONTIERS IN ENERGY RESEARCH, 2023, 10
  • [48] Scheduling Large-scale Distributed Training via Reinforcement Learning
    Peng, Zhanglin
    Ren, Jiamin
    Zhang, Ruimao
    Wu, Lingyun
    Wang, Xinjiang
    Luo, Ping
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 1797 - 1806
  • [49] Efficient and scalable reinforcement learning for large-scale network control
    Ma, Chengdong
    Li, Aming
    Du, Yali
    Dong, Hao
    Yang, Yaodong
    NATURE MACHINE INTELLIGENCE, 2024, 6 (09) : 1006 - 1020
  • [50] Large-Scale Wildfire Mitigation Through Deep Reinforcement Learning
    Altamimi, Abdulelah
    Lagoa, Constantino
    Borges, Jose G.
    McDill, Marc E.
    Andriotis, C. P.
    Papakonstantinou, K. G.
    FRONTIERS IN FORESTS AND GLOBAL CHANGE, 2022, 5