Large-Scale Interactive Recommendation With Tree-Structured Reinforcement Learning

被引:4
|
作者
Chen, Haokun [1 ,2 ]
Zhu, Chenxu [1 ]
Tang, Ruiming [3 ]
Zhang, Weinan [1 ]
He, Xiuqiang [3 ]
Yu, Yong [1 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai 200240, Peoples R China
[2] Alibaba Grp, Hangzhou 310052, Peoples R China
[3] Huawei Noahs Ark Lab, Shenzhen 518129, Peoples R China
基金
中国国家自然科学基金;
关键词
Reinforcement learning; Predictive models; interactive recommender system; large-scale recommendation; cold-start; ALGORITHMS;
D O I
10.1109/TKDE.2021.3137310
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Although reinforcement learning (RL) techniques are regarded as promising solutions for interactive recommender systems (IRS), such solutions still face three main challenges, namely, i) time inefficiency when handling large discrete action space in IRS, ii) inability to deal with the cold-start scenarios in IRS, iii) data inefficiency during training the RL-based methods. To tackle these challenges, we propose a generic tree-structured RL framework taking both policy-based and value-based approaches into consideration. We propose to construct a balanced tree over representations of the items, such that picking an item is formulated as seeking a suitable path from the root to a leaf node in the balanced tree, which dramatically reduces the time complexity of item recommendation. Further, for cold-start scenarios where prior information of the items is unavailable, we initialize a random balanced tree as the starting point and then refine the tree structure based on the learned item representations. Besides, we also incorporate a user modeling component to explicitly model the environment, which can be utilized in the training phase to improve data efficiency. Extensive experiments on two real-world datasets are conducted and demonstrate that our framework can achieve superior recommendation performance and provide time and data efficiency improvement over state-of-the-art methods in both warm-start and cold-start IRS scenarios.
引用
收藏
页码:4018 / 4032
页数:15
相关论文
共 50 条
  • [11] Tree-Structured CRF Models for Interactive Image Labeling
    Mensink, Thomas
    Verbeek, Jakob
    Csurka, Gabriela
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (02) : 476 - 489
  • [12] Online Formation of Large Tree-Structured Team
    Ding, Cheng
    Xia, Fan
    Gopakumar
    Qian, Weining
    Zhou, Aoying
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2017), 2017, 10179 : 118 - 132
  • [13] Tree-Structured Policy Based Progressive Reinforcement Learning for Temporally Language Grounding in Video
    Wu, Jie
    Li, Guanbin
    Liu, Si
    Lin, Liang
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12386 - 12393
  • [14] Research on Automated Reinforcement Learning: based on Tree-structured Parzen Estimators and Median Pruning
    Wang, Zhaolei
    Wang, Ludi
    Liang, Qi
    Luo, Wuyi
    Gong, Qinghai
    Li, Shanshan
    PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 5461 - 5465
  • [15] Statistical Tests for Large Tree-Structured Data
    Bharath, Karthik
    Kambadur, Prabhanjan
    Dey, Dipak K.
    Rao, Arvind
    Baladandayuthapani, Veerabhadran
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2017, 112 (520) : 1733 - 1743
  • [16] The Simulation Technique for Large-Scale Tree Structured Interconnects
    Gorshkov, K.
    2016 2ND INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING, APPLICATIONS AND MANUFACTURING (ICIEAM), 2016,
  • [17] Learning Program Representations with a Tree-Structured Transformer
    Wang, Wenhan
    Zhang, Kechi
    Li, Ge
    Liu, Shangqing
    Li, Anran
    Jin, Zhi
    Liu, Yang
    2023 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ANALYSIS, EVOLUTION AND REENGINEERING, SANER, 2023, : 248 - 259
  • [18] Learning Tree-Structured Data in the Model Space
    Dong, Ya-dong
    Lv, Sheng-fei
    2016 INTERNATIONAL CONFERENCE ON INFORMATION SYSTEM AND ARTIFICIAL INTELLIGENCE (ISAI 2016), 2016, : 258 - 266
  • [19] Tree-structured supervised learning and the genetics of hypertension
    Huang, J
    Lin, A
    Narasimhan, B
    Quertermous, T
    Hsiung, CA
    Ho, LT
    Grove, JS
    Olivier, M
    Ranade, K
    Risch, NJ
    Shen, RA
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (29) : 10529 - 10534
  • [20] Interactive tree-structured regression via principal Hessian directions
    Li, KC
    Lue, HH
    Chen, CH
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2000, 95 (450) : 547 - 560