Reinforcement Routing on Proximity Graph for Efficient Recommendation

被引:4
|
作者
Feng, Chao [1 ]
Lian, Defu [1 ]
Wang, Xiting [2 ]
Liu, Zheng [2 ]
Xie, Xing [2 ]
Chen, Enhong [1 ]
机构
[1] Univ Sci & Technol China, Sch Comp Sci & Technol, 443 Huangshan Rd, Hefei, Peoples R China
[2] Microsoft Res Asia, 5 Danleng St, Beijing, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
MIPS; non-metric; proximity graph; reinforcement learning; reward shaping; graph convolutional network; BINARY SEARCH TREES; QUANTIZATION;
D O I
10.1145/3512767
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We focus onMaximum Inner Product Search (MIPS), which is an essential problem in many machine learning communities. Given a query, MIPS finds the most similar items with the maximum inner products. Methods for Nearest Neighbor Search (NNS) which is usually defined on metric space do not exhibit the satisfactory performance for MIPS problem since inner product is a non-metric function. However, inner products exhibit many good properties compared with metric functions, such as avoiding vanishing and exploding gradients. As a result, inner product is widely used in many recommendation systems, which makes efficient Maximum Inner Product Search a key for speeding up many recommendation systems. Graph-based methods for NNS problem show the superiorities compared with other class methods. Each data point of the database ismapped to a node of the proximity graph. Nearest neighbor search in the database can be converted to route on the proximity graph to find the nearest neighbor for the query. This technique can be used to solve MIPS problem. Instead of searching the nearest neighbor for the query, we search the item with amaximum inner product with query on the proximity graph. In this article, we propose a reinforcement model to train an agent to search on the proximity graph automatically for MIPS problem if we lack the ground truths of training queries. If we know the ground truths of some training queries, our model can also utilize these ground truths by imitation learning to improve the agent's searchability. By experiments, we can see that our proposed mode which combines reinforcement learning with imitation learning shows the superiorities over the state-of-the-art methods.
引用
收藏
页数:27
相关论文
共 50 条
  • [31] Supervised Reinforcement Session Recommendation Model Based on Dual-Graph Convolution
    Liang, Shunpan
    Zhang, Guozheng
    Ren, Wenhui
    IEEE ACCESS, 2023, 11 : 115380 - 115391
  • [32] Hierarchical Reinforcement Learning for Conversational Recommendation With Knowledge Graph Reasoning and Heterogeneous Questions
    Yang, Yao-Chun
    Chen, Chiao-Ting
    Lu, Tzu-Yu
    Huang, Szu-Hao
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2023, 16 (05) : 3439 - 3452
  • [33] Rec-clusterGCN: An Efficient Graph Convolution Network for Recommendation
    Sun, Tianhao
    Luo, Man
    Chen, Renqin
    Xia, Yunni
    Jiang, Ning
    2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, : 244 - 250
  • [34] Knowledge Graph-Enhanced Hierarchical Reinforcement Learning for Interactive and Explainable Recommendation
    Zhang, Mingwei
    Li, Yage
    Li, Shuping
    Wang, Yinchu
    Yan, Jing
    IEEE ACCESS, 2024, 12 : 137345 - 137359
  • [35] An efficient joint framework for interacting knowledge graph and item recommendation
    Haizhou Du
    Yue Tang
    Zebang Cheng
    Knowledge and Information Systems, 2023, 65 : 1685 - 1712
  • [36] Two-stage routing with optimized guided search and greedy algorithm on proximity graph
    Xu, Xiaoliang
    Wang, Mengzhao
    Wang, Yuxiang
    Ma, Dingcheng
    KNOWLEDGE-BASED SYSTEMS, 2021, 229
  • [37] Efficient Reinforcement Learning for Routing Jobs in Heterogeneous Queueing Systems
    Jali, Neharika
    Qu, Guannan
    Wang, Weina
    Joshi, Gauri
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
  • [38] Reinforcement Learning for Efficient Drone-Assisted Vehicle Routing
    Bogyrbayeva, Aigerim
    Dauletbayev, Bissenbay
    Meraliyev, Meraryslan
    APPLIED SCIENCES-BASEL, 2025, 15 (04):
  • [39] Route reinforcement for efficient QoS routing based on ant algorithm
    Oh, JS
    Bae, S
    Ahn, J
    Kang, S
    INFORMATION NETWORKING: NETWORKING TECHNOLOGIES FOR BROADBAND AND MOBILE NETWORKS, 2004, 3090 : 342 - 349
  • [40] An Efficient Graph Theoretic Algorithm for Channel Routing in VLSI Design with Given Constraint Graph
    Liu, Ming
    Geng, Xianya
    BIO-INSPIRED COMPUTING: THEORIES AND APPLICATIONS, PT 1, BIC-TA 2023, 2024, 2061 : 339 - 344