Reinforcement Routing on Proximity Graph for Efficient Recommendation

被引:4
|
作者
Feng, Chao [1 ]
Lian, Defu [1 ]
Wang, Xiting [2 ]
Liu, Zheng [2 ]
Xie, Xing [2 ]
Chen, Enhong [1 ]
机构
[1] Univ Sci & Technol China, Sch Comp Sci & Technol, 443 Huangshan Rd, Hefei, Peoples R China
[2] Microsoft Res Asia, 5 Danleng St, Beijing, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
MIPS; non-metric; proximity graph; reinforcement learning; reward shaping; graph convolutional network; BINARY SEARCH TREES; QUANTIZATION;
D O I
10.1145/3512767
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We focus onMaximum Inner Product Search (MIPS), which is an essential problem in many machine learning communities. Given a query, MIPS finds the most similar items with the maximum inner products. Methods for Nearest Neighbor Search (NNS) which is usually defined on metric space do not exhibit the satisfactory performance for MIPS problem since inner product is a non-metric function. However, inner products exhibit many good properties compared with metric functions, such as avoiding vanishing and exploding gradients. As a result, inner product is widely used in many recommendation systems, which makes efficient Maximum Inner Product Search a key for speeding up many recommendation systems. Graph-based methods for NNS problem show the superiorities compared with other class methods. Each data point of the database ismapped to a node of the proximity graph. Nearest neighbor search in the database can be converted to route on the proximity graph to find the nearest neighbor for the query. This technique can be used to solve MIPS problem. Instead of searching the nearest neighbor for the query, we search the item with amaximum inner product with query on the proximity graph. In this article, we propose a reinforcement model to train an agent to search on the proximity graph automatically for MIPS problem if we lack the ground truths of training queries. If we know the ground truths of some training queries, our model can also utilize these ground truths by imitation learning to improve the agent's searchability. By experiments, we can see that our proposed mode which combines reinforcement learning with imitation learning shows the superiorities over the state-of-the-art methods.
引用
收藏
页数:27
相关论文
共 50 条
  • [21] An Efficient Routing Method for Range Queries in Skip Graph
    Banno, Ryohei
    Shudo, Kazuyuki
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2020, E103D (03) : 516 - 525
  • [22] Packet Routing with Graph Attention Multi-Agent Reinforcement Learning
    Mai, Xuan
    Fu, Quanzhi
    Chen, Yi
    2021 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2021,
  • [23] Efficient adaptive routing algorithm for the faulty star graph
    Osaka Univ, Suita-shi, Japan
    IEICE Trans Inf Syst, 8 (783-792):
  • [24] ScaIR: Scalable Intelligent Routing based on Distributed Graph Reinforcement Learning
    State Key Laboratory of Networking and Switching Technology, Beijing University of Posts and Telecommunications, Beijing
    100876, China
    不详
    100044, China
    Comput. Networks, 1600, (February 2025):
  • [25] GAPPO - A Graph Attention Reinforcement Learning based Robust Routing Algorithm
    Li, Xinyuan
    Xiao, Yang
    Liu, Sixu
    Lu, Xucong
    Liu, Fang
    Zhou, Wenli
    Liu, Jun
    2023 IEEE 34TH ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS, PIMRC, 2023,
  • [26] An efficient adaptive routing algorithm for the faulty star graph
    Bai, LQ
    Ebara, H
    Nakano, H
    Maeda, H
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1998, E81D (08) : 783 - 792
  • [27] An efficient adaptive routing algorithm for the faulty star graph
    Bai, LQ
    Maeda, H
    Ebara, H
    Nakano, H
    1997 INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS, PROCEEDINGS, 1997, : 82 - 87
  • [28] ScaIR: Scalable Intelligent Routing based on Distributed Graph Reinforcement Learning
    Zhang, Jing
    Guan, Jianfeng
    Liu, Kexian
    Hu, Yizhong
    Shen, Ao
    Ma, Yuyin
    COMPUTER NETWORKS, 2025, 257
  • [29] DR-GAT: Dynamic routing graph attention network for stock recommendation
    Lei, Zengyu
    Zhang, Caiming
    Xu, Yunyang
    Li, Xuemei
    INFORMATION SCIENCES, 2024, 654
  • [30] An efficient joint framework for interacting knowledge graph and item recommendation
    Du, Haizhou
    Tang, Yue
    Cheng, Zebang
    KNOWLEDGE AND INFORMATION SYSTEMS, 2023, 65 (04) : 1685 - 1712