Reinforcement Routing on Proximity Graph for Efficient Recommendation

被引：4

作者：

Feng, Chao ^{[1
]}

Lian, Defu ^{[1
]}

Wang, Xiting ^{[2
]}

Liu, Zheng ^{[2
]}

Xie, Xing ^{[2
]}

Chen, Enhong ^{[1
]}

机构：

[1] Univ Sci & Technol China, Sch Comp Sci & Technol, 443 Huangshan Rd, Hefei, Peoples R China

[2] Microsoft Res Asia, 5 Danleng St, Beijing, Peoples R China

来源：

ACM TRANSACTIONS ON INFORMATION SYSTEMS | 2023年 / 41卷 / 01期

基金：

中国国家自然科学基金; 国家重点研发计划;

关键词：

MIPS; non-metric; proximity graph; reinforcement learning; reward shaping; graph convolutional network; BINARY SEARCH TREES; QUANTIZATION;

D O I：

10.1145/3512767

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We focus onMaximum Inner Product Search (MIPS), which is an essential problem in many machine learning communities. Given a query, MIPS finds the most similar items with the maximum inner products. Methods for Nearest Neighbor Search (NNS) which is usually defined on metric space do not exhibit the satisfactory performance for MIPS problem since inner product is a non-metric function. However, inner products exhibit many good properties compared with metric functions, such as avoiding vanishing and exploding gradients. As a result, inner product is widely used in many recommendation systems, which makes efficient Maximum Inner Product Search a key for speeding up many recommendation systems. Graph-based methods for NNS problem show the superiorities compared with other class methods. Each data point of the database ismapped to a node of the proximity graph. Nearest neighbor search in the database can be converted to route on the proximity graph to find the nearest neighbor for the query. This technique can be used to solve MIPS problem. Instead of searching the nearest neighbor for the query, we search the item with amaximum inner product with query on the proximity graph. In this article, we propose a reinforcement model to train an agent to search on the proximity graph automatically for MIPS problem if we lack the ground truths of training queries. If we know the ground truths of some training queries, our model can also utilize these ground truths by imitation learning to improve the agent's searchability. By experiments, we can see that our proposed mode which combines reinforcement learning with imitation learning shows the superiorities over the state-of-the-art methods.

引用

页数：27

共 50 条

[21] An Efficient Routing Method for Range Queries in Skip Graph
Banno, Ryohei
Shudo, Kazuyuki
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2020, E103D (03) : 516 - 525
[22] Packet Routing with Graph Attention Multi-Agent Reinforcement Learning
Mai, Xuan
Fu, Quanzhi
Chen, Yi
2021 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2021,
[23] Efficient adaptive routing algorithm for the faulty star graph
Osaka Univ, Suita-shi, Japan
IEICE Trans Inf Syst, 8 (783-792):
[24] ScaIR: Scalable Intelligent Routing based on Distributed Graph Reinforcement Learning
State Key Laboratory of Networking and Switching Technology, Beijing University of Posts and Telecommunications, Beijing
100876, China
不详
100044, China
Comput. Networks, 1600, (February 2025):
[25] GAPPO - A Graph Attention Reinforcement Learning based Robust Routing Algorithm
Li, Xinyuan
Xiao, Yang
Liu, Sixu
Lu, Xucong
Liu, Fang
Zhou, Wenli
Liu, Jun
2023 IEEE 34TH ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS, PIMRC, 2023,
[26] An efficient adaptive routing algorithm for the faulty star graph
Bai, LQ
Ebara, H
Nakano, H
Maeda, H
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1998, E81D (08) : 783 - 792
[27] An efficient adaptive routing algorithm for the faulty star graph
Bai, LQ
Maeda, H
Ebara, H
Nakano, H
1997 INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS, PROCEEDINGS, 1997, : 82 - 87
[28] ScaIR: Scalable Intelligent Routing based on Distributed Graph Reinforcement Learning
Zhang, Jing
Guan, Jianfeng
Liu, Kexian
Hu, Yizhong
Shen, Ao
Ma, Yuyin
COMPUTER NETWORKS, 2025, 257
[29] DR-GAT: Dynamic routing graph attention network for stock recommendation
Lei, Zengyu
Zhang, Caiming
Xu, Yunyang
Li, Xuemei
INFORMATION SCIENCES, 2024, 654
[30] An efficient joint framework for interacting knowledge graph and item recommendation
Du, Haizhou
Tang, Yue
Cheng, Zebang
KNOWLEDGE AND INFORMATION SYSTEMS, 2023, 65 (04) : 1685 - 1712

← 1 2 3 4 5 →