A Learning-to-Rank Formulation of Clustering-Based Approximate Nearest Neighbor Search

Cited by: 1
Authors
Vecchiato, Thomas [1 ]
Lucchese, Claudio [1 ]
Nardini, Franco Maria [2 ]
Bruch, Sebastian [3 ]
Affiliations
[1] Ca Foscari Univ Venice, Venice, Italy
[2] ISTI CNR, Pisa, Italy
[3] Pinecone, New York, NY USA
Source
PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024 | 2024
Keywords
Approximate Nearest Neighbor Search; Inverted File; Learning to Rank; EFFICIENT
DOI
10.1145/3626772.3657931
Chinese Library Classification
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
A critical piece of the modern information retrieval puzzle is approximate nearest neighbor search. Its objective is to return a set of k data points that are closest to a query point, with its accuracy measured by the proportion of exact nearest neighbors captured in the returned set. One popular approach to this question is clustering: The indexing algorithm partitions data points into non-overlapping subsets and represents each partition by a point such as its centroid. The query processing algorithm first identifies the nearest clusters (a process known as routing), then performs a nearest neighbor search over those clusters only. In this work, we make a simple observation: The routing function solves a ranking problem. Its quality can therefore be assessed with a ranking metric, making the function amenable to learning-to-rank. Interestingly, ground-truth is often freely available: Given a query distribution in a top-k configuration, the ground-truth is the set of clusters that contain the exact top-k vectors. We develop this insight and apply it to Maximum Inner Product Search (MIPS). As we demonstrate empirically on various datasets, learning a simple linear function consistently improves the accuracy of clustering-based MIPS.
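The routing-as-ranking idea in the abstract can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the synthetic Gaussian data, the plain k-means partitioning, the top-1 ground truth, the softmax cross-entropy loss, the initialization of the linear router W from the centroids, and all function names. It shows how ground-truth cluster labels fall out of an exact search, and how a linear routing function can be scored with a recall-style ranking metric.

```python
import numpy as np

rng = np.random.default_rng(0)
n_points, dim, n_clusters = 2000, 16, 20

# Synthetic stand-ins for a real collection and query log (assumption).
X = rng.normal(size=(n_points, dim))
Q = rng.normal(size=(600, dim))
Q /= np.linalg.norm(Q, axis=1, keepdims=True)  # unit-norm queries
Q_train, Q_test = Q[:400], Q[400:]


def kmeans(points, n_clusters, iters=10):
    """Plain Lloyd's algorithm; returns centroids and a cluster id per point."""
    centroids = points[rng.choice(len(points), size=n_clusters, replace=False)]
    for _ in range(iters):
        dist = ((points[:, None, :] - centroids[None, :, :]) ** 2).sum(-1)
        assign = dist.argmin(axis=1)
        for c in range(n_clusters):
            members = points[assign == c]
            if len(members):  # keep the old centroid if a cluster is empty
                centroids[c] = members.mean(axis=0)
    return centroids, assign


def ground_truth_cluster(queries, points, assign):
    """Cluster holding each query's exact top-1 point under inner product."""
    return assign[(queries @ points.T).argmax(axis=1)]


def routing_recall(W, queries, labels, n_probe):
    """Fraction of queries whose ground-truth cluster is in the top n_probe."""
    top = np.argsort(-(queries @ W.T), axis=1)[:, :n_probe]
    return (top == labels[:, None]).any(axis=1).mean()


def loss_and_grad(W, queries, labels):
    """Mean softmax cross-entropy of the scores W q, and its gradient in W."""
    logits = queries @ W.T
    logits -= logits.max(axis=1, keepdims=True)  # numerical stability
    logp = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    rows = np.arange(len(queries))
    loss = -logp[rows, labels].mean()
    probs = np.exp(logp)
    probs[rows, labels] -= 1.0
    return loss, probs.T @ queries / len(queries)


centroids, assign = kmeans(X, n_clusters)
y_train = ground_truth_cluster(Q_train, X, assign)
y_test = ground_truth_cluster(Q_test, X, assign)

# Linear router: start from the centroids, then fit with gradient descent.
W = centroids.copy()
initial_loss, _ = loss_and_grad(W, Q_train, y_train)
for _ in range(300):
    _, grad = loss_and_grad(W, Q_train, y_train)
    W -= 1.0 * grad
final_loss, _ = loss_and_grad(W, Q_train, y_train)

print("train loss:", initial_loss, "->", final_loss)
for n_probe in (1, 2, 4):
    print(
        "n_probe", n_probe,
        "centroid routing:", routing_recall(centroids, Q_test, y_test, n_probe),
        "learned routing:", routing_recall(W, Q_test, y_test, n_probe),
    )
```

Routing by centroid inner product is the special case W = centroids, so both routers are evaluated by the same function. On random data like this the held-out numbers carry no evidential weight; the sketch only shows the mechanics of deriving labels and measuring routing quality.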
Pages: 2261-2265
Page count: 5