Fast nearest neighbor search in high-dimensional space

被引:64
|
作者
Berchtold, S [1 ]
Ertl, B [1 ]
Keim, DA [1 ]
Kriegel, HP [1 ]
Seidl, T [1 ]
机构
[1] AT&T Bell Labs, Res, Naperville, IL 60566 USA
关键词
D O I
10.1109/ICDE.1998.655779
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Similarity search in multimedia databases requires an efficient support of nearest-neighbor search on a large set of high-dimensional points as a basic operation for query processing. As recent theoretical results show, state of the art approaches to nearest-neighbor search are not efficient in higher dimensions. In our new approach, we therefore precompute the result of any nearest-neighbor search which corresponds to a computation of the voronoi cell of each data point. In a second step, we store the voronoi cells in an index structure efficient for high-dimensional data spaces. As a result, nearest neighbor search corresponds to a simple point query on the index structure. Although our technique is based on a precomputation of the solution space, it is dynamic, i.e. it supports insertions of new delta points. An extensive experimental evaluation of our technique demonstrates the high efficiency for uniformly distributed as well as real data. We obtained a significant reduction of the search time compared to nearest neighbor search in the X-tree (up to a factor of 4).
引用
收藏
页码:209 / 218
页数:10
相关论文
共 50 条
  • [31] I-LSH: I/O efficient c-Approximate Nearest Neighbor Search in High-dimensional Space
    Liu, Wanqi
    Wang, Hanchen
    Zhang, Ying
    Wang, Wei
    Qin, Lu
    [J]. 2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2019), 2019, : 1670 - 1673
  • [32] A Fast Approximate Nearest Neighbor Search Algorithm in the Hamming Space
    Esmaeili, Mani Malek
    Ward, Rabab Kreidieh
    Fatourechi, Mehrdad
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (12) : 2481 - 2488
  • [33] GPU-Accelerated Reverse K-Nearest Neighbor Search for High-Dimensional Data
    Tsuihiji, Kyohei
    Amagasa, Toshiyuki
    [J]. ADVANCES IN NETWORK-BASED INFORMATION SYSTEMS, NBIS-2022, 2022, 526 : 279 - 288
  • [34] Exploit Every Bit: Effective Caching for High-Dimensional Nearest Neighbor Search (Extended Abstract)
    Tang, Bo
    Yiu, Man Lung
    Hua, Kien A.
    [J]. 2017 IEEE 33RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2017), 2017, : 45 - 46
  • [35] Quality and Efficiency in High Dimensional Nearest Neighbor Search
    Tao, Yufei
    Yi, Ke
    Sheng, Cheng
    Kalnis, Panos
    [J]. ACM SIGMOD/PODS 2009 CONFERENCE, 2009, : 563 - 575
  • [36] Utilization of principle axis analysis for fast nearest neighbor searches in high-dimensional image databases
    Department of Electronic Engineering, National Kinmen Institute of Technology, Kinmen, 892, Taiwan
    不详
    [J]. J. Inf. Sci. Eng., 2008, 2 (553-571):
  • [37] Utilization of principle axis analysis for fast nearest neighbor searches in high-dimensional image databases
    Wu, Tian-Luu
    Cheng, Shyi-Chyi
    [J]. JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2008, 24 (02) : 553 - 571
  • [38] Randomized Embeddings with Slack and High-Dimensional Approximate Nearest Neighbor
    Anagnostopoulos, Evangelos
    Emiris, Ioannis Z.
    Psarros, Ioannis
    [J]. ACM TRANSACTIONS ON ALGORITHMS, 2018, 14 (02)
  • [39] On optimizing nearest neighbor queries in high-dimensional data spaces
    Berchtold, S
    Böhm, C
    Keim, D
    Krebs, F
    Kriegel, HP
    [J]. DATABASE THEORY - ICDT 2001, PROCEEDINGS, 2001, 1973 : 435 - 449
  • [40] Efficient nearest neighbor query based on extended B+-tree in high-dimensional space
    Cui, Jiangtao
    An, Zhiyong
    Guo, Yong
    Zhou, Shuisheng
    [J]. PATTERN RECOGNITION LETTERS, 2010, 31 (12) : 1740 - 1748