Trading quality for time with Nearest-Neighbor Search

被引:0
|
作者
Weber, R [1 ]
Böhm, K [1 ]
机构
[1] ETH Zentrum, Inst Informat Syst, CH-8092 Zurich, Switzerland
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In many situations, users would readily accept an approximate query result if evaluation of the query becomes faster, In this article, we investigate approximate evaluation techniques based on the VA-File for Nearest-Neighbor Search (NN-Search). The VA-File contains approximations of feature points. These approximations frequently suffice to eliminate the vast majority of points in a first phase. Then, a second phase identifies the NN by computing exact distances of all remaining points. To develop approximate query-evaluation techniques, we proceed in two steps: first, we derive an analytic model for VA-File based NN-search. This is to investigate the relationship between approximation granularity, effectiveness of the filtering step and search performance. In more detail, we develop formulae for the distribution of the error of the bounds and the duration of the different phases of query evaluation. Based on these results, we develop different approximate query evaluation techniques. The first one adapts the bounds to have a more rigid filtering, the second one skips computation of the exact distances. Experiments show that these techniques have the desired effect: for instance, when allowing for a small but specific reduction of result quality, we observed a speedup of 7 in 50-NN search.
引用
收藏
页码:21 / 35
页数:15
相关论文
共 50 条
  • [31] Range nearest-neighbor query
    Hu, HB
    Lee, DL
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2006, 18 (01) : 78 - 91
  • [32] NEAREST-NEIGHBOR MEDIAN FILTER
    ITOH, K
    ICHIOKA, Y
    MINAMI, T
    APPLIED OPTICS, 1988, 27 (16): : 3445 - 3450
  • [33] WEIGHTED NEAREST-NEIGHBOR ANALYSIS
    SCHWARZBACH, E
    BIOMETRICS, 1985, 41 (04) : 1088 - 1088
  • [34] Coresets for the nearest-neighbor rule
    Department of Computer Science, University of Maryland, College Park
    MD, United States
    不详
    MD, United States
    Leibniz Int. Proc. Informatics, LIPIcs,
  • [35] FASTER NEAREST-NEIGHBOR CALCULATIONS
    BATCHELOR, BG
    ELECTRONICS LETTERS, 1977, 13 (10) : 304 - 306
  • [36] Nearest-neighbor variance estimation (NNVE): Robust covariance estimation via nearest-neighbor cleaning
    Wang, N
    Raftery, AE
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2002, 97 (460) : 994 - 1006
  • [37] NEAREST-NEIGHBOR MULTICHANNEL FILTER
    PLATANIOTIS, KN
    ANDROUTSOS, D
    SRI, V
    VENETSANOPOULOS, AN
    ELECTRONICS LETTERS, 1995, 31 (22) : 1910 - 1911
  • [38] CHOICE OF NEIGHBOR ORDER IN NEAREST-NEIGHBOR CLASSIFICATION
    Hall, Peter
    Park, Byeong U.
    Samworth, Richard J.
    ANNALS OF STATISTICS, 2008, 36 (05): : 2135 - 2152
  • [39] A fast nearest-neighbor algorithm based on a principal axis search tree
    McNames, J
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2001, 23 (09) : 964 - 976
  • [40] Nearest-neighbor Search from Large Datasets using Narrow Sketches
    Higuchi, Naoya
    Imamura, Yasunobu
    Mic, Vladimir
    Shinohara, Takeshi
    Hirata, Kouichi
    Kuboyama, Tetsuji
    PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS (ICPRAM), 2021, : 401 - 410