A meta-indexing method for fast probably approximately correct nearest neighbor searches

被引:1
|
作者
Santini, Simone [1 ]
机构
[1] Univ Autonoma Madrid, Escuela Politecn Super, C Tomas & Valiente 11, Madrid 28049, Spain
关键词
Indexing; Approximate nearest neighbor; Error modeling; Curse of dimensionality; Multimedia data base; Approximate search; QUERIES; ALGORITHM; FILE;
D O I
10.1007/s11042-022-12690-w
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper we present an indexing method for probably approximately correct nearest neighbor queries in high dimensional spaces capable of improving the performance of any index whose performance degrades with the increased dimensionality of the query space. The basic idea of the method is quite simple: we use SVD to concentrate the variance of the inter-element distance in a lower dimensional space, Xi. We do a nearest neighbor query in this space and then we "peek" forward from the nearest neighbor by gathering all the elements whose distance from the query is less than d(Xi) (1 + zeta sigma(2)(Xi)), where d(Xi) is the distance from the nearest neighbor in Xi, sigma(2)(Xi) is the variance of the data in Xi, and zeta a parameter. All the data thus collected form a tentative set T, in which we do a scan using the complete feature space to find the point closest to the query. The advantages of the method are that (1) it can be built on top of virtually any indexing method and (2) we can build a model of the distribution of the error precise enough to allow designing a compromise between error and speed. We show the improvement that we can obtain using data from the SUN data base.
引用
收藏
页码:30465 / 30491
页数:27
相关论文
共 34 条
  • [21] Inferring missing genotypes in large SNP panels using fast nearest-neighbor searches over sliding windows
    Roberts, Adam
    McMillan, Leonard
    Wang, Wei
    Parker, Joel
    Rusyn, Ivan
    Threadgill, David
    BIOINFORMATICS, 2007, 23 (13) : I401 - I407
  • [22] Feature Matching Method Based on SURF and Fast Library for Approximate Nearest Neighbor Search
    Wang Beiyi
    Zhang Xiaohong
    Wang Weibing
    INTEGRATED FERROELECTRICS, 2021, 218 (01) : 147 - 154
  • [23] Distance quantization method for fast nearest neighbor search computations with applications to motion estimation
    Cheong, Hye-Yeon
    Ortega, Antonio
    CONFERENCE RECORD OF THE FORTY-FIRST ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1-5, 2007, : 909 - 913
  • [24] MEAN-NEAREST-NEIGHBOR: A MORE PERVASIVE, FAST AND ADAPTIVE IMAGE FILTERING METHOD
    Tian, Peng
    Pan, Wei
    3RD INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND COMPUTER SCIENCE (ITCS 2011), PROCEEDINGS, 2011, : 319 - 322
  • [25] A novel supervised cluster adjustment method using a fast exact nearest neighbor search algorithm
    Ali Zaghian
    Fakhroddin Noorbehbahani
    Pattern Analysis and Applications, 2017, 20 : 701 - 715
  • [26] A novel cell partition method by introducing Silhouette Coefficient for fast approximate nearest neighbor search
    Song, Wenwen
    Wang, Yang
    Pan, Zhibin
    INFORMATION SCIENCES, 2023, 642
  • [27] A novel supervised cluster adjustment method using a fast exact nearest neighbor search algorithm
    Zaghian, Ali
    Noorbehbahani, Fakhroddin
    PATTERN ANALYSIS AND APPLICATIONS, 2017, 20 (03) : 701 - 715
  • [28] Fault classification method based on fast k-nearest neighbor with hybrid feature generation and K-medoids clustering
    Zhou, Zhe
    Zeng, Fanliang
    Huang, Jiacheng
    Zheng, Jinhui
    Li, Zuxin
    2020 35TH YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION (YAC), 2020, : 568 - 573
  • [29] Ranking recommendation to implement meta-heuristic algorithm based on multi-label k-nearest neighbor method
    Cui J.-S.
    Shang T.-Z.
    Yang F.
    Yu J.-W.
    Kongzhi yu Juece/Control and Decision, 2022, 37 (05): : 1289 - 1298
  • [30] A fast prototype reduction method based on template reduction and visualization-induced self-organizing map for nearest neighbor algorithm
    I-Jing Li
    Jia-Chian Chen
    Jiunn-Lin Wu
    Applied Intelligence, 2013, 39 : 564 - 582