A meta-indexing method for fast probably approximately correct nearest neighbor searches

被引:1
|
作者
Santini, Simone [1 ]
机构
[1] Univ Autonoma Madrid, Escuela Politecn Super, C Tomas & Valiente 11, Madrid 28049, Spain
关键词
Indexing; Approximate nearest neighbor; Error modeling; Curse of dimensionality; Multimedia data base; Approximate search; QUERIES; ALGORITHM; FILE;
D O I
10.1007/s11042-022-12690-w
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper we present an indexing method for probably approximately correct nearest neighbor queries in high dimensional spaces capable of improving the performance of any index whose performance degrades with the increased dimensionality of the query space. The basic idea of the method is quite simple: we use SVD to concentrate the variance of the inter-element distance in a lower dimensional space, Xi. We do a nearest neighbor query in this space and then we "peek" forward from the nearest neighbor by gathering all the elements whose distance from the query is less than d(Xi) (1 + zeta sigma(2)(Xi)), where d(Xi) is the distance from the nearest neighbor in Xi, sigma(2)(Xi) is the variance of the data in Xi, and zeta a parameter. All the data thus collected form a tentative set T, in which we do a scan using the complete feature space to find the point closest to the query. The advantages of the method are that (1) it can be built on top of virtually any indexing method and (2) we can build a model of the distribution of the error precise enough to allow designing a compromise between error and speed. We show the improvement that we can obtain using data from the SUN data base.
引用
收藏
页码:30465 / 30491
页数:27
相关论文
共 34 条
  • [1] A meta-indexing method for fast probably approximately correct nearest neighbor searches
    Simone Santini
    Multimedia Tools and Applications, 2022, 81 : 30465 - 30491
  • [2] A fast indexing method for multidimensional nearest neighbor search
    Shepherd, J
    Zhu, X
    Megiddo, N
    STORAGE AND RETRIEVAL FOR IMAGE AND VIDEO DATABASES VII, 1998, 3656 : 350 - 355
  • [3] An efficient indexing method for nearest neighbor searches in high-dimensional image databases
    Cha, GH
    Zhu, XM
    Petkovic, D
    Chung, CW
    IEEE TRANSACTIONS ON MULTIMEDIA, 2002, 4 (01) : 76 - 87
  • [4] Probably correct k-nearest neighbor search in high dimensions
    Toyama, Jun
    Kudo, Mineichi
    Imai, Hideyuki
    PATTERN RECOGNITION, 2010, 43 (04) : 1361 - 1372
  • [5] A review of feature indexing methods for fast approximate nearest neighbor search
    The-Anh Pham
    Van-Hao Le
    Dinh-Nghiep Le
    PROCEEDINGS OF 2018 5TH NAFOSTED CONFERENCE ON INFORMATION AND COMPUTER SCIENCE (NICS 2018), 2018, : 372 - 377
  • [6] Fast indexing method for image retrieval using k nearest neighbors searches by principal axis analysis
    Cheng, Shyi-Chyi
    Wu, Tian-Luu
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2006, 17 (01) : 42 - 56
  • [7] Fast Open Modification Spectral Library Searching through Approximate Nearest Neighbor Indexing
    Bittremieux, Wout
    Meysman, Pieter
    Noble, William Stafford
    Laukens, Kris
    JOURNAL OF PROTEOME RESEARCH, 2018, 17 (10) : 3463 - 3474
  • [8] A Fast Nearest Neighbor Method Using Empirical Marginal Distribution
    Kudo, Mineichi
    Toyama, Jun
    Imai, Hideyuki
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT II, PROCEEDINGS, 2009, 5712 : 333 - 339
  • [9] A Fast Speaker Identification Method Using Nearest Neighbor Distance
    Zeinali, Hossein
    Sameti, Hossein
    Babaali, Bagher
    PROCEEDINGS OF 2012 IEEE 11TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP) VOLS 1-3, 2012, : 2159 - 2162
  • [10] An efficient retrieval method for nearest neighbor searches in high-dimensional image database
    Cui, JT
    Liu, WG
    Zhou, LH
    WAVELET ANALYSIS AND ITS APPLICATIONS, AND ACTIVE MEDIA TECHNOLOGY, VOLS 1 AND 2, 2004, : 356 - 361