Quantized ranking for permutation-based indexing

被引:15
|
作者
Mohamed, Hisham [1 ]
Marchand-Maillet, Stephane [1 ]
机构
[1] Univ Geneva, Dept Comp Sci, Viper Grp, CH-1227 Carouge, Switzerland
基金
瑞士国家科学基金会;
关键词
Large-scale indexing; Permutation-based indexing; Approximate similarity search; Metric permutation table; Quantized ranking; Big-data; BINARY SEARCH TREES; NEAREST-NEIGHBOR; PROXIMITY; REGION;
D O I
10.1016/j.is.2015.01.009
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The K-Nearest Neighbor (K-NN) search problem is the way to find the K closest and most similar objects to a given query. The K-NN is essential for many applications such as information retrieval and visualization, machine learning and data mining. The exponential growth of data imposes to find approximate approaches to this problem. Permutation-based indexing is one of the most recent techniques for approximate similarity search. Objects are represented by permutation lists ordering their distances to a set of selected reference objects, following the idea that two neighboring objects have the same surrounding. In this paper, we propose a novel quantized representation of permutation lists with its related data structure for effective retrieval on single and multicore architectures. Our novel permutation-based indexing strategy is built to be fast, memory efficient and scalable. This is experimentally demonstrated in comparison to existing proposals using several large-scale datasets of millions of documents and of different dimensions. (C) 2015 Elsevier Ltd. All rights reserved.
引用
收藏
页码:163 / 175
页数:13
相关论文
共 50 条
  • [1] Quantized Ranking for Permutation-Based Indexing
    Mohamed, Hisham
    Marchand-Maillet, Stephane
    [J]. SIMILARITY SEARCH AND APPLICATIONS (SISAP), 2013, 8199 : 103 - 114
  • [2] Permutation-based tests of perfect ranking
    Zamanzade, Ehsan
    Arghami, Nasser Reza
    Vock, Michael
    [J]. STATISTICS & PROBABILITY LETTERS, 2012, 82 (12) : 2213 - 2220
  • [3] Employing GPU architectures for permutation-based indexing
    Krulis, Martin
    Osipyan, Hasmik
    Marchand-Maillet, Stephane
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (09) : 11859 - 11887
  • [4] Employing GPU architectures for permutation-based indexing
    Martin Kruliš
    Hasmik Osipyan
    Stéphane Marchand-Maillet
    [J]. Multimedia Tools and Applications, 2017, 76 : 11859 - 11887
  • [5] Quantifying the Invariance and Robustness of Permutation-Based Indexing Schemes
    Marchand-Maillet, Stephane
    Roman-Rangel, Edgar
    Mohamed, Hisham
    Nielsen, Frank
    [J]. SIMILARITY SEARCH AND APPLICATIONS, SISAP 2016, 2016, 9939 : 79 - 92
  • [6] A comparison of pivot selection techniques for permutation-based indexing
    Amato, Giuseppe
    Esuli, Andrea
    Falchi, Fabrizio
    [J]. INFORMATION SYSTEMS, 2015, 52 : 176 - 188
  • [7] Re-ranking via local embeddings: A use case with permutation-based indexing and the nSimplex projection
    Vadicamo, Lucia
    Gennaro, Claudio
    Falchi, Fabrizio
    Chavez, Edgar
    Connor, Richard
    Amato, Giuseppe
    [J]. INFORMATION SYSTEMS, 2021, 95
  • [8] Multi-Core (CPU and GPU) for Permutation-Based Indexing
    Mohamed, Hisham
    Osipyan, Hasmik
    Marchand-Maillet, Stephane
    [J]. SIMILARITY SEARCH AND APPLICATIONS, 2014, 8821 : 277 - 288
  • [9] Cancelable Permutation-Based Indexing for Secure and Efficient Biometric Identification
    Murakami, Takao
    Fujita, Ryo
    Ohki, Tetsushi
    Kaga, Yosuke
    Fujio, Masakazu
    Takahashi, Kenta
    [J]. IEEE ACCESS, 2019, 7 : 45563 - 45582
  • [10] Deep Permutations: Deep Convolutional Neural Networks and Permutation-Based Indexing
    Amato, Giuseppe
    Falchi, Fabrizio
    Gennaro, Claudio
    Vadicamo, Lucia
    [J]. SIMILARITY SEARCH AND APPLICATIONS, SISAP 2016, 2016, 9939 : 93 - 106