Nearest neighbor retrieval using distance-based hashing

被引:45
|
作者
Athitsos, Vassilis [1 ]
Potamias, Michalis [2 ]
Papapetrou, Panagiotis [2 ]
Kollios, George [2 ]
机构
[1] Univ Texas Arlington, Dept Comp Sci & Engn, Arlington, TX 76019 USA
[2] Boston Univ, Dept Comp Sci, Boston, MA USA
关键词
D O I
10.1109/ICDE.2008.4497441
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A method is proposed for indexing spaces with arbitrary distance measures, so as to achieve efficient approximate nearest neighbor retrieval. Hashing methods, such as Locality Sensitive Hashing (LSH), have been successfully applied for similarity indexing in vector spaces and string spaces under the Hamming distance. The key novelty of the hashing technique proposed here is that it can be applied to spaces with arbitrary distance measures, including non-metric distance measures. First, we describe a domain-independent method for constructing a family of binary hash functions. Then, we use these functions to construct multiple multibit hash tables. We show that the LSH formalism is not applicable for analyzing the behavior of these tables as index structures. We present a novel formulation, that uses statistical observations from sample data to analyze retrieval accuracy and efficiency for the proposed indexing method. Experiments on several real-world data sets demonstrate that our method produces good trade-offs between accuracy and efficiency, and significantly outperforms VP-trees, which are a well-known method for distance-based indexing.
引用
收藏
页码:327 / +
页数:3
相关论文
共 50 条
  • [1] A Nearest Neighbor Search Engine Using Distance-based Hashing
    Ito, Toshitaka
    Itotani, Yuri
    Wakabayashi, Shin'ichi
    Nagayama, Shinobu
    Inagi, Masato
    2018 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (FPT 2018), 2018, : 153 - 160
  • [2] An Approximate Nearest Neighbor Search Algorithm Using Distance-Based Hashing
    Itotani, Yuri
    Wakabayashi, Shin'ichi
    Nagayama, Shinobu
    Inagi, Masato
    DATABASE AND EXPERT SYSTEMS APPLICATIONS (DEXA 2018), PT II, 2018, 11030 : 203 - 213
  • [3] A generalized mean distance-based k-nearest neighbor classifier
    Gou, Jianping
    Ma, Hongxing
    Ou, Weihua
    Zeng, Shaoning
    Rao, Yunbo
    Yang, Hebiao
    EXPERT SYSTEMS WITH APPLICATIONS, 2019, 115 : 356 - 372
  • [4] Rank-based Hashing for Effective and Efficient Nearest Neighbor Search for Image Retrieval
    Kawai, Vinicius Sato
    Valem, Lucas Pascotti
    Baldassin, Alexandro
    Borin, Edson
    Pedronette, Daniel Carlos Guimaraes
    Latecki, Longin Jan
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (10)
  • [5] Efficient Document Retrieval System using Locality Sensitive Hashing Nearest Neighbor Algorithm and Weighted Jaccard Distance for Retrieving Closest Personalities
    Ben George, E.
    Rosline, G. Jeba
    Balasupramanian, N.
    Blessing, N. R. Wilfred
    JURNAL KEJURUTERAAN, 2024, 36 (04): : 1535 - 1543
  • [6] Comparison of a distance-based likelihood ratio test and k-nearest neighbor classification methods
    Remus, Jeremiah J.
    Morton, Kenneth D.
    Torrione, Peter A.
    Tantum, Stacy L.
    Collins, Leslie A.
    2008 IEEE WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING, 2008, : 362 - 367
  • [7] Improving the Retrieval Performance by Using Distance-Based Bigram
    Aimmanee, P.
    Theeramunkong, T.
    ECTI-CON: 2009 6TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING/ELECTRONICS, COMPUTER, TELECOMMUNICATIONS AND INFORMATION TECHNOLOGY, VOLS 1 AND 2, 2009, : 706 - 709
  • [8] Fast nearest neighbor retrieval using randomized binary codes and approximate Euclidean distance
    Marukatat, Sanparith
    Methasate, Ithipan
    PATTERN RECOGNITION LETTERS, 2013, 34 (09) : 1101 - 1107
  • [9] Complementary Hashing for Approximate Nearest Neighbor Search
    Xu, Hao
    Wang, Jingdong
    Li, Zhu
    Zeng, Gang
    Li, Shipeng
    Yu, Nenghai
    2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2011, : 1631 - 1638
  • [10] Locally Optimized Hashing for Nearest Neighbor Search
    Tokui, Seiya
    Sato, Issei
    Nakagawa, Hiroshi
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PART II, 2015, 9078 : 498 - 509