In Defense of Locality-Sensitive Hashing

被引:22
|
作者
Ding, Kun [1 ,2 ]
Huo, Chunlei [1 ]
Fan, Bin [1 ]
Xiang, Shiming [1 ]
Pan, Chunhong [1 ]
机构
[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100190, Peoples R China
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
Locality-sensitive hashing (LSH); semantic similarity search; two-step hashing; NEAREST-NEIGHBOR; IMAGE RETRIEVAL; KERNEL METHODS; REPRESENTATION; MULTICLASS; CODES; QUANTIZATION; OPTIMIZATION; BINARY; SCENE;
D O I
10.1109/TNNLS.2016.2615085
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hashing-based semantic similarity search is becoming increasingly important for building large-scale content-based retrieval system. The state-of-the-art supervised hashing techniques use flexible two-step strategy to learn hash functions. The first step learns binary codes for training data by solving binary optimization problems with millions of variables, thus usually requiring intensive computations. Despite simplicity and efficiency, locality-sensitive hashing (LSH) has never been recognized as a good way to generate such codes due to its poor performance in traditional approximate neighbor search. We claim in this paper that the true merit of LSH lies in transforming the semantic labels to obtain the binary codes, resulting in an effective and efficient two-step hashing framework. Specifically, we developed the locality-sensitive two-step hashing (LS-TSH) that generates the binary codes through LSH rather than any complex optimization technique. Theoretically, with proper assumption, LS-TSH is actually a useful LSH scheme, so that it preserves the label-based semantic similarity and possesses sublinear query complexity for hash lookup. Experimentally, LS-TSH could obtain comparable retrieval accuracy with state of the arts with two to three orders of magnitudes faster training speed.
引用
收藏
页码:87 / 103
页数:17
相关论文
共 50 条
  • [1] Kernelized Locality-Sensitive Hashing
    Kulis, Brian
    Grauman, Kristen
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (06) : 1092 - 1104
  • [2] Correlated Locality-Sensitive Hashing
    Pagh, Rasmus
    [J]. ALGORITHMS - ESA 2015, 2015, 9294
  • [3] Bit Reduction for Locality-Sensitive Hashing
    Liu, Huawen
    Zhou, Wenhua
    Zhang, Hong
    Li, Gang
    Zhang, Shichao
    Li, Xuelong
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (09) : 12470 - 12481
  • [4] An Improved Algorithm for Locality-Sensitive Hashing
    Cen, Wei
    Miao, Kehua
    [J]. 10TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION (ICCSE 2015), 2015, : 61 - 64
  • [5] Locality-sensitive hashing for the edit distance
    Marcais, Guillaume
    DeBlasio, Dan
    Pandey, Prashant
    Kingsford, Carl
    [J]. BIOINFORMATICS, 2019, 35 (14) : I127 - I135
  • [6] Optimal Parameters for Locality-Sensitive Hashing
    Slaney, Malcolm
    Lifshits, Yury
    He, Junfeng
    [J]. PROCEEDINGS OF THE IEEE, 2012, 100 (09) : 2604 - 2623
  • [7] Locality-sensitive hashing for finding nearest neighbors
    Slaney, Malcolm
    Casey, Michael
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2008, 25 (02) : 128 - 131
  • [8] Using Locality-sensitive Hashing for Rendezvous Search
    Jiang, Guann-Yng
    Chang, Cheng-Shang
    [J]. ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 1743 - 1749
  • [9] Locality-sensitive hashing of permutations for proximity searching
    Figueroa, Karina
    Camarena-Ibarrola, Antonio
    Valero-Elizondo, Luis
    Reyes, Nora
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 36 (05) : 4677 - 4684
  • [10] Non-Metric Locality-Sensitive Hashing
    Mu, Yadong
    Yan, Shuicheng
    [J]. PROCEEDINGS OF THE TWENTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-10), 2010, : 539 - 544