Bit Reduction for Locality-Sensitive Hashing

被引:3
|
作者
Liu, Huawen [1 ]
Zhou, Wenhua [2 ]
Zhang, Hong [3 ]
Li, Gang [4 ]
Zhang, Shichao [5 ]
Li, Xuelong [6 ]
机构
[1] Shaoxing Univ, Dept Comp Sci, Shaoxing 312000, Peoples R China
[2] Jinhua Polytech, Coll Informat Engn, Jinhua 321016, Peoples R China
[3] Shaoxing Univ, Dept Math, Shaoxing 312000, Peoples R China
[4] Deakin Univ, Sch Informat Technol, Burwood, Vic 3125, Australia
[5] Cent South Univ, Sch Comp Sci, Changsha 410083, Peoples R China
[6] Northwestern Polytech Univ, Sch Artificial Intelligence OPt & ElectroNics iOPE, Xian 710072, Peoples R China
基金
中国国家自然科学基金;
关键词
Binary codes; Mutual information; Hash functions; Convolutional neural networks; Kernel; Data models; Correlation; Binary representation; hash bit reduction; hash learning; information retrieval; locality-sensitivity hashing (LSH); mutual information; similarity preservation; DESCRIPTORS; FRAMEWORK; SELECTION;
D O I
10.1109/TNNLS.2023.3263195
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Locality-sensitive hashing (LSH) has gained ever-increasing popularity in similarity search for large-scale data. It has competitive search performance when the number of generated hash bits is large, reversely bringing adverse dilemmas for its wide applications. The first purpose of this work is to introduce a novel hash bit reduction schema for hashing techniques to derive shorter binary codes, which has not yet received sufficient concerns. To briefly show how the reduction schema works, the second purpose is to present an effective bit reduction method for LSH under the reduction schema. Specifically, after the hash bits are generated by LSH, they will be put into bit pool as candidates. Then mutual information and data labels are exploited to measure the correlation and structural properties between the hash bits, respectively. Eventually, highly correlated and redundant hash bits can be distinguished and then removed accordingly, without deteriorating the performance greatly. The advantages of our reduction method include that it can not only reduce the number of hash bits effectively but also boost retrieval performance of LSH, making it more appealing and practical in real-world applications. Comprehensive experiments were conducted on three public real-world datasets. The experimental results with representative bit selection methods and the state-of-the-art hashing algorithms demonstrate that the proposed method has encouraging and competitive performance.
引用
收藏
页码:12470 / 12481
页数:12
相关论文
共 50 条
  • [1] In Defense of Locality-Sensitive Hashing
    Ding, Kun
    Huo, Chunlei
    Fan, Bin
    Xiang, Shiming
    Pan, Chunhong
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (01) : 87 - 103
  • [2] Kernelized Locality-Sensitive Hashing
    Kulis, Brian
    Grauman, Kristen
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (06) : 1092 - 1104
  • [3] Correlated Locality-Sensitive Hashing
    Pagh, Rasmus
    [J]. ALGORITHMS - ESA 2015, 2015, 9294
  • [4] An Improved Algorithm for Locality-Sensitive Hashing
    Cen, Wei
    Miao, Kehua
    [J]. 10TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION (ICCSE 2015), 2015, : 61 - 64
  • [5] Locality-sensitive hashing for the edit distance
    Marcais, Guillaume
    DeBlasio, Dan
    Pandey, Prashant
    Kingsford, Carl
    [J]. BIOINFORMATICS, 2019, 35 (14) : I127 - I135
  • [6] Optimal Parameters for Locality-Sensitive Hashing
    Slaney, Malcolm
    Lifshits, Yury
    He, Junfeng
    [J]. PROCEEDINGS OF THE IEEE, 2012, 100 (09) : 2604 - 2623
  • [7] Using Locality-sensitive Hashing for Rendezvous Search
    Jiang, Guann-Yng
    Chang, Cheng-Shang
    [J]. ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 1743 - 1749
  • [8] Locality-sensitive hashing of permutations for proximity searching
    Figueroa, Karina
    Camarena-Ibarrola, Antonio
    Valero-Elizondo, Luis
    Reyes, Nora
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 36 (05) : 4677 - 4684
  • [9] Locality-sensitive hashing for finding nearest neighbors
    Slaney, Malcolm
    Casey, Michael
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2008, 25 (02) : 128 - 131
  • [10] Non-Metric Locality-Sensitive Hashing
    Mu, Yadong
    Yan, Shuicheng
    [J]. PROCEEDINGS OF THE TWENTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-10), 2010, : 539 - 544