LayerLSH: Rebuilding Locality-Sensitive Hashing Indices by Exploring Density of Hash Values

被引:1
|
作者
Ding, Jiwen [1 ]
Liu, Zhuojin [1 ]
Zhang, Yanfeng [1 ]
Gong, Shufeng [1 ]
Yu, Ge [1 ]
机构
[1] Northeastern Univ, Sch Comp Sci & Engn, Shenyang 110819, Peoples R China
基金
中国国家自然科学基金;
关键词
Costs; Hash functions; Search problems; Nearest neighbor methods; Indexing; Compounds; Licenses; LSH; nearest neighbors search; multi-layered structure; data skewness; LSH; FRAMEWORK; SEARCH;
D O I
10.1109/ACCESS.2022.3182802
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Locality-sensitive hashing (LSH) has attracted extensive research efforts for approximate nearest neighbors (NN) search. However, most of these LSH-based index structures fail to take data distribution into account. They perform well in a uniform data distribution setting but exhibit unstable performance when the data are skewed. As known, most real life data are skewed, which makes LSH suffer. In this paper, we observe that the skewness of hash values resulted from skewed data is a potential reason for performance degradation. To address this problem, we propose to rebuild LSH indices by exploring the density of hash values. The hash values in dense/sparse ranges are carefully reorganized using a multi-layered structure, so that more efforts are put into indexing the dense hash values. We further discuss the benefit in distributed computing. Extensive experiments are conducted to show the effectiveness and efficiency of the reconstructed LSH indices.
引用
收藏
页码:69851 / 69865
页数:15
相关论文
共 50 条
  • [31] Can LSH (locality-sensitive hashing) be replaced by neural network?
    Liu, Renyang
    Zhao, Jun
    Chu, Xing
    Liang, Yu
    Zhou, Wei
    He, Jing
    SOFT COMPUTING, 2024, 28 (02) : 887 - 902
  • [32] Fast anomaly detection with locality-sensitive hashing and hyperparameter autotuning
    Meira, Jorge
    Eiras-Franco, Carlos
    Bolon-Canedo, Veronica
    Marreiros, Goreti
    Alonso-Betanzos, Amparo
    INFORMATION SCIENCES, 2022, 607 : 1245 - 1264
  • [33] CONSULT: accurate contamination removal using locality-sensitive hashing
    Rachtman, Eleonora
    Bafna, Vineet
    Mirarab, Siavash
    NAR GENOMICS AND BIOINFORMATICS, 2021, 3 (03)
  • [34] A novel locality-sensitive hashing for large scale image retrieva
    Li, Junyi
    Li, Jianhua
    Ni, Bingbing
    Yan, Shuicheng
    Journal of Computational Information Systems, 2012, 8 (23): : 9611 - 9617
  • [35] Can LSH (locality-sensitive hashing) be replaced by neural network?
    Renyang Liu
    Jun Zhao
    Xing Chu
    Yu Liang
    Wei Zhou
    Jing He
    Soft Computing, 2024, 28 : 1041 - 1053
  • [36] Maintaining Academic Integrity in Programming: Locality-Sensitive Hashing and Recommendations
    Karnalim, Oscar
    EDUCATION SCIENCES, 2023, 13 (01):
  • [37] Locality-Sensitive Hashing for Efficient Web Application Security Testing
    Ben-Bassat, Ilan
    Rokah, Erez
    PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS SECURITY AND PRIVACY (ICISSP), 2019, : 193 - 204
  • [38] Secure Approximate Nearest Neighbor Search with Locality-Sensitive Hashing
    Song, Shang
    Liu, Lin
    Chen, Rongmao
    Peng, Wei
    Wang, Yi
    COMPUTER SECURITY - ESORICS 2023, PT III, 2024, 14346 : 411 - 430
  • [39] Locality-Sensitive Hashing for Efficient Rendezvous Search: A New Approach
    Jiang, Guann-Yng
    Chang, Cheng-Shang
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2024, 72 (09) : 5674 - 5687
  • [40] Scalable Graph Representation Learning via Locality-Sensitive Hashing
    Chen, Xiusi
    Jiang, Jyun-Yu
    Wang, Wei
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 3878 - 3882