DASH: Data Aware Locality Sensitive Hashing

被引:0
|
作者
Tan, Zongyuan [1 ,3 ]
Wang, Hongya [1 ,2 ,3 ]
Du, Ming [1 ]
Zhang, Jie [1 ]
机构
[1] Donghua Univ, Sch Comp Sci & Technol, Shanghai, Peoples R China
[2] Chinese Acad Sci, State Key Lab Comp Architecture, ICT, Beijing, Peoples R China
[3] Shanghai Key Lab Comp Software Evaluating & Testi, Shanghai, Peoples R China
来源
关键词
LSH; ANNS; High dimensions; Data-dependent hashing; PRODUCT QUANTIZATION;
D O I
10.1007/978-3-031-25198-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Locality sensitive hashing (LSH) has been extensively employed to solve the problem of c-approximate nearest neighbor search (c-ANNS) in high-dimensional spaces. However, the search performance of LSH is degenerated with the number of data increasing. To this end, we propose an efficient method called Data Aware Sensitive Hashing (DASH) to deal with this drawback. DASH is the data-dependent hashing algorithm under considering the residual distance prior. DASH leverages this prior knowledge and provides theoretical guarantee for search results. Our experimental results with various datasets show that DASH achieves better search performance and the running time can reach up to about 4-40x speedups compared with other state-of-the-art methods.
引用
收藏
页码:85 / 100
页数:16
相关论文
共 50 条
  • [1] A Locality Sensitive Hashing Technique for Categorical Data
    Lee, Kyung Mi
    Lee, Keon Myung
    INDUSTRIAL INSTRUMENTATION AND CONTROL SYSTEMS, PTS 1-4, 2013, 241-244 : 3159 - 3164
  • [2] P-QALSH: Parallelizing Query Aware Locality-Sensitive Hashing for Big Data
    Huang, Yikai
    Yao, Zhili
    Feng, Jianlin
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 629 - 635
  • [3] Fast and QoS-Aware Heterogeneous Data Center Scheduling Using Locality Sensitive Hashing
    Islam, Mohammad Shahedul
    Gibson, Matt
    Muzahid, Abdullah
    2015 IEEE 7TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING TECHNOLOGY AND SCIENCE (CLOUDCOM), 2015, : 74 - 81
  • [4] ON THE DISTORTION OF LOCALITY SENSITIVE HASHING
    Chierichetti, Flavio
    Kumar, Ravi
    Panconesi, Alessandro
    Terolli, Erisa
    SIAM JOURNAL ON COMPUTING, 2019, 48 (02) : 350 - 372
  • [5] Trust-Aware Hybrid Collaborative Recommendation with Locality-Sensitive Hashing
    Li, Dejuan
    Esquivel, James A.
    TSINGHUA SCIENCE AND TECHNOLOGY, 2025, 30 (04): : 1421 - 1434
  • [6] Catching the Flow with Locality Sensitive Hashing in Programmable Data Planes
    Cao, Zuowei
    Chen, Xiao
    Sheng, Yiqiang
    Nil, Hong
    PROCEEDINGS OF 2018 IEEE 9TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2018, : 216 - 220
  • [7] Lower bounds on locality sensitive hashing
    Motwani, Rajeev
    Naor, Assaf
    Panigrahy, Rina
    SIAM JOURNAL ON DISCRETE MATHEMATICS, 2007, 21 (04) : 930 - 935
  • [8] Refining Codes for Locality Sensitive Hashing
    Liu, Huawen
    Zhou, Wenhua
    Wu, Zongda
    Zhang, Shichao
    Li, Gang
    Li, Xuelong
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (03) : 1274 - 1284
  • [9] Kernelized Locality-Sensitive Hashing
    Kulis, Brian
    Grauman, Kristen
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (06) : 1092 - 1104
  • [10] Locality Sensitive Hashing Using GMM
    Schmieder, Fabian
    Yang, Bin
    PATTERN RECOGNITION, GCPR 2014, 2014, 8753 : 569 - 581