DASH: Data Aware Locality Sensitive Hashing

被引:0
|
作者
Tan, Zongyuan [1 ,3 ]
Wang, Hongya [1 ,2 ,3 ]
Du, Ming [1 ]
Zhang, Jie [1 ]
机构
[1] Donghua Univ, Sch Comp Sci & Technol, Shanghai, Peoples R China
[2] Chinese Acad Sci, State Key Lab Comp Architecture, ICT, Beijing, Peoples R China
[3] Shanghai Key Lab Comp Software Evaluating & Testi, Shanghai, Peoples R China
来源
关键词
LSH; ANNS; High dimensions; Data-dependent hashing; PRODUCT QUANTIZATION;
D O I
10.1007/978-3-031-25198-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Locality sensitive hashing (LSH) has been extensively employed to solve the problem of c-approximate nearest neighbor search (c-ANNS) in high-dimensional spaces. However, the search performance of LSH is degenerated with the number of data increasing. To this end, we propose an efficient method called Data Aware Sensitive Hashing (DASH) to deal with this drawback. DASH is the data-dependent hashing algorithm under considering the residual distance prior. DASH leverages this prior knowledge and provides theoretical guarantee for search results. Our experimental results with various datasets show that DASH achieves better search performance and the running time can reach up to about 4-40x speedups compared with other state-of-the-art methods.
引用
收藏
页码:85 / 100
页数:16
相关论文
共 50 条
  • [21] BOUNDARY-EXPANDING LOCALITY SENSITIVE HASHING
    Wang, Qiang
    Guo, Zhiyuan
    Liu, Gang
    Guo, Jun
    2012 8TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, 2012, : 358 - 362
  • [22] Neural Locality Sensitive Hashing for Entity Blocking
    Wang, Runhui
    Kong, Luyang
    Tao, Yefan
    Borthwick, Andrew
    Golac, Davor
    Johnson, Henrik
    Hijazi, Shadie
    Deng, Dong
    Zhang, Yongfeng
    Proceedings of the 2024 SIAM International Conference on Data Mining, SDM 2024, 2024, : 887 - 895
  • [23] Optimal Parameters for Locality-Sensitive Hashing
    Slaney, Malcolm
    Lifshits, Yury
    He, Junfeng
    PROCEEDINGS OF THE IEEE, 2012, 100 (09) : 2604 - 2623
  • [24] Locality-sensitive hashing for the edit distance
    Marcais, Guillaume
    DeBlasio, Dan
    Pandey, Prashant
    Kingsford, Carl
    BIOINFORMATICS, 2019, 35 (14) : I127 - I135
  • [25] Locality Sensitive Hashing for Data Placement to Optimize Parallel Subgraph Query Evaluation
    Li, Mingdao
    Zhai, Bo
    Jiang, Yuntao
    Li, Yunjian
    Qin, Zheng
    Peng, Peng
    WEB AND BIG DATA, PT I, APWEB-WAIM 2023, 2024, 14331 : 32 - 47
  • [26] Locality Sensitive Hashing of Customer Load Profiles
    Beretka, Sandor F.
    Varga, Ervin D.
    2013 INTERNATIONAL CONFERENCE ON RENEWABLE ENERGY RESEARCH AND APPLICATIONS (ICRERA), 2013, : 353 - 356
  • [27] An Improved Algorithm for Locality-Sensitive Hashing
    Cen, Wei
    Miao, Kehua
    10TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION (ICCSE 2015), 2015, : 61 - 64
  • [28] Locality Sensitive Hashing for Network Traffic Fingerprinting
    Mashnoor, Nowfel
    Thom, Jay
    Rouf, Abdur
    Sengupta, Shamik
    Charyyev, Batyr
    2023 IEEE 29TH INTERNATIONAL SYMPOSIUM ON LOCAL AND METROPOLITAN AREA NETWORKS, LANMAN, 2023,
  • [29] Locality sensitive hashing via mechanical behavior
    Lejeune, Emma
    Prachaseree, Peerasait
    EXTREME MECHANICS LETTERS, 2023, 63
  • [30] Efficient viideo retrieval by locality sensitive hashing
    Hu, SY
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 449 - 452