DET-LSH: A Locality-Sensitive Hashing Scheme with Dynamic Encoding Tree for Approximate Nearest Neighbor Search

被引:1
|
作者
Wei, Jiuqi [1 ,2 ]
Peng, Botao [1 ]
Lee, Xiaodong [1 ]
Palpanas, Themis [3 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
[3] Univ Paris Cite, LIPADE, Paris, France
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2024年 / 17卷 / 09期
基金
中国国家自然科学基金;
关键词
DATA SERIES; SIMILARITY SEARCH;
D O I
10.14778/3665844.3665854
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Locality-sensitive hashing (LSH) is a well-known solution for approximate nearest neighbor (ANN) search in high-dimensional spaces due to its robust theoretical guarantee on query accuracy. Traditional LSH-based methods mainly focus on improving the efficiency and accuracy of the query phase by designing different query strategies, but pay little attention to improving the efficiency of the indexing phase. They typically fine-tune existing data-oriented partitioning trees to index data points and support their query strategies. However, their strategy to directly partition the multidimensional space is time-consuming, and performance degrades as the space dimensionality increases. In this paper, we design an encoding-based tree called Dynamic Encoding Tree (DE-Tree) to improve the indexing efficiency and support efficient range queries based on Euclidean distance. Based on DE-Tree, we propose a novel LSH scheme called DET-LSH. DET-LSH adopts a novel query strategy, which performs range queries in multiple independent index DE-Trees to reduce the probability of missing exact NN points, thereby improving the query accuracy. Our theoretical studies show that DET-LSH enjoys probabilistic guarantees on query accuracy. Extensive experiments on real-world datasets demonstrate the superiority of DET-LSH over the state-of-the-art LSH-based methods on both efficiency and accuracy. While achieving better query accuracy than competitors, DET-LSH achieves up to 6x speedup in indexing time and 2x speedup in query time over the state-of-the-art LSH-based methods.
引用
收藏
页码:2241 / 2254
页数:14
相关论文
共 16 条
  • [1] Secure Approximate Nearest Neighbor Search with Locality-Sensitive Hashing
    Song, Shang
    Liu, Lin
    Chen, Rongmao
    Peng, Wei
    Wang, Yi
    [J]. COMPUTER SECURITY - ESORICS 2023, PT III, 2024, 14346 : 411 - 430
  • [2] Query-Aware Locality-Sensitive Hashing for Approximate Nearest Neighbor Search
    Huang, Qiang
    Feng, Jianlin
    Zhang, Yikai
    Fang, Qiong
    Ng, Wilfred
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2015, 9 (01): : 1 - 12
  • [3] Improved locality-sensitive hashing method for the approximate nearest neighbor problem
    陆颖华
    马廷淮
    钟水明
    曹杰
    王新
    Abdullah Al-Dhelaane
    [J]. Chinese Physics B, 2014, 23 (08) : 221 - 229
  • [4] Improved locality-sensitive hashing method for the approximate nearest neighbor problem
    Lu Ying-Hua
    Ma Ting-Huai
    Zhong Shui-Ming
    Cao Jie
    Wang Xin
    Al-Dhelaan, Abdullah
    [J]. CHINESE PHYSICS B, 2014, 23 (08)
  • [5] Fast Locality-Sensitive Hashing Frameworks for Approximate Near Neighbor Search
    Christiani, Tobias
    [J]. SIMILARITY SEARCH AND APPLICATIONS (SISAP 2019), 2019, 11807 : 3 - 17
  • [6] BCH-LSH: a new scheme of locality-sensitive hashing
    Ma, Yuena
    Feng, Xiaoyi
    Liu, Yang
    Li, Shuhong
    [J]. IET IMAGE PROCESSING, 2018, 12 (06) : 850 - 855
  • [7] Falconn plus plus : A Locality-sensitive Filtering Approach for Approximate Nearest Neighbor Search
    Pham, Ninh
    Liu, Tao
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [8] Anomaly detection based on Nearest Neighbor search with Locality-Sensitive B-tree
    Shen, Maying
    Jiang, Xinghao
    Sun, Tanfeng
    [J]. NEUROCOMPUTING, 2018, 289 : 55 - 67
  • [9] DB-LSH: Locality-Sensitive Hashing with Query-based Dynamic Bucketing
    Tian, Yao
    Zhao, Xi
    Thou, Xiaofang
    [J]. 2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022), 2022, : 2250 - 2262
  • [10] Tree-based compact hashing for approximate nearest neighbor search
    Hou, Guangdong
    Cui, Runpeng
    Pan, Zheng
    Zhang, Changshui
    [J]. NEUROCOMPUTING, 2015, 166 : 271 - 281