SES-LSH: Shuffle-Efficient Locality Sensitive Hashing for Distributed Similarity Search

被引:12
|
作者
Li, Dongsheng [1 ]
Zhang, Wanxin [1 ]
Shen, Siqi [1 ]
Zhang, Yiming [1 ]
机构
[1] Natl Univ Def Technol, Coll Comp, Natl Lab Parallel & Distributed Proc, Changsha, Hunan, Peoples R China
基金
美国国家科学基金会;
关键词
Locality Sensitive Hashing; shuffle; location-aware querying; Similarity Search;
D O I
10.1109/ICWS.2017.99
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Locality Sensitive Hashing ( LSH) is a widely used similarity search technique for many web services, such as content-based retrieval services for images and videos. Due to its popularity, much research effort has been devoted to improving the search quality, and the indexing and query performance of LSH. However, most existing variants of LSH can only run on single node, which limits their applicability to large-scale data. In this paper, we present a Shuffle-Efficient Similarity Search scheme based on LSH, which can be efficiently executed in distributed environments, to serve a massive amount of data. In SES-LSH, a shuffle efficient indexing scheme is proposed to reduce the data shuffle when constructing hash tables, and a location-aware querying scheme is proposed to improve the query performance. We have implemented a prototype of SES-LSH based on Spark, and several optimizations have been utilized to improve the fine-grained hash table operations of distributed LSH. Extensive experiments using large-scale real-world datasets show that SES-LSH is remarkably more efficient than existing methods.
引用
收藏
页码:822 / 827
页数:6
相关论文
共 50 条
  • [1] Shuffle-Efficient Distributed Locality Sensitive Hashing on Spark
    Zhang, Wanxin
    Li, Dongsheng
    Xu, Ying
    Zhang, Yiming
    [J]. 2016 IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (INFOCOM WKSHPS), 2016,
  • [2] Fast image similarity search by distributed locality sensitive hashing
    Durmaz, Osman
    Bilge, Hasan Sakir
    [J]. PATTERN RECOGNITION LETTERS, 2019, 128 : 361 - 369
  • [3] Bayesian Locality Sensitive Hashing for Fast Similarity Search
    Satuluri, Venu
    Parthasarathy, Srinivasan
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2012, 5 (05): : 430 - 441
  • [4] Fast Graph Similarity Search via Locality Sensitive Hashing
    Zhang, Boyu
    Liu, Xianglong
    Lang, Bo
    [J]. ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2015, PT I, 2015, 9314 : 623 - 633
  • [5] Can LSH (locality-sensitive hashing) be replaced by neural network?
    Liu, Renyang
    Zhao, Jun
    Chu, Xing
    Liang, Yu
    Zhou, Wei
    He, Jing
    [J]. SOFT COMPUTING, 2024, 28 (02) : 887 - 902
  • [6] Can LSH (locality-sensitive hashing) be replaced by neural network?
    Renyang Liu
    Jun Zhao
    Xing Chu
    Yu Liang
    Wei Zhou
    Jing He
    [J]. Soft Computing, 2024, 28 : 1041 - 1053
  • [7] BCH-LSH: a new scheme of locality-sensitive hashing
    Ma, Yuena
    Feng, Xiaoyi
    Liu, Yang
    Li, Shuhong
    [J]. IET IMAGE PROCESSING, 2018, 12 (06) : 850 - 855
  • [8] Locality-Sensitive Hashing for Efficient Rendezvous Search: A New Approach
    Jiang, Guann-Yng
    Chang, Cheng-Shang
    [J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 2024, 72 (09) : 5674 - 5687
  • [9] LSH-SMILE: Locality Sensitive Hashing Accelerated Simulation and Learning
    Sima, Chonghao
    Xue, Yexiang
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [10] Fast distributed video deduplication via locality-sensitive hashing with similarity ranking
    Li, Yeguang
    Hu, Liang
    Xia, Ke
    Luo, Jie
    [J]. EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2019, 2019 (1)