Accelerating Duplicate Data Chunk Recognition Using NN Trained by Locality-Sensitive Hash

被引:0
|
作者
Berman, Amit [1 ]
Birk, Yitzhak [1 ]
Mendelson, Avi [1 ]
机构
[1] Technion Israel Inst Technol, Dept Elect Engn, IL-32000 Haifa, Israel
关键词
Deduplication; Chunking; Cloud Storage; Neural Network; Machine Learning; Locality-Sensitive Hashing;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Deduplication is often used in storage systems in order to save storage space, communication bandwidth, write energy, and recovery and error-protection infrastructure. However, deduplication overhead increases latency and computation energy. Determining whether a data chunk is already stored by comparing signatures constitutes a significant fraction of this deduplication overhead. In this paper, we propose a statistical chunk classifier based on a neural network. Our technique is based on learning the patterns of locality-sensitive hashing of the data. Our experiments show an acceleration of chunk processing, leading to reduction in deduplication overhead.
引用
收藏
页数:5
相关论文
共 50 条
  • [31] Fast agglomerative hierarchical clustering algorithm using Locality-Sensitive Hashing
    Hisashi Koga
    Tetsuo Ishibashi
    Toshinori Watanabe
    Knowledge and Information Systems, 2007, 12 : 25 - 53
  • [32] Efficient locality-sensitive hashing over high-dimensional streaming data
    Wang, Hao
    Yang, Chengcheng
    Zhang, Xiangliang
    Gao, Xin
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (05): : 3753 - 3766
  • [33] Large-Scale Distributed Locality-Sensitive Hashing for General Metric Data
    Silva, Eliezer
    Teixeira, Thiago
    Teodoro, George
    Valle, Eduardo
    SIMILARITY SEARCH AND APPLICATIONS, 2014, 8821 : 82 - 93
  • [34] Efficient Locality-Sensitive Hashing Over High-Dimensional Data Streams
    Yang, Chengcheng
    Deng, Dong
    Shang, Shuo
    Shao, Ling
    2020 IEEE 36TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2020), 2020, : 1994 - 1997
  • [35] Matching CCD images to a stellar catalog using locality-sensitive hashing
    Bo Liu
    Jia-Zong Yu
    Qing-Yu Peng
    Research in Astronomy and Astrophysics, 2018, 18 (02) : 107 - 116
  • [36] Sieving for Shortest Vectors in Lattices Using Angular Locality-Sensitive Hashing
    Laarhoven, Thijs
    ADVANCES IN CRYPTOLOGY, PT I, 2015, 9215 : 3 - 22
  • [37] Matching CCD images to a stellar catalog using locality-sensitive hashing
    Liu, Bo
    Yu, Jia-Zong
    Peng, Qing-Yu
    RESEARCH IN ASTRONOMY AND ASTROPHYSICS, 2018, 18 (02)
  • [38] Efficient locality-sensitive hashing over high-dimensional streaming data
    Hao Wang
    Chengcheng Yang
    Xiangliang Zhang
    Xin Gao
    Neural Computing and Applications, 2023, 35 : 3753 - 3766
  • [39] Parallel set similarity join on big data based on Locality-Sensitive Hashing
    Sohrabi, Mohammad Karim
    Azgomi, Hosseion
    SCIENCE OF COMPUTER PROGRAMMING, 2017, 145 : 1 - 12
  • [40] In-air Handwritten Chinese character recognition using Discriminative Projection based on Locality-sensitive Sparse Representation
    Qu, Xiwen
    Wang, Weiqiang
    Lu, Ke
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 1137 - 1140