Accelerating Duplicate Data Chunk Recognition Using NN Trained by Locality-Sensitive Hash

被引:0
|
作者
Berman, Amit [1 ]
Birk, Yitzhak [1 ]
Mendelson, Avi [1 ]
机构
[1] Technion Israel Inst Technol, Dept Elect Engn, IL-32000 Haifa, Israel
关键词
Deduplication; Chunking; Cloud Storage; Neural Network; Machine Learning; Locality-Sensitive Hashing;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Deduplication is often used in storage systems in order to save storage space, communication bandwidth, write energy, and recovery and error-protection infrastructure. However, deduplication overhead increases latency and computation energy. Determining whether a data chunk is already stored by comparing signatures constitutes a significant fraction of this deduplication overhead. In this paper, we propose a statistical chunk classifier based on a neural network. Our technique is based on learning the patterns of locality-sensitive hashing of the data. Our experiments show an acceleration of chunk processing, leading to reduction in deduplication overhead.
引用
收藏
页数:5
相关论文
共 50 条
  • [41] Efficient Data Stream Clustering with Sliding Windows based on Locality-Sensitive Hashing
    Youn, Jonghem
    Shim, Junho
    Lee, Sang-Goo
    IEEE ACCESS, 2018, 6 : 63757 - 63776
  • [42] Efficient Outlier Detection in Hyperedge Streams Using MinHash and Locality-Sensitive Hashing
    Ranshous, Stephen
    Chaudhary, Mandar
    Samatova, Nagiza F.
    COMPLEX NETWORKS & THEIR APPLICATIONS VI, 2018, 689 : 105 - 116
  • [43] Reducing the Complexity of Fingerprinting-Based Positioning using Locality-Sensitive Hashing
    Tang, Larry
    Ghods, Ramina
    Studer, Christoph
    CONFERENCE RECORD OF THE 2019 FIFTY-THIRD ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, 2019, : 1086 - 1090
  • [44] Hardware acceleration of k-mer clustering using locality-sensitive hashing
    Soto, Javier E.
    Krohmer, Thomas
    Hernandez, Cecilia
    Figueroa, Miguel
    2019 22ND EUROMICRO CONFERENCE ON DIGITAL SYSTEM DESIGN (DSD), 2019, : 659 - 662
  • [45] 2L-LSH: A Locality-Sensitive Hash Function-Based Method For Rapid Point Cloud Indexing
    Wang, Shurui
    Zhang, Yuhe
    Guo, Ruizhe
    Zhang, Yaning
    Xie, Yifei
    Zhou, Xinyu
    COMPUTER JOURNAL, 2024, 67 (09): : 2809 - 2818
  • [46] P-QALSH: Parallelizing Query Aware Locality-Sensitive Hashing for Big Data
    Huang, Yikai
    Yao, Zhili
    Feng, Jianlin
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 629 - 635
  • [47] Improving Kernel Locality-Sensitive Hashing Using Pre-Images and Bounds
    Bodo, Zalan
    Csato, Lehel
    2012 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2012,
  • [48] Fast alignment filtering of nanopore sequencing reads using locality-sensitive hashing
    Wang, Jeremy R.
    Jones, Corbin D.
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2015, : 127 - 130
  • [49] Faster Sieving for Shortest Lattice Vectors Using Spherical Locality-Sensitive Hashing
    Laarhoven, Thijs
    de Weger, Benne
    PROGRESS IN CRYPTOLOGY - LATINCRYPT 2015, 2015, 9230 : 101 - 118
  • [50] PA-Star: A disk-assisted parallel A-Star strategy with locality-sensitive hash for multiple sequence alignment
    Sundfeld, Daniel
    Razzolini, Caina
    Teodoro, George
    Boukerche, Azzedine
    Magalhaes Alves de Melo, Alba Cristina
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2018, 112 : 154 - 165