Learned FBF: Learning-Based Functional Bloom Filter for Key-Value Storage

Cited by: 11
Authors
Byun, Hayoung [1]
Lim, Hyesook [2]
Affiliations
[1] Myongji Univ, Dept Elect Engn, Yongin 17058, South Korea
[2] Ewha Womans Univ, Dept Elect & Elect Engn, Seoul 03760, South Korea
Funding
National Research Foundation, Singapore;
Keywords
Data structures; Data models; Programming; Memory management; Indexes; Task analysis; Neural networks; Key-value storage; functional Bloom filter; deep learning; search failure;
DOI
10.1109/TC.2021.3112079
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
As a challenging attempt to replace a traditional data structure with a learned model, this paper proposes a learned functional Bloom filter (L-FBF) for a key-value storage. The learned model in the proposed L-FBF learns the characteristics and the distribution of given data and classifies each input. It is shown through theoretical analysis that the L-FBF provides a lower search failure rate than a single FBF in the same memory size, while providing the same semantic guarantees. For model training, character-level neural networks are used with pretrained embeddings. In experiments, four types of different character-level neural networks are trained: a single gated recurrent unit (GRU), two GRUs, a single long short-term memory (LSTM), and a single one-dimensional convolutional neural network (1D-CNN). Experimental results prove the validity of theoretical results, and show that the L-FBF reduces the search failures by 82.8% to 83.9% when compared with a single FBF under the same amount of memory used.
Pages: 1928 - 1938
Number of pages: 11
Related papers
50 in total
  • [1] kBF: Towards Approximate and Bloom Filter based Key-Value Storage for Cloud Computing Systems
    Xiong, Sisi
    Yao, Yanjun
    Li, Shuangjiang
    Cao, Qing
    He, Tian
    Qi, Hairong
    Tolbert, Leon
    Liu, Yilu
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2017, 5 (01) : 85 - 98
  • [2] kBF: a Bloom Filter for Key-Value Storage with an Application on Approximate State Machines
    Xiong, Sisi
    Yao, Yanjun
    Cao, Qing
    He, Tian
    2014 PROCEEDINGS IEEE INFOCOM, 2014, : 1150 - 1158
  • [3] A matrix-indexed bloom filter for flash-based key-value store
    Li, Wei
    Zhang, Dafang
    Xie, Kun
    Li, Wenwei
    He, Jie
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2015, 52 (05) : 1210 - 1222
  • [4] Key-Value Storage Engines
    Idreos, Stratos
    Callaghan, Mark
    SIGMOD'20: PROCEEDINGS OF THE 2020 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2020, : 2667 - 2672
  • [5] Flame DB: A Key-Value Store With Grouped Level Structure and Heterogeneous Bloom Filter
    Zhang, Weitao
    Xu, Yinlong
    Li, Yongkun
    Zhang, Yueming
    Li, Dinglong
    IEEE ACCESS, 2018, 6 : 24962 - 24972
  • [6] An FPGA Implementation of High-Throughput Key-Value Store Using Bloom Filter
    Cho, Jae Min
    Choi, Kiyoung
    2014 INTERNATIONAL SYMPOSIUM ON VLSI DESIGN, AUTOMATION AND TEST (VLSI-DAT), 2014,
  • [7] Deep Learning-Based Bloom Filter for Efficient Multi-key Membership Testing
    Haitian Chen
    Ziwei Wang
    Yunchuan Li
    Ruixin Yang
    Yan Zhao
    Rui Zhou
    Kai Zheng
    Data Science and Engineering, 2023, 8 : 234 - 246
  • [8] Deep Learning-Based Bloom Filter for Efficient Multi-key Membership Testing
    Chen, Haitian
    Wang, Ziwei
    Li, Yunchuan
    Yang, Ruixin
    Zhao, Yan
    Zhou, Rui
    Zheng, Kai
    DATA SCIENCE AND ENGINEERING, 2023, 8 (03) : 234 - 246
  • [9] L-rCBF: Learning-Based Key-Value Data Structure for Dynamic Data Processing
    Lee, Yejee
    Byun, Hayoung
    APPLIED SCIENCES-BASEL, 2023, 13 (22):
  • [10] BloomStore: Bloom-Filter based Memory-efficient Key-Value Store for Indexing of Data Deduplication on Flash
    Lu, Guanlin
    Nam, Young Jin
    Du, David H. C.
    2012 IEEE 28TH SYMPOSIUM ON MASS STORAGE SYSTEMS AND TECHNOLOGIES (MSST), 2012,