A lower bound for dynamic approximate membership data structures

被引:16
|
作者
Lovett, Shachar [1 ]
Porat, Ely [2 ]
机构
[1] Weizmann Inst Sci, Dept Comp Sci, IL-76100 Rehovot, Israel
[2] Bar Ilan Univ, Ramat Gan, Israel
基金
以色列科学基金会; 欧洲研究理事会;
关键词
Dynamic data structures; Bloom filters; Lower bounds;
D O I
10.1109/FOCS.2010.81
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
An approximate membership data structure is a randomized data structure for representing a set which supports membership queries. It allows for a small false positive error rate but has no false negative errors. Such data structures were first introduced by Bloom in the 1970's, and have since had numerous applications, mainly in distributed systems, database systems, and networks. The algorithm of Bloom is quite effective: it can store a set S of size n by using only approximate to 1:44n log(2)(1/epsilon) bits while having false positive error epsilon. This is within a constant factor of the entropy lower bound of n log(2)(1/epsilon) for storing such sets. Closing this gap is an important open problem, as Bloom filters are widely used is situations were storage is at a premium. Bloom filters have another property: they are dynamic. That is, they support the iterative insertions of up to n elements. In fact, if one removes this requirement, there exist static data structures which receive the entire set at once and can almost achieve the entropy lower bound; they require only n log(2)(1/epsilon) (1 + o(1)) bits. Our main result is a new lower bound for the memory requirements of any dynamic approximate membership data structure. We show that for any constant epsilon > 0, any such data structure which achieves false positive error rate of epsilon must use at least C(epsilon).n log(2)(1/epsilon) memory bits, where C(epsilon) > 1 depends only on epsilon. This shows that the entropy lower bound cannot be achieved by dynamic data structures for any constant error rate. In fact, our lower bound holds even in the setting where the insertion and query algorithms may use shared randomness, and where they are only required to perform well on average.
引用
收藏
页码:797 / 804
页数:8
相关论文
共 50 条
  • [1] A SPACE LOWER BOUND FOR DYNAMIC APPROXIMATE MEMBERSHIP DATA STRUCTURES
    Lovett, Shachar
    Porat, Ely
    SIAM JOURNAL ON COMPUTING, 2013, 42 (06) : 2182 - 2196
  • [2] Succinct data structures for retrieval and approximate membership* (Extended Abstract)
    Dietzfelbinger, Martin
    Pagh, Rasmus
    AUTOMATA, LANGUAGES AND PROGRAMMING, PT 1, PROCEEDINGS, 2008, 5125 : 385 - +
  • [3] Approximate Lower Bound Arguments
    Chaidos, Pyrros
    Kiayias, Aggelos
    Reyzin, Leonid
    Zinovyev, Anatoliy
    ADVANCES IN CRYPTOLOGY, PT IV, EUROCRYPT 2024, 2024, 14654 : 55 - 84
  • [4] The Limits of Buffering: A Tight Lower Bound for Dynamic Membership in the External Memory Model
    Verbin, Elad
    Zhang, Qin
    STOC 2010: PROCEEDINGS OF THE 2010 ACM SYMPOSIUM ON THEORY OF COMPUTING, 2010, : 447 - 456
  • [5] THE LIMITS OF BUFFERING: A TIGHT LOWER BOUND FOR DYNAMIC MEMBERSHIP IN THE EXTERNAL MEMORY MODEL
    Verbin, Elad
    Zhang, Qin
    SIAM JOURNAL ON COMPUTING, 2013, 42 (01) : 212 - 229
  • [6] fimpera: drastic improvement of Approximate Membership Query data-structures with counts
    Robidou, Lucas
    Peterlongo, Pierre
    BIOINFORMATICS, 2023, 39 (05)
  • [7] A lower bound for dynamic scheduling of data parallel programs
    da Silva, FAB
    Campos, LM
    Scherson, ID
    EURO-PAR '98 PARALLEL PROCESSING, 1998, 1470 : 367 - 372
  • [8] Lower Bound Framework for Differentially Private and Oblivious Data Structures
    Persiano, Giuseppe
    Yeo, Kevin
    ADVANCES IN CRYPTOLOGY - EUROCRYPT 2023, PT I, 2023, 14004 : 487 - 517
  • [9] Dynamic data structures for approximate Hausdorff distance in the word RAM
    Chan, Timothy M.
    Skrepetos, Dimitrios
    COMPUTATIONAL GEOMETRY-THEORY AND APPLICATIONS, 2017, 60 : 37 - 44
  • [10] Certifying Certainty and Uncertainty in Approximate Membership Query Structures
    Gopinathan, Kiran
    Sergey, Ilya
    COMPUTER AIDED VERIFICATION, PT II, 2020, 12225 : 279 - 303