Bloom Filters, Adaptivity, and the Dictionary Problem

Cited by: 29
Authors
Bender, Michael A. [1]
Farach-Colton, Martin [2]
Goswami, Mayank [3]
Johnson, Rob [4]
McCauley, Samuel [5]
Singh, Shikha [5]
Affiliations
[1] SUNY Stony Brook, Stony Brook, NY 11794 USA
[2] Rutgers State Univ, Piscataway, NJ 08856 USA
[3] CUNY Queens Coll, Flushing, NY 11367 USA
[4] VMware Res, Creekside F,3425 Hillview Ave, Palo Alto, CA 94304 USA
[5] Wellesley Coll, Wellesley, MA 02481 USA
Source
2018 IEEE 59TH ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE (FOCS) | 2018
Funding
European Research Council; National Science Foundation (USA)
Keywords
Bloom filters; approximate membership query data structures; adaptive data structures; dictionary data structures;
DOI
10.1109/FOCS.2018.00026
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
An approximate membership query data structure (AMQ), such as a Bloom, quotient, or cuckoo filter, maintains a compact, probabilistic representation of a set S of keys from a universe U. It supports lookups and inserts. Some AMQs also support deletes. A query for x ∈ S returns PRESENT. A query for x ∉ S returns PRESENT with a tunable false-positive probability ε, and otherwise returns ABSENT. AMQs are widely used to speed up dictionaries that are stored remotely (e.g., on disk or across a network). The AMQ is stored locally (e.g., in memory). The remote dictionary is only accessed when the AMQ returns PRESENT. Thus, the primary performance metric of an AMQ is how often it returns ABSENT for negative queries. Existing AMQs offer weak guarantees on the number of false positives in a sequence of queries. The false-positive probability ε holds only for a single query. It is easy for an adversary to drive an AMQ's false-positive rate towards 1 by simply repeating false positives. This paper shows what it takes to get strong guarantees on the number of false positives. We say that an AMQ is adaptive if it guarantees a false-positive probability of ε for every query, regardless of answers to previous queries. We establish upper and lower bounds for adaptive AMQs. Our lower bound shows that it is impossible to build a small adaptive AMQ, even when the AMQ is immediately told whenever a query is a false positive. On the other hand, we show that it is possible to maintain an AMQ that uses the same amount of local space as a non-adaptive AMQ (up to lower order terms), performs all queries and updates in constant time, and guarantees that each negative query to the dictionary accesses remote storage with probability ε, independent of the results of past queries. Thus, we show that adaptivity can be achieved effectively for free.
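The weakness of non-adaptive AMQs described in the abstract can be illustrated with a minimal Python sketch. It is not the paper's construction: the class names, the SHA-256-based hashing, the filter parameters, and the in-memory "remote" dictionary are all assumptions made for illustration. The sketch places a plain Bloom filter in front of a dictionary, searches for one key that is a false positive, and then repeats that query, so that every repetition reaches the remote dictionary.

```python
import hashlib


class BloomFilter:
    """Minimal non-adaptive Bloom filter (illustrative, not from the paper)."""

    def __init__(self, num_bits: int, num_hashes: int) -> None:
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray(num_bits)  # one byte per bit, for clarity

    def _positions(self, key: str):
        # Derive the hash positions by salting SHA-256 with the hash index.
        for i in range(self.num_hashes):
            digest = hashlib.sha256(f"{i}:{key}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def insert(self, key: str) -> None:
        for pos in self._positions(key):
            self.bits[pos] = 1

    def query(self, key: str) -> bool:
        # True means PRESENT (possibly a false positive); False means ABSENT.
        return all(self.bits[pos] for pos in self._positions(key))


class FilteredDictionary:
    """Local AMQ in front of a stand-in 'remote' dictionary."""

    def __init__(self, amq: BloomFilter) -> None:
        self.amq = amq
        self.remote = {}          # stands in for disk or network storage
        self.remote_accesses = 0  # the cost the AMQ is meant to reduce

    def put(self, key: str, value: int) -> None:
        self.amq.insert(key)
        self.remote[key] = value

    def get(self, key: str):
        if not self.amq.query(key):
            return None           # ABSENT: no remote access needed
        self.remote_accesses += 1
        return self.remote.get(key)


def repeated_false_positive_cost(repeats: int) -> int:
    """Return the remote accesses caused by repeating one false positive."""
    store = FilteredDictionary(BloomFilter(num_bits=64, num_hashes=2))
    for i in range(20):
        store.put(f"key-{i}", i)

    # Probe keys are never inserted, so any PRESENT answer is a false positive.
    false_positive = next(
        (f"probe-{i}" for i in range(100_000) if store.amq.query(f"probe-{i}")),
        None,
    )
    if false_positive is None:
        raise RuntimeError("no false positive found; use a smaller filter")

    before = store.remote_accesses
    for _ in range(repeats):
        store.get(false_positive)
    return store.remote_accesses - before
```

Because the filter's answer for a fixed key never changes, `repeated_false_positive_cost(n)` returns `n`: the repeated negative query pays for a remote access every time. An adaptive AMQ in the abstract's sense would instead keep the false-positive probability at ε on each repetition.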
Pages: 182 - 193
Number of pages: 12