Pseudorandom Hashing for Space-bounded Computation with Applications in Streaming

Cited by: 0
Authors
Kacham, Praneeth [1 ]
Pagh, Rasmus [2 ]
Thorup, Mikkel [2 ]
Woodruff, David P. [1 ]
Affiliations
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[2] Univ Copenhagen, Copenhagen, Denmark
Keywords
GENERATORS; COMPLEXITY;
DOI
10.1109/FOCS57990.2023.00093
Chinese Library Classification (CLC) number
TP301 [Theory, methods];
Discipline classification code
081202 ;
Abstract
We revisit Nisan's classical pseudorandom generator (PRG) for space-bounded computation (STOC 1990) and its applications in streaming algorithms. We describe a new generator, HashPRG, that can be thought of as a symmetric version of Nisan's generator over larger alphabets. Our generator allows a trade-off between seed length and the time needed to compute a given block of the generator's output. HashPRG can be used to obtain derandomizations with much better update time and without sacrificing space for a large number of data stream algorithms, for example:
  • Andoni's $F_p$ estimation algorithm for constant $p > 2$ (ICASSP, 2017) assumes a random oracle, but achieves optimal space and constant update time. Using HashPRG's time-space trade-off we eliminate the random oracle assumption while preserving the other properties. Previously no time-optimal derandomization was known. Using similar techniques, we give an algorithm for a relaxed version of $\ell_p$ sampling in a turnstile stream. Both of our algorithms use $\tilde{O}(d^{1-2/p})$ bits of space and have $O(1)$ update time.
  • For $0 < p < 2$, the $1 \pm \varepsilon$ approximate $F_p$ estimation algorithm of Kane et al., (STOC, 2011) uses an optimal $O(\varepsilon^{-2} \log d)$ bits of space but has an update time of $O(\log^2(1/\varepsilon) \log\log(1/\varepsilon))$. Using HashPRG, we show that if $1/\sqrt{d} \le \varepsilon \le 1/d^c$ for an arbitrarily small constant $c > 0$, then we can obtain a $1 \pm \varepsilon$ approximate $F_p$ estimation algorithm that uses the optimal $O(\varepsilon^{-2} \log d)$ bits of space and has an update time of $O(\log d)$ in the Word RAM model, which is more than a quadratic improvement in the update time. We obtain similar improvements for entropy estimation.
  • CountSketch, with the fine-grained error analysis of Minton and Price (SODA, 2014). For derandomization, they suggested a direct application of Nisan's generator, yielding a logarithmic multiplicative space overhead. With HashPRG we obtain an efficient derandomization yielding the same asymptotic space as when assuming a random oracle. Our ability to obtain a time-efficient derandomization makes crucial use of HashPRG's symmetry. We also give the first derandomization of a recent private version of CountSketch.
For a $d$-dimensional vector $x$ being updated in a turnstile stream, we show that $\|x\|_\infty$ can be estimated up to an additive error of $\varepsilon \|x\|_2$ using $O(\varepsilon^{-2} \log(1/\varepsilon) \log d)$ bits of space. Additionally, the update time of this algorithm is $O(\log 1/\varepsilon)$ in the Word RAM model. We show that the space complexity of this algorithm is optimal up to constant factors. However, for vectors $x$ with $\|x\|_\infty = \Theta(\|x\|_2)$, we show that the lower bound can be broken by giving an algorithm that uses $O(\varepsilon^{-2} \log d)$ bits of space which approximates $\|x\|_\infty$ up to an additive error of $\varepsilon \|x\|_2$. We use our aforementioned derandomization of the CountSketch data structure to obtain this algorithm, and using the time-space trade-off of HashPRG, we show that the update time of this algorithm is also $O(\log 1/\varepsilon)$ in the Word RAM model.
Pages: 1515 - 1550
Page count: 36
Related papers
50 in total
  • [31] Matchgate and space-bounded quantum computations are equivalent
    Jozsa, Richard
    Kraus, Barbara
    Miyake, Akimasa
    Watrous, John
    [J]. PROCEEDINGS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2010, 466 (2115) : 809 - 830
  • [32] LANGUAGES ACCEPTED BY SPACE-BOUNDED ORACLE MACHINES
    BOOK, RV
    [J]. ACTA INFORMATICA, 1979, 12 (02) : 177 - 185
  • [33] Non-Malleable Codes for Space-Bounded Tampering
    Faust, Sebastian
    Hostakova, Kristina
    Mukherjee, Pratyay
    Venturi, Daniele
    [J]. ADVANCES IN CRYPTOLOGY - CRYPTO 2017, PART II, 2017, 10402 : 95 - 126
  • [34] Trial and error: A new approach to space-bounded learning
    Ameur, Foued
    Fischer, Paul
    Höffgen, Klaus-U.
    auf der Heide, Friedhelm Meyer
    [J]. Acta Informatica, 1996, 33 (05) : 621 - 630
  • [35] SPACE-BOUNDED REDUCIBILITY AMONG COMBINATORIAL PROBLEMS
    JONES, ND
    [J]. JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1975, 11 (01) : 68 - 85
  • [36] Deterministic amplification of space-bounded probabilistic algorithms
    Bar-Yossef, Z
    Goldreich, O
    Wigderson, A
    [J]. FOURTEENTH ANNUAL IEEE CONFERENCE ON COMPUTATIONAL COMPLEXITY, PROCEEDINGS, 1999, : 188 - 198
  • [37] On the complexity of simulating space-bounded quantum computations
    John Watrous
    [J]. computational complexity, 2003, 12 : 48 - 84
  • [38] ON SEVERAL KINDS OF SPACE-BOUNDED ONLINE MULTICOUNTER AUTOMATA
    VOELKEL, L
    [J]. LECTURE NOTES IN COMPUTER SCIENCE, 1985, 199 : 465 - 473
  • [39] Time-bounded and space-bounded sensing in wireless sensor networks
    Saukh, Olga
    Sauter, Robert
    Marron, Pedro Jose
    [J]. DISTRIBUTED COMPUTING IN SENSOR SYSTEMS, 2008, 5067 : 357 - 371
  • [40] SPACE-BOUNDED SIMULATION OF MULTITAPE TURING-MACHINES
    ADLEMAN, LM
    LOUI, MC
    [J]. MATHEMATICAL SYSTEMS THEORY, 1981, 14 (03) : 215 - 222