Beating the Direct Sum Theorem in Communication Complexity with Implications for Sketching

被引:0
|
作者
Molinaro, Marco [1 ]
Woodruff, David P. [2 ]
Yaroslavtsev, Grigory [3 ,4 ]
机构
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[2] IBM Almaden, San Jose, CA USA
[3] Penn State Univ, University Pk, PA 16802 USA
[4] IBM Almaden Res Ctr, San Jose, CA USA
关键词
JOHNSON-LINDENSTRAUSS; DATA STREAM; MATRICES;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
A direct sum theorem for two parties and a function f states that the communication cost of solving k copies of f simultaneously with error probability 1/3 is at least k . R-1/3 (f), where R-1/3 (f) is the communication required to solve a single copy of f with error probability 1/3. We improve this for a natural family of functions f, showing that the 1-way communication required to solve k copies of f simultaneously with probability 2/3 is Omega(k.R-1/k (f)). Since R-1/k (f) may be as large as Omega(R-1/3 (f) . log k), we asymptotically beat the direct sum bound for such functions, showing that the trivial upper bound of solving each of the k copies of f with probability 1 - O (1/k) and taking a union bound is optimal! In order to achieve this, our direct sum involves a novel measure of information cost which allows a protocol to abort with constant probability, and otherwise must be correct with very high probability. Moreover, for the functions considered, we show strong lower bounds on the communication cost of protocols with these relaxed guarantees; indeed, our lower bounds match those for protocols that are not allowed to abort. In the distributed and streaming models, where one wants to be correct not only on a single query, but simultaneously on a sequence of n queries, we obtain optimal lower bounds on the communication or space complexity. Lower bounds obtained from our direct sum result show that a number of techniques in the sketching literature are optimal, including the following: (JL transform) Lower bound of Omega(1/epsilon(2) log n/delta) on the dimension of (oblivious) Johnson- Lindenstrauss transforms. (l(p)-estimation) Lower bound for the size of encodings of n vectors in [+/- M](d) that allow l(1) or l(2)-estimation of Omega(n epsilon(-2) log n/delta (log d + log M)). (Matrix sketching) Lower bound of Omega (1/c(2) log n/delta) on the dimension of a matrix sketch S satisfying the entrywise guarantee vertical bar(ASS(T) B)(i;j) (AB)(i;j) - (AB)(i,)j vertical bar <= epsilon parallel to A(i)parallel to(2)parallel to B-j parallel to 2. (Database joins) Lower bound of Omega(n 1/c(2) log n/delta log M) for sketching frequency vectors of n tables in a database, each with M records, in order to allow join size estimation.
引用
收藏
页码:1738 / 1756
页数:19
相关论文
共 50 条
  • [21] Immersive 3D sketching tools: Implications for visual thinking and communication
    Oti, Alfred
    Crilly, Nathan
    Oti, Alfred (a.oti.14@ucl.ac.uk), 1600, Elsevier Ltd (94): : 111 - 123
  • [22] A note on multiparty communication complexity and the Hales-Jewett theorem
    Shraibman, Adi
    INFORMATION PROCESSING LETTERS, 2018, 139 : 44 - 48
  • [23] COMMUNICATION COMPLEXITY OF SUM-TYPE FUNCTIONS INVARIANT UNDER TRANSLATION
    TAMM, U
    INFORMATION AND COMPUTATION, 1995, 116 (02) : 162 - 173
  • [24] Optimal direct sum results for deterministic and randomized decision tree complexity
    Jain, Rahul
    Klauck, Hartmut
    Santha, Miklos
    INFORMATION PROCESSING LETTERS, 2010, 110 (20) : 893 - 897
  • [25] Direct Sum Fails for Zero-Error Average Communication
    Kol, Gillat
    Moran, Shay
    Shpilka, Amir
    Yehudayoff, Amir
    ALGORITHMICA, 2016, 76 (03) : 782 - 795
  • [26] Direct Sum Fails for Zero-Error Average Communication
    Gillat Kol
    Shay Moran
    Amir Shpilka
    Amir Yehudayoff
    Algorithmica, 2016, 76 : 782 - 795
  • [27] New Strong Direct Product Results in Communication Complexity
    Jain, Rahul
    JOURNAL OF THE ACM, 2015, 62 (03)
  • [28] Revisiting the Direct Sum Theorem and Space Lower Bounds in Random Order Streams
    Guha, Sudipto
    Huang, Zhini
    AUTOMATA, LANGUAGES AND PROGRAMMING, PT I, 2009, 5555 : 513 - 524
  • [29] DECOMPOSITION IN DIRECT SUM OF SEMINORMED VECTOR SPACES AND MAZUR-ULAM THEOREM
    Dovgoshey, Oleksiy
    Prestin, Juergen
    Shevchuk, Igor
    MATHEMATICA SLOVACA, 2024, 74 (01) : 143 - 150
  • [30] Bilateral communication complexity of sum-type functions with one processor to be informed
    Alsvede, R.
    Nin', Chaj
    Problemy Peredachi Informatsii, 1994, 30 (01): : 3 - 12