On Approximating Matrix Norms in Data Streams

Cited by: 5
Authors
Li, Yi [1]
Nguyen, Huy L. [2]
Woodruff, David P. [3]
Affiliations
[1] Nanyang Technol Univ, Div Math, Singapore 637371, Singapore
[2] Northeastern Univ, Khoury Coll Comp Sci, Boston, MA 02115 USA
[3] Carnegie Mellon Univ, Dept Comp Sci, Pittsburgh, PA 15213 USA
Funding
National Science Foundation (USA)
Keywords
Schatten norm; matrix norm; streaming algorithm; approximation algorithm; sketching algorithm; numerical linear algebra; COMMUNICATION COMPLEXITY; LAPLACIAN EIGENVALUES; SPACE COMPLEXITY; ALGORITHMS; POWERS; SUM;
DOI
10.1137/17M1152255
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
This paper presents a systematic study of the space complexity of estimating the Schatten p-norms of an n × n matrix in the turnstile streaming model. Both kinds of space complexities, bit complexity and sketching dimension, are considered. Furthermore, two sketching models, general linear sketching and bilinear sketching, are considered. When p is not an even integer, we show that any one-pass algorithm with constant success probability requires near-linear space in terms of bits. This lower bound holds even for sparse matrices, i.e., matrices with O(1) nonzero entries per row and per column. However, when p is an even integer, we give for sparse matrices an upper bound which, up to logarithmic factors, is the same as estimating the pth moment of an n-dimensional vector. These results considerably strengthen lower bounds in previous work for arbitrary (not necessarily sparse) matrices. Similar near-linear lower bounds are obtained for Ky Fan norms, SVD entropy, eigenvalue shrinkers, and M-estimators, many of which could have been solvable in logarithmic space prior to this work. The results for general linear sketches give separations in the sketching complexity of Schatten p-norms with the corresponding vector p-norms, and rule out a table-lookup nearest-neighbor search for p = 1, making progress on a question of Andoni. The results for bilinear sketches are tight for the rank problem and nearly tight for p ≥ 2; the latter is the first general subquadratic upper bound for sketching the Schatten norms.
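To make the quantities in the abstract concrete, the following is a minimal illustrative sketch, not the paper's algorithm. It assumes the standard definitions: a turnstile stream is a sequence of updates (i, j, delta) that add delta to entry A[i][j], and the Schatten p-norm is the l_p norm of the singular values of A. For an even integer p this equals trace((A^T A)^(p/2))^(1/p), so it can be computed without a singular value decomposition. The function names are hypothetical, and the code stores the whole matrix, so it illustrates the definitions only and is not space-efficient.

```python
from typing import Iterable, List, Tuple

Matrix = List[List[float]]


def apply_turnstile_updates(n: int, updates: Iterable[Tuple[int, int, float]]) -> Matrix:
    """Materialize an n x n matrix from turnstile updates (i, j, delta).

    Each update adds delta (possibly negative) to entry A[i][j].
    Storing A explicitly takes n^2 space; this is for illustration only.
    """
    a = [[0.0] * n for _ in range(n)]
    for i, j, delta in updates:
        a[i][j] += delta
    return a


def transpose(x: Matrix) -> Matrix:
    return [list(row) for row in zip(*x)]


def matmul(x: Matrix, y: Matrix) -> Matrix:
    n = len(x)
    return [[sum(x[i][k] * y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def schatten_norm_even(a: Matrix, p: int) -> float:
    """Schatten p-norm for an even integer p >= 2.

    Uses the identity ||A||_p^p = trace((A^T A)^(p/2)), which holds because
    the eigenvalues of A^T A are the squared singular values of A.
    """
    if p < 2 or p % 2 != 0:
        raise ValueError("p must be an even integer >= 2")
    gram = matmul(transpose(a), a)
    power = gram
    for _ in range(p // 2 - 1):
        power = matmul(power, gram)
    trace = sum(power[i][i] for i in range(len(a)))
    return trace ** (1.0 / p)


if __name__ == "__main__":
    # Entry (1, 1) receives +5 and then -1, as a turnstile stream allows.
    stream = [(0, 0, 3.0), (1, 1, 5.0), (1, 1, -1.0)]
    a = apply_turnstile_updates(2, stream)
    print(schatten_norm_even(a, 2))  # p = 2 is the Frobenius norm
    print(schatten_norm_even(a, 4))
```

For p = 2 the result coincides with the Frobenius norm, which gives a quick sanity check. The trace identity is what makes even p look like a polynomial in the matrix entries; when p is not an even integer no such identity applies.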
Pages: 1643-1697
Page count: 55