Tight Query Complexity Lower Bounds for PCA via Finite Sample Deformed Wigner Law

被引:20
|
作者
Simchowitz, Max [1 ]
El Alaoui, Ahmed [1 ]
Recht, Benjamin [1 ]
机构
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
关键词
Lower Bounds; Query Complexity; PCA; Optimization; Random Matrix Theory;
D O I
10.1145/3188745.3188796
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We prove a query complexity lower bound for approximating the top r dimensional eigenspace of a matrix. We consider an oracle model where, given a symmetric matrix M is an element of R-dxd, an algorithm Alg is allowed to make T exact queries of the form w((i)) = Mv((i)) for i in {1, ... ,T}, where v((i)) is drawn from a distribution which depends arbitrarily on the past queries and measurements {v((j)), w((i))}1 <= j <= i-1. We show that for every gap is an element of (0, 1/2], there exists a distribution over matrices M for which 1) gap(r) (M) = Omega(gap) (where gap(r) (M) is the normalized gap between the r and r + 1-st largest-magnitude eigenvector of M), and 2) any algorithm Alg which takes fewer than const x r log d/root gap queries fails (with overwhelming probability) to identity a matrix (V) over cap is an element of R-dxr with orthonormal columns for which ((V) over cap, M (V) over cap) >= (1 - const x gap) Sigma(r)(i-1) lambda(i) (M). Our bound requires only that d is a small polynomial in 1/gap and r, and matches the upper bounds of Musco and Musco '15. Moreover, it establishes a strict separation between convex optimization and randomized, "strict-saddle" non-convex optimization of which PCA is a canonical example: in the former, first-order methods can have dimension-free iteration complexity, whereas in PCA, the iteration complexity of gradient-based methods must necessarily grow with the dimension. Our argument proceeds via a reduction to estimating a rank-r spike in a deformed Wigner model M = W + lambda UU inverted perpendicular, where W is from the Gaussian Orthogonal Ensemble, U is uniform on the d x r-Stieffel manifold and lambda > 1 governs the size of the perturbation. Surprisingly, this ubiquitous random matrix model witnesses the worst-case rate for eigenspace approximation, and the 'accelerated' inverse square-root dependence on the gap in the rate follows as a consequence of the correspendence between the asymptotic eigengap and the size of the perturbation lambda, when lambda is near the "phase transition" lambda = 1. To verify that d need only be polynomial in gap(-1) and r, we prove a finite sample convergence theorem for top eigenvalues of a deformed Wigner matrix, which may be of independent interest. We then lower bound the above estimation problem with a novel technique based on Fano-style data-processing inequalities with truncated likelihoods; the technique generalizes the Bayes-risk lower bound of Chen et al. '16, and we believe it is particularly suited to lower bounds in adaptive settings like the one considered in this paper.
引用
收藏
页码:1249 / 1259
页数:11
相关论文
共 50 条
  • [1] Statistical Query Lower Bounds for Tensor PCA
    Dudeja, Rishabh
    Hsu, Daniel
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2021, 22
  • [2] Statistical query lower bounds for tensor PCA
    Dudeja, Rishabh
    Hsu, Daniel
    [J]. Journal of Machine Learning Research, 2021, 22
  • [3] Tight lower bounds for 2-query LCCs over finite fields
    Bhattacharyya, Arnab
    Dvir, Zeev
    Saraf, Shubhangi
    Shpilka, Amir
    [J]. 2011 IEEE 52ND ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE (FOCS 2011), 2011, : 638 - 647
  • [4] LOWER BOUNDS ON QUANTUM QUERY COMPLEXITY
    Toran, Jacobo
    Hoyer, Peter
    Spalek, Robert
    [J]. BULLETIN OF THE EUROPEAN ASSOCIATION FOR THEORETICAL COMPUTER SCIENCE, 2005, (87): : 78 - 103
  • [5] Tight Lower Bounds for the Complexity of Multicoloring
    Bonamy, Marthe
    Kowalik, Lukasz
    Pilipczuk, Michal
    Socala, Arkadiusz
    Wrochna, Marcin
    [J]. ACM TRANSACTIONS ON COMPUTATION THEORY, 2019, 11 (03)
  • [6] Tight lower bounds for linear 2-query LCCs over finite fields
    Bhattacharyya, Arnab
    Dvir, Zeev
    Saraf, Shubhangi
    Shpilka, Amir
    [J]. COMBINATORICA, 2016, 36 (01) : 1 - 36
  • [7] Tight lower bounds for linear 2-query LCCs over finite fields
    Arnab Bhattacharyya
    Zeev Dvir
    Shubhangi Saraf
    Amir Shpilka
    [J]. Combinatorica, 2016, 36 : 1 - 36
  • [8] Query complexity lower bounds for reconstruction of codes
    Chakraborty, Sourav
    Fischer, Eldar
    Matsliah, Arie
    [J]. Theory of Computing, 2014, 10 : 513 - 533
  • [9] Nearly tight sample complexity bounds for learning mixtures of Gaussians via sample compression schemes
    Ashtiani, Hassan
    Ben-David, Shai
    Harvey, Nicholas J. A.
    Liaw, Christopher
    Mehrabian, Abbas
    Plan, Yaniv
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [10] Lower bounds for query complexity of some graph problems
    Lace, L
    Freivalds, R
    [J]. VLSI'03: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON VLSI, 2003, : 309 - 313