Tight Query Complexity Lower Bounds for PCA via Finite Sample Deformed Wigner Law

被引：20

作者：

Simchowitz, Max ^{[1
]}

El Alaoui, Ahmed ^{[1
]}

Recht, Benjamin ^{[1
]}

机构：

[1] Univ Calif Berkeley, Berkeley, CA 94720 USA

来源：

STOC'18: PROCEEDINGS OF THE 50TH ANNUAL ACM SIGACT SYMPOSIUM ON THEORY OF COMPUTING | 2018年

关键词：

Lower Bounds; Query Complexity; PCA; Optimization; Random Matrix Theory;

D O I：

10.1145/3188745.3188796

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

We prove a query complexity lower bound for approximating the top r dimensional eigenspace of a matrix. We consider an oracle model where, given a symmetric matrix M is an element of R-dxd, an algorithm Alg is allowed to make T exact queries of the form w((i)) = Mv((i)) for i in {1, ... ,T}, where v((i)) is drawn from a distribution which depends arbitrarily on the past queries and measurements {v((j)), w((i))}1 <= j <= i-1. We show that for every gap is an element of (0, 1/2], there exists a distribution over matrices M for which 1) gap(r) (M) = Omega(gap) (where gap(r) (M) is the normalized gap between the r and r + 1-st largest-magnitude eigenvector of M), and 2) any algorithm Alg which takes fewer than const x r log d/root gap queries fails (with overwhelming probability) to identity a matrix (V) over cap is an element of R-dxr with orthonormal columns for which ((V) over cap, M (V) over cap) >= (1 - const x gap) Sigma(r)(i-1) lambda(i) (M). Our bound requires only that d is a small polynomial in 1/gap and r, and matches the upper bounds of Musco and Musco '15. Moreover, it establishes a strict separation between convex optimization and randomized, "strict-saddle" non-convex optimization of which PCA is a canonical example: in the former, first-order methods can have dimension-free iteration complexity, whereas in PCA, the iteration complexity of gradient-based methods must necessarily grow with the dimension. Our argument proceeds via a reduction to estimating a rank-r spike in a deformed Wigner model M = W + lambda UU inverted perpendicular, where W is from the Gaussian Orthogonal Ensemble, U is uniform on the d x r-Stieffel manifold and lambda > 1 governs the size of the perturbation. Surprisingly, this ubiquitous random matrix model witnesses the worst-case rate for eigenspace approximation, and the 'accelerated' inverse square-root dependence on the gap in the rate follows as a consequence of the correspendence between the asymptotic eigengap and the size of the perturbation lambda, when lambda is near the "phase transition" lambda = 1. To verify that d need only be polynomial in gap(-1) and r, we prove a finite sample convergence theorem for top eigenvalues of a deformed Wigner matrix, which may be of independent interest. We then lower bound the above estimation problem with a novel technique based on Fano-style data-processing inequalities with truncated likelihoods; the technique generalizes the Bayes-risk lower bound of Chen et al. '16, and we believe it is particularly suited to lower bounds in adaptive settings like the one considered in this paper.

引用

页码：1249 / 1259

页数：11

共 50 条

[1] Statistical Query Lower Bounds for Tensor PCA
Dudeja, Rishabh
Hsu, Daniel
[J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2021, 22
[2] Statistical query lower bounds for tensor PCA
Dudeja, Rishabh
Hsu, Daniel
[J]. Journal of Machine Learning Research, 2021, 22
[3] Tight lower bounds for 2-query LCCs over finite fields
Bhattacharyya, Arnab
Dvir, Zeev
Saraf, Shubhangi
Shpilka, Amir
[J]. 2011 IEEE 52ND ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE (FOCS 2011), 2011, : 638 - 647
[4] LOWER BOUNDS ON QUANTUM QUERY COMPLEXITY
Toran, Jacobo
Hoyer, Peter
Spalek, Robert
[J]. BULLETIN OF THE EUROPEAN ASSOCIATION FOR THEORETICAL COMPUTER SCIENCE, 2005, (87): : 78 - 103
[5] Tight Lower Bounds for the Complexity of Multicoloring
Bonamy, Marthe
Kowalik, Lukasz
Pilipczuk, Michal
Socala, Arkadiusz
Wrochna, Marcin
[J]. ACM TRANSACTIONS ON COMPUTATION THEORY, 2019, 11 (03)
[6] Tight lower bounds for linear 2-query LCCs over finite fields
Bhattacharyya, Arnab
Dvir, Zeev
Saraf, Shubhangi
Shpilka, Amir
[J]. COMBINATORICA, 2016, 36 (01) : 1 - 36
[7] Tight lower bounds for linear 2-query LCCs over finite fields
Arnab Bhattacharyya
Zeev Dvir
Shubhangi Saraf
Amir Shpilka
[J]. Combinatorica, 2016, 36 : 1 - 36
[8] Query complexity lower bounds for reconstruction of codes
Chakraborty, Sourav
Fischer, Eldar
Matsliah, Arie
[J]. Theory of Computing, 2014, 10 : 513 - 533
[9] Nearly tight sample complexity bounds for learning mixtures of Gaussians via sample compression schemes
Ashtiani, Hassan
Ben-David, Shai
Harvey, Nicholas J. A.
Liaw, Christopher
Mehrabian, Abbas
Plan, Yaniv
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
[10] Lower bounds for query complexity of some graph problems
Lace, L
Freivalds, R
[J]. VLSI'03: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON VLSI, 2003, : 309 - 313

← 1 2 3 4 5 →