ON ADAPTIVE SKETCH-AND-PROJECT FOR SOLVING LINEAR SYSTEMS

被引:31
|
作者
Gower, Robert M. [1 ]
Molitor, Denali [2 ]
Moorman, Jacob [2 ]
Needell, Deanna [2 ]
机构
[1] Univ Paris Saclay, LTCI, Telecom ParisTech, F-91120 Palaiseau, France
[2] Univ Calif Los Angeles, Dept Math, Los Angeles, CA 90024 USA
基金
美国国家科学基金会;
关键词
sketch-and-project; adaptive sampling; least squares; randomized Kaczmarz; coordinate descent; RANDOMIZED EXTENDED KACZMARZ; COORDINATE DESCENT METHODS; OPTIMIZATION; CONVERGENCE; ALGORITHM; GREEDY; RECONSTRUCTION; CONSTRAINTS; EFFICIENCY;
D O I
10.1137/19M1285846
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
We generalize the concept of adaptive sampling rules to the sketch-and-project method for solving linear systems. Analyzing adaptive sampling rules in the sketch-and-project setting yields convergence results that apply to all special cases at once, including the Kaczmarz and coordinate descent. This eliminates the need to separately analyze analogous adaptive sampling rules in each special case. To deduce new sampling rules, we show how the progress of one step of the sketch-and-project method depends directly on a sketched residual. Based on this insight, we derive a (1) max-distance sampling rule, by sampling the sketch with the largest sketched residual, (2) a proportional sampling rule, by sampling proportional to the sketched residual, and finally (3) a capped sampling rule. The capped sampling rule is a generalization of the recently introduced adaptive sampling rules for the Kaczmarz method [Z.-Z. Bai and W.-T. Wu, SIAM J. Sci. Comput., 40 (2018), pp. A592-A606]. We provide a global exponential convergence theorem for each sampling rule and show that the max-distance sampling rule enjoys the fastest convergence. This finding is also verified in extensive numerical experiments that lead us to conclude that the max-distance sampling rule is superior both experimentally and theoretically to the capped sampling rule. We also provide numerical insights into implementing the adaptive strategies so that the per iteration cost is of the same order as using a fixed sampling strategy when the product of the number of sketches with the sketch size is not significantly larger than the number of columns.
引用
收藏
页码:954 / 989
页数:36
相关论文
共 50 条
  • [1] Sketch-and-project methods for tensor linear systems
    Tang, Ling
    Yu, Yajie
    Zhang, Yanjun
    Li, Hanyu
    NUMERICAL LINEAR ALGEBRA WITH APPLICATIONS, 2023, 30 (02)
  • [2] On convergence of a sketch-and-project method for the matrix equation AXB=C
    Bao W.
    Guo Z.
    Li W.
    Lv Y.
    Computational and Applied Mathematics, 2024, 43 (06)
  • [3] Sharp Analysis of Sketch-and-Project Methods via a Connection to Randomized Singular Value Decomposition\ast
    Derezinski, Michal
    Rebrova, Elizaveta
    SIAM JOURNAL ON MATHEMATICS OF DATA SCIENCE, 2024, 6 (01): : 127 - 153
  • [4] An adaptive CGNR algorithm for solving large linear systems
    Li, CG
    ANNALS OF OPERATIONS RESEARCH, 2001, 103 (1-4) : 329 - 338
  • [5] An Adaptive CGNR Algorithm for Solving Large Linear Systems
    Chunguang Li
    Annals of Operations Research, 2001, 103 : 329 - 338
  • [6] ON ADAPTIVE STOCHASTIC HEAVY BALL MOMENTUM FOR SOLVING LINEAR SYSTEMS
    Zeng, Yun
    Han, Deren
    Su, Yansheng
    Xie, Jiaxin
    SIAM Journal on Matrix Analysis and Applications, 2024, 45 (03) : 1259 - 1286
  • [7] A count sketch maximal weighted residual Kaczmarz method for solving highly overdetermined linear systems
    Zhang, Yanjun
    Li, Hanyu
    APPLIED MATHEMATICS AND COMPUTATION, 2021, 410
  • [8] Balancing parallel adaptive FEM computations by solving systems of linear equations
    Meyerhenke, H
    Schamberger, S
    EURO-PAR 2005 PARALLEL PROCESSING, PROCEEDINGS, 2005, 3648 : 209 - 219
  • [9] COMPOSITE ADAPTIVE PROCEDURE FOR SOLVING LARGE SPARSE LINEAR-SYSTEMS
    MAI, TZ
    LINEAR ALGEBRA AND ITS APPLICATIONS, 1991, 154 : 509 - 534
  • [10] ON THE EFFECTIVENESS OF ADAPTIVE TSCHEBYSCHEFF ACCELERATION FOR SOLVING SYSTEMS OF LINEAR-EQUATIONS
    JEA, KC
    YOUNG, DM
    JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 1988, 24 (1-2) : 33 - 54