ACCELERATED, PARALLEL, AND PROXIMAL COORDINATE DESCENT

被引:157
|
作者
Fercoq, Olivier [1 ]
Richtarik, Peter [2 ]
机构
[1] Telecom ParisTech, Inst Mines Telecom, LTCI, Paris, France
[2] Univ Edinburgh, Sch Math, Edinburgh, Midlothian, Scotland
基金
英国工程与自然科学研究理事会;
关键词
randomized coordinate descent; acceleration; parallel methods; proximal methods; complexity; partial separability; convex optimization; big data; ALGORITHM;
D O I
10.1137/130949993
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
We propose a new randomized coordinate descent method for minimizing the sum of convex functions each of which depends on a small number of coordinates only. Our method (APPROX) is simultaneously Accelerated, Parallel, and PROXimal; this is the first time such a method is proposed. In the special case when the number of processors is equal to the number of coordinates, the method converges at the rate 2 (omega) over bar(L) over barR(2)/(k + 1)(2), where k is the iteration counter, (omega) over bar is a data-weighted average degree of separability of the loss function, (L) over bar is the average of Lipschitz constants associated with the coordinates and individual functions in the sum, and R is the distance of the initial point from the minimizer. We show that the method can be implemented without the need to perform full-dimensional vector operations, which is the major bottleneck of accelerated coordinate descent. The fact that the method depends on the average degree of separability, and not on the maximum degree, can be attributed to the use of new safe large stepsizes, leading to improved expected separable overapproximation (ESO). These are of independent interest and can be utilized in all existing parallel randomized coordinate descent algorithms based on the concept of ESO. In special cases, our method recovers several classical and recent algorithms such as simple and accelerated proximal gradient descent, as well serial, parallel, and distributed versions of randomized block coordinate descent. Our bounds match or improve on the best known bounds for these methods.
引用
收藏
页码:1997 / 2023
页数:27
相关论文
共 50 条
  • [31] Restarting the accelerated coordinate descent method with a rough strong convexity estimate
    Fercoq, Olivier
    Qu, Zheng
    COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 2020, 75 (01) : 63 - 91
  • [32] Efficient Serial and Parallel SVM Training using Coordinate Descent
    Liossis, Emmanuel
    PROCEEDINGS OF THE 2013 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE FOR ENGINEERING SOLUTIONS (CIES), 2013, : 76 - 83
  • [33] SySCD: A System-Aware Parallel Coordinate Descent Algorithm
    Ioannou, Nikolas
    Mendler-Dunner, Celestine
    Parnell, Thomas
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [34] Accelerated Mini-batch Randomized Block Coordinate Descent Method
    Zhao, Tuo
    Yu, Mo
    Wang, Yiming
    Arora, Raman
    Liu, Han
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
  • [35] Fixing the Convergence Problems in Parallel Asynchronous Dual Coordinate Descent
    Zhang, Huan
    Hsieh, Cho-Jui
    2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2016, : 619 - 628
  • [36] The phase only beamforming synthesis via an accelerated coordinate descent method
    Cong, Yang
    Xiao, Xiangqing
    Zhong, Kai
    Hu, Jinfeng
    DIGITAL SIGNAL PROCESSING, 2022, 128
  • [37] A Block Coordinate Descent Proximal Method for Simultaneous Filtering and Parameter Estimation
    Raziperchikolaei, Ramin
    Bhat, Harish S.
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [38] Accelerated Randomized Coordinate Descent Algorithms for Stochastic Optimization and Online Learning
    Bhandari, Akshita
    Singh, Chandramani
    LEARNING AND INTELLIGENT OPTIMIZATION, LION 12, 2019, 11353 : 1 - 15
  • [39] Analysis dictionary learning using block coordinate descent framework with proximal operators
    Li, Zhenni
    Ding, Shuxue
    Hayashi, Takafumi
    Li, Yujie
    NEUROCOMPUTING, 2017, 239 : 165 - 180
  • [40] A computationally efficient parallel coordinate descent algorithm for MPC: implementation on a PLC
    Necoara, Ion
    Clipici, Dragos N.
    2013 EUROPEAN CONTROL CONFERENCE (ECC), 2013, : 3596 - 3601