Communication-Efficient Distributed Optimization of Self-concordant Empirical Loss

被引:12
|
作者
Zhang, Yuchen [1 ]
Xiao, Lin [2 ]
机构
[1] Stanford Univ, Stanford, CA 94305 USA
[2] Microsoft Res, Redmond, WA 98052 USA
来源
关键词
Empirical risk minimization; Distributed optimization; Inexact Newton methods; Self-concordant functions; GRADIENT-METHOD; NEWTON METHOD; ALGORITHMS;
D O I
10.1007/978-3-319-97478-1_11
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
We consider distributed convex optimization problems originating from sample average approximation of stochastic optimization, or empirical risk minimization in machine learning. We assume that each machine in the distributed computing system has access to a local empirical loss function, constructed with i.i.d. data sampled from a common distribution. We propose a communication-efficient distributed algorithm to minimize the overall empirical loss, which is the average of the local empirical losses. The algorithm is based on an inexact damped Newton method, where the inexact Newton steps are computed by a distributed preconditioned conjugate gradient method. We analyze its iteration complexity and communication efficiency for minimizing self-concordant empirical loss functions, and discuss the results for ridge regression, logistic regression and binary classification with a smoothed hinge loss. In a standard setting for supervised learning where the condition number of the problem grows with square root of the sample size, the required number of communication rounds of the algorithm does not increase with the sample size, and only grows slowly with the number of machines.
引用
下载
收藏
页码:289 / 341
页数:53
相关论文
共 50 条
  • [1] DiSCO: Distributed Optimization for Self-Concordant Empirical Loss
    Zhang, Yuchen
    Xiao, Lin
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 362 - 370
  • [2] Communication-Efficient Distributed PCA by Riemannian Optimization
    Huang, Long-Kai
    Pan, Sinno Jialin
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
  • [3] Double Quantization for Communication-Efficient Distributed Optimization
    Huang, Longbo
    PROCEEDINGS OF THE 13TH EAI INTERNATIONAL CONFERENCE ON PERFORMANCE EVALUATION METHODOLOGIES AND TOOLS ( VALUETOOLS 2020), 2020, : 2 - 2
  • [4] Double Quantization for Communication-Efficient Distributed Optimization
    Yu, Yue
    Wu, Jiaxiang
    Huang, Longbo
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [5] Communication-Efficient Distributed Optimization with Quantized Preconditioners
    Alimisis, Foivos
    Davies, Peter
    Alistarh, Dan
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [6] Gradient Sparsification for Communication-Efficient Distributed Optimization
    Wangni, Jianqiao
    Wang, Jialei
    Liu, Ji
    Zhang, Tong
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [7] Harvesting Curvatures for Communication-Efficient Distributed Optimization
    Cardoso, Diogo
    Li, Boyue
    Chi, Yuejie
    Xavier, Joao
    2022 56TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2022, : 749 - 753
  • [8] Self-concordant functions for optimization on smooth manifolds
    Jiang, D
    Moore, JB
    Ji, HB
    2004 43RD IEEE CONFERENCE ON DECISION AND CONTROL (CDC), VOLS 1-5, 2004, : 3631 - 3636
  • [9] Self-concordant functions for optimization on smooth manifolds
    Jiang, Danchi
    Moore, John B.
    Ji, Huibo
    JOURNAL OF GLOBAL OPTIMIZATION, 2007, 38 (03) : 437 - 457
  • [10] Self-concordant functions for optimization on smooth manifolds
    Danchi Jiang
    John B. Moore
    Huibo Ji
    Journal of Global Optimization, 2007, 38 : 437 - 457