Communication-Efficient Distributed Optimization of Self-concordant Empirical Loss

被引:12
|
作者
Zhang, Yuchen [1 ]
Xiao, Lin [2 ]
机构
[1] Stanford Univ, Stanford, CA 94305 USA
[2] Microsoft Res, Redmond, WA 98052 USA
来源
关键词
Empirical risk minimization; Distributed optimization; Inexact Newton methods; Self-concordant functions; GRADIENT-METHOD; NEWTON METHOD; ALGORITHMS;
D O I
10.1007/978-3-319-97478-1_11
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
We consider distributed convex optimization problems originating from sample average approximation of stochastic optimization, or empirical risk minimization in machine learning. We assume that each machine in the distributed computing system has access to a local empirical loss function, constructed with i.i.d. data sampled from a common distribution. We propose a communication-efficient distributed algorithm to minimize the overall empirical loss, which is the average of the local empirical losses. The algorithm is based on an inexact damped Newton method, where the inexact Newton steps are computed by a distributed preconditioned conjugate gradient method. We analyze its iteration complexity and communication efficiency for minimizing self-concordant empirical loss functions, and discuss the results for ridge regression, logistic regression and binary classification with a smoothed hinge loss. In a standard setting for supervised learning where the condition number of the problem grows with square root of the sample size, the required number of communication rounds of the algorithm does not increase with the sample size, and only grows slowly with the number of machines.
引用
下载
收藏
页码:289 / 341
页数:53
相关论文
共 50 条
  • [31] Communication-Efficient Distributed Optimization in Networks with Gradient Tracking and Variance Reduction
    Li, Boyue
    Cen, Shicong
    Chen, Yuxin
    Chi, Yuejie
    JOURNAL OF MACHINE LEARNING RESEARCH, 2020, 21
  • [32] D-ADMM: A Communication-Efficient Distributed Algorithm for Separable Optimization
    Mota, Joao F. C.
    Xavier, Joao M. F.
    Aguiar, Pedro M. Q.
    Pueschel, Markus
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2013, 61 (10) : 2718 - 2723
  • [33] Augmented self-concordant barriers and nonlinear optimization problems with finite complexity
    Yu. Nesterov
    J.-Ph. Vial
    Mathematical Programming, 2004, 99 : 149 - 174
  • [34] Augmented self-concordant barriers and nonlinear optimization problems with finite complexity
    Nesterov, Y
    Vial, JP
    MATHEMATICAL PROGRAMMING, 2004, 99 (01) : 149 - 174
  • [35] Efficient-Adam: Communication-Efficient Distributed Adam
    Chen C.
    Shen L.
    Liu W.
    Luo Z.-Q.
    IEEE Transactions on Signal Processing, 2023, 71 : 3257 - 3266
  • [36] Communication-Efficient Algorithms for Statistical Optimization
    Zhang, Yuchen
    Duchi, John C.
    Wainwright, Martin J.
    2012 IEEE 51ST ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2012, : 6792 - 6792
  • [37] Adaptive and Communication-Efficient Zeroth-Order Optimization for Distributed Internet of Things
    Dang, Qianlong
    Yang, Shuai
    Liu, Qiqi
    Ruan, Junhu
    IEEE Internet of Things Journal, 2024, 11 (22) : 37200 - 37213
  • [38] A Communication-efficient Linearly Convergent Algorithm with Variance Reduction for Distributed Stochastic Optimization
    Lei, Jinlong
    Yi, Peng
    Chen, Jie
    Hong, Yiguang
    2020 EUROPEAN CONTROL CONFERENCE (ECC 2020), 2020, : 1250 - 1255
  • [39] Communication-Efficient Distributed Optimization using an Approximate Newton-type Method
    Shamir, Ohad
    Srebro, Nathan
    Zhang, Tong
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 32 (CYCLE 2), 2014, 32 : 1000 - 1008
  • [40] Communication-Efficient Distributed Mining of Association Rules
    Assaf Schuster
    Ran Wolff
    Data Mining and Knowledge Discovery, 2004, 8 : 171 - 196