Communication-Efficient Algorithms for Statistical Optimization

Cited by: 0
Authors
Zhang, Yuchen [1]
Duchi, John C. [1]
Wainwright, Martin J. [2]
Affiliations
[1] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA
[2] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
Keywords
distributed learning; stochastic optimization; averaging; subsampling; STOCHASTIC-APPROXIMATION;
DOI
Not available
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
We analyze two communication-efficient algorithms for distributed optimization in statistical settings involving large-scale data sets. The first algorithm is a standard averaging method that distributes the $N$ data samples evenly to $m$ machines, performs separate minimization on each subset, and then averages the estimates. We provide a sharp analysis of this average mixture algorithm, showing that under a reasonable set of conditions, the combined parameter achieves mean-squared error (MSE) that decays as $O(N^{-1} + (N/m)^{-2})$. Whenever $m \le \sqrt{N}$, this guarantee matches the best possible rate achievable by a centralized algorithm having access to all $N$ samples. The second algorithm is a novel method, based on an appropriate form of bootstrap subsampling. Requiring only a single round of communication, it has mean-squared error that decays as $O(N^{-1} + (N/m)^{-3})$, and so is more robust to the amount of parallelization. In addition, we show that a stochastic gradient-based method attains mean-squared error decaying as $O(N^{-1} + (N/m)^{-3/2})$, easing computation at the expense of a potentially slower MSE rate. We also provide an experimental evaluation of our methods, investigating their performance both on simulated data and on a large-scale regression problem from the internet search domain. In particular, we show that our methods can be used to efficiently solve an advertisement prediction problem from the Chinese SoSo Search Engine, which involves logistic regression with $N \approx 2.4 \times 10^{8}$ samples and $d \approx 740{,}000$ covariates.
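The averaging method described in the abstract can be illustrated with a minimal sketch. This is not the authors' code: the one-dimensional least-squares objective, the simulated data, and the names `local_minimizer` and `average_mixture` are assumptions chosen only so that the local minimization has a closed form.

```python
import random


def local_minimizer(xs, ys):
    """Minimize sum((y - theta * x)^2) over theta for one subset (closed form)."""
    return sum(x * y for x, y in zip(xs, ys)) / sum(x * x for x in xs)


def average_mixture(xs, ys, m):
    """Split the samples evenly over m machines, solve locally, average the estimates."""
    n = len(xs) // m  # samples per machine; any remainder is dropped in this sketch
    estimates = [
        local_minimizer(xs[i * n:(i + 1) * n], ys[i * n:(i + 1) * n])
        for i in range(m)
    ]
    return sum(estimates) / m


# Hypothetical simulated data: y = theta_true * x + unit-variance Gaussian noise.
rng = random.Random(0)
theta_true = 2.0
N, m = 10_000, 10
xs = [rng.gauss(0.0, 1.0) for _ in range(N)]
ys = [theta_true * x + rng.gauss(0.0, 1.0) for x in xs]

theta_avg = average_mixture(xs, ys, m)   # one round of communication: m scalars
theta_central = local_minimizer(xs, ys)  # centralized baseline using all N samples
```

Here $m = 10 \le \sqrt{N} = 100$, the regime in which the abstract says the averaged estimate matches the centralized rate: when $m \le \sqrt{N}$, the second term satisfies $(N/m)^{-2} = m^2/N^2 \le N^{-1}$, so the bound reduces to $O(N^{-1})$.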
Pages: 3321-3363
Page count: 43
Related papers
50 in total
  • [21] Communication-efficient Algorithms for Distributed Stochastic Principal Component Analysis
    Garber, Dan
    Shamir, Ohad
    Srebro, Nathan
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [22] Communication-efficient distributed optimization with adaptability to system heterogeneity
    Yu, Ziyi
    Freris, Nikolaos M.
    [J]. 2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 3321 - 3326
  • [23] Communication-Efficient Stochastic Gradient Descent Ascent with Momentum Algorithms
    Zhang, Yihan
    Qiu, Meikang
    Gao, Hongchang
    [J]. PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 4602 - 4610
  • [24] Heterogeneity-aware and communication-efficient distributed statistical inference
    Duan, Rui
    Ning, Yang
    Chen, Yong
    [J]. BIOMETRIKA, 2022, 109 (01) : 67 - 83
  • [25] COMMUNICATION-EFFICIENT WEIGHTED ADMM FOR DECENTRALIZED NETWORK OPTIMIZATION
    Ling, Qing
    Liu, Yaohua
    Shi, Wei
    Tian, Zhi
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 4821 - 4825
  • [26] Communication-efficient Distributed Learning for Large Batch Optimization
    Liu, Rui
    Mozafari, Barzan
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [27] Adaptive Bit Allocation for Communication-Efficient Distributed Optimization
    Reisizadeh, Hadi
    Touri, Behrouz
    Mohajer, Soheil
    [J]. 2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 1994 - 2001
  • [28] Communication-efficient decentralised algorithms for seismic tomography with sensor networks
    Zhao, Liang
    [J]. INTERNATIONAL JOURNAL OF PARALLEL EMERGENT AND DISTRIBUTED SYSTEMS, 2020, 35 (05) : 550 - 570
  • [29] CoCoA: A General Framework for Communication-Efficient Distributed Optimization
    Smith, Virginia
    Forte, Simone
    Ma, Chenxin
    Takac, Martin
    Jordan, Michael I.
    Jaggi, Martin
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2018, 18
  • [30] COMMUNICATION-EFFICIENT PARALLEL ALGORITHMS FOR DISTRIBUTED RANDOM-ACCESS MACHINES
    LEISERSON, CE
    MAGGS, BM
    [J]. ALGORITHMICA, 1988, 3 (01) : 53 - 77