A distributed gradient algorithm based on randomized block-coordinate and projection-free over networks

Cited by: 0
Authors
Junlong Zhu
Xin Wang
Mingchuan Zhang
Muhua Liu
Qingtao Wu
Affiliations
[1] Henan University of Science and Technology,The School of Information Engineering
[2] Shanghai International Studies University,The School of Business and Management
Keywords
Block-coordinate descent; Computational complexity; Frank–Wolfe algorithm; Huge-scale optimization
DOI
Not available
Abstract
The computational bottleneck of distributed optimization methods based on projected gradient descent is the computation of a full gradient vector and the projection step, which is particularly costly for large datasets. To reduce the computational complexity of existing methods, we combine the randomized block-coordinate descent and Frank–Wolfe techniques and propose a distributed randomized block-coordinate projection-free algorithm over networks, in which each agent randomly chooses a subset of the coordinates of its gradient vector and the projection step is replaced by a much simpler linear optimization step. We also analyze the convergence of the proposed algorithm theoretically. Specifically, we rigorously prove that the proposed algorithm converges to an optimal point at a rate of $\mathcal{O}(1/t)$ under convexity and $\mathcal{O}(1/t^2)$ under strong convexity, where $t$ is the number of iterations. Furthermore, under non-convexity, the proposed algorithm converges to a stationary point, where the Frank–Wolfe gap equals zero, at a rate of $\mathcal{O}(1/\sqrt{t})$.
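For reference, the Frank–Wolfe gap used as the stationarity measure is commonly defined as follows. The symbols $f$ (objective), $\mathcal{X}$ (compact convex constraint set), $x$ and $v$ are notation assumed here for illustration; the abstract itself does not fix them.

$$g(x) = \max_{v \in \mathcal{X}} \langle \nabla f(x),\, x - v \rangle$$

For any $x \in \mathcal{X}$ the gap satisfies $g(x) \ge 0$, and $g(x) = 0$ exactly when $x$ is a stationary point of the constrained problem. The maximizing $v$ is the output of the same linear optimization step that replaces the projection.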
To evaluate the computational benefit of the proposed algorithm, we apply it to multiclass classification problems in simulation experiments on two datasets, aloi and news20. The results show that the proposed algorithm is faster than existing distributed optimization algorithms because of its lower computation per iteration. The results also show that well-connected graphs or smaller graphs lead to a faster convergence rate, which confirms the theoretical results.
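The two ingredients named in the abstract (random coordinate blocks and a linear optimization step in place of a projection) can be illustrated with a small sketch. This is not the paper's algorithm: the update rule, the gradient-tracking variable, the box constraint, the quadratic local objectives, the ring weights, and every name below (`run_sketch`, `lmo_box`, `targets`, `ring`) are assumptions chosen only to keep the example short and self-contained.

```python
import random


def lmo_box(grad_k, radius):
    """Linear minimization over [-radius, radius] for a single coordinate."""
    return -radius if grad_k > 0 else radius


def objective(point, targets):
    """Sum of the assumed local objectives f_i(x) = 0.5 * ||x - b_i||^2."""
    return sum(0.5 * sum((p - b) ** 2 for p, b in zip(point, tgt))
               for tgt in targets)


def run_sketch(targets, weights, radius=1.0, block_size=2, iters=3000, seed=0):
    """Hypothetical block-coordinate, projection-free consensus scheme."""
    rng = random.Random(seed)
    n, d = len(targets), len(targets[0])
    x = [[radius] * d for _ in range(n)]  # feasible start: a corner of the box
    g = [[0.0] * d for _ in range(n)]     # last partial-gradient estimate per agent
    s = [[0.0] * d for _ in range(n)]     # tracker of the network-average gradient
    for t in range(iters):
        gamma = 2.0 / (t + 2.0)           # classical Frank-Wolfe step size
        # Consensus: mix iterates and trackers with neighbours' values.
        z = [[sum(weights[i][j] * x[j][k] for j in range(n)) for k in range(d)]
             for i in range(n)]
        s_mix = [[sum(weights[i][j] * s[j][k] for j in range(n)) for k in range(d)]
                 for i in range(n)]
        for i in range(n):
            block = rng.sample(range(d), block_size)  # random coordinate block
            x[i] = z[i][:]                # coordinates outside the block keep z
            s[i] = s_mix[i]
            for k in block:
                new_g = z[i][k] - targets[i][k]  # k-th partial derivative of f_i
                s[i][k] += new_g - g[i][k]       # gradient-tracking correction
                g[i][k] = new_g
                v = lmo_box(s[i][k], radius)     # linear step, no projection
                x[i][k] = (1.0 - gamma) * z[i][k] + gamma * v
    return x


# Toy problem: 4 agents on a ring, 6 coordinates, doubly stochastic weights.
data_rng = random.Random(1)
targets = [[data_rng.uniform(-0.5, 0.5) for _ in range(6)] for _ in range(4)]
ring = [[0.5, 0.25, 0.0, 0.25],
        [0.25, 0.5, 0.25, 0.0],
        [0.0, 0.25, 0.5, 0.25],
        [0.25, 0.0, 0.25, 0.5]]

final = run_sketch(targets, ring)
average = [sum(col) / len(final) for col in zip(*final)]
optimum = [sum(col) / len(targets) for col in zip(*targets)]  # unconstrained minimizer, inside the box
initial_gap = objective([1.0] * 6, targets) - objective(optimum, targets)
final_gap = objective(average, targets) - objective(optimum, targets)
disagreement = max(abs(a - b) for row in final for a, b in zip(row, average))
print("initial gap:", initial_gap)
print("final gap:", final_gap)
print("max disagreement:", disagreement)
```

Each agent touches only `block_size` gradient coordinates per iteration, and feasibility is preserved without any projection because every update is a convex combination of points in the box. A box is used here because its linear optimization step separates by coordinate; other constraint sets would need a different block structure.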
Pages: 267-283
Page count: 16