Finite-time convergence rates of distributed local stochastic approximation

被引:0
|
作者
Doan, Thinh T. [1 ]
机构
[1] Virginia Tech, Bradley Dept Elect & Comp Engn, Blacksburg, VA 24061 USA
关键词
D O I
10.1016/j.automatica.2023.111294
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider a distributed learning framework, where there are a group of agents communicating with a centralized coordinator. The goal of the agents is to find the root of an operator composed of the local operators at the agents. Such a framework models many practical problems in different areas, including those in federated learning and reinforcement learning. For solving this problem, we study the popular distributed stochastic approximation. Over a series of time epoch, each agent runs a number of local stochastic approximation steps based on its own data, whose results are then aggregated at the centralized coordinator.Existing theoretical guarantees for the finite-time performance of local stochastic approximation are studied under the common assumption that the local data at each agent is sampled i.i.d. Such an assumption may not hold in many applications, where the data are temporally dependent, for example, they are sampled from some dynamical systems. In this paper, we study the setting where the data are generated from Markov random processes, which are often used to model the systems in stochastic control and reinforcement learning. Our main contribution is to characterize the finite-time performance of the local stochastic approximation under this setting. We provide explicit formulas for the rates of this method for both constant and time-varying step sizes when the local operators are strongly monotone. Our results show that these rates are within a logarithmic factor of the comparable bounds under independent data. We also provide a number of numerical simulations to illustrate our theoretical results by applying local SA in solving problems in robust identification and reinforcement learning over multi-agent systems.Published by Elsevier Ltd.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Finite-time error bounds for distributed linear stochastic approximation
    Lin, Yixuan
    Gupta, Vijay
    Liu, Ji
    AUTOMATICA, 2024, 159
  • [2] Finite-Time Performance of Distributed Two-Time-Scale Stochastic Approximation
    Doan, Thinh T.
    Romberg, Justin
    LEARNING FOR DYNAMICS AND CONTROL, VOL 120, 2020, 120 : 26 - 36
  • [3] Nonlinear Two-Time-Scale Stochastic Approximation: Convergence and Finite-Time Performance
    Doan, Thinh T. T.
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (08) : 4695 - 4705
  • [4] Finite-Time Convergence Rates of Decentralized Stochastic Approximation With Applications in Multi-Agent and Multi-Task Learning
    Zeng, Sihan
    Doan, Thinh T.
    Romberg, Justin
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (05) : 2758 - 2773
  • [5] Distributed optimization with the consideration of adaptivity and finite-time convergence
    Lin, Peng
    Ren, Wei
    Song, Yongduan
    Farrell, Jay A.
    2014 AMERICAN CONTROL CONFERENCE (ACC), 2014, : 3177 - 3182
  • [6] Distributed Optimization with Finite-Time Convergence via Discontinuous Dynamics
    Pan, Xiaowei
    Liu, Zhongxin
    Chen, Zengqiang
    2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 6665 - 6669
  • [7] Linear Two-Time-Scale Stochastic Approximation A Finite-Time Analysis
    Doan, Thinh T.
    Romberg, Justin
    2019 57TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2019, : 399 - 406
  • [8] Finite-Time Error Bounds For Linear Stochastic Approximation and TD Learning
    Srikant, R.
    Ying, Lei
    CONFERENCE ON LEARNING THEORY, VOL 99, 2019, 99
  • [9] Robust Finite-Time Dynamic Average Consensus With Exponential Convergence Rates
    Xu, Kedong
    Gao, Lan
    Chen, Fei
    Li, Chaojie
    Xuan, Qi
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2021, 68 (07) : 2578 - 2582
  • [10] Finite-Time Convergent Distributed Cooperative Learning Algorithm for Data Approximation
    Song, Yanfei
    Chen, Weisheng
    Dai, Hao
    PROCEEDINGS OF THE 35TH CHINESE CONTROL CONFERENCE 2016, 2016, : 8032 - 8036