Convergence of Heterogeneous Distributed Learning in Stochastic Routing Games

被引:0
|
作者
Krichene, Syrine [1 ]
Krichene, Walid [2 ]
Dong, Roy [2 ]
Bayen, Alexandre [2 ]
机构
[1] ENSIMAG Sch Comp Sci & Appl Math Grenoble, St Martin Dheres, France
[2] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA
来源
2015 53RD ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON) | 2015年
关键词
CONVEX-OPTIMIZATION;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We study convergence properties of distributed learning dynamics in repeated stochastic routing games. The game is stochastic in that each player observes a stochastic vector, the conditional expectation of which is equal to the true loss (almost surely). In particular, we propose a model in which every player m follows a stochastic mirror descent dynamics with Bregman divergence D psi(m) and learning rates eta(m)(t) = theta(m)t(-alpha m). We prove that if all players use the same sequence of learning rates, then their joint strategy converges almost surely to the equilibrium set. If the learning dynamics are heterogeneous, that is, different players use different learning rates, then the joint strategy converges to equilibrium in expectation, and we give upper bounds on the convergence rate. This result holds for general routing games (no smoothness or strong convexity assumptions are required). These results provide a distributed learning model that is robust to measurement noise and other stochastic perturbations, and allows flexibility in the choice of learning algorithm of each player. The results also provide estimates of convergence rates, which are confirmed in simulation.
引用
收藏
页码:480 / 487
页数:8
相关论文
共 50 条
  • [21] Learning in Games: Robustness of Fast Convergence
    Foster, Dylan J.
    Li, Zhiyuan
    Lykouris, Thodoris
    Sridharan, Karthik
    Tardos, Eva
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [22] Privacy stochastic games in distributed constraint reasoning
    Julien Savaux
    Julien Vion
    Sylvain Piechowiak
    René Mandiau
    Toshihiro Matsui
    Katsutoshi Hirayama
    Makoto Yokoo
    Shakre Elmane
    Marius Silaghi
    Annals of Mathematics and Artificial Intelligence, 2020, 88 : 691 - 715
  • [23] Privacy stochastic games in distributed constraint reasoning
    Savaux, Julien
    Vion, Julien
    Piechowiak, Sylvain
    Mandiau, Rene
    Matsui, Toshihiro
    Hirayama, Katsutoshi
    Yokoo, Makoto
    Elmane, Shakre
    Silaghi, Marius
    ANNALS OF MATHEMATICS AND ARTIFICIAL INTELLIGENCE, 2020, 88 (07) : 691 - 715
  • [24] A convergence method for stochastic differential games with a small parameter
    Ramachandran, KM
    STOCHASTIC ANALYSIS AND APPLICATIONS, 1999, 17 (02) : 219 - 252
  • [25] Distributed Discontinuous Coupling for Convergence in Heterogeneous Networks
    Coraggio, Marco
    DeLellis, Pietro
    di Bernardo, Mario
    IEEE CONTROL SYSTEMS LETTERS, 2021, 5 (03): : 1037 - 1042
  • [26] CONVERGENCE AND STABILITY OF DISTRIBUTED STOCHASTIC ITERATIVE PROCESSES
    LADDE, GS
    SILJAK, DD
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1990, 35 (06) : 665 - 672
  • [27] Network aggregative games: Distributed convergence to Nash equilibria
    Parise, Francesca
    Gentile, Basilio
    Grammatico, Sergio
    Lygeros, John
    2015 54TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2015, : 2295 - 2300
  • [28] Robustness of Learning in Games With Heterogeneous Players
    Akbar, Aqsa Shehzadi
    Jaleel, Hassan
    Abbas, Waseem
    Shamma, Jeff S.
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (03) : 1553 - 1567
  • [29] Distributed Adaptive Routing Convergence to Non-Blocking DCN Routing Assignments
    Zahavi, Eitan
    Keslassy, Isaac
    Kolodny, Avinoam
    IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2014, 32 (01) : 88 - 101
  • [30] Online Reinforcement Learning in Stochastic Games
    Wei, Chen-Yu
    Hong, Yi-Te
    Lu, Chi-Jen
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30