Convergence of Heterogeneous Distributed Learning in Stochastic Routing Games

被引：0

作者：

Krichene, Syrine ^{[1
]}

Krichene, Walid ^{[2
]}

Dong, Roy ^{[2
]}

Bayen, Alexandre ^{[2
]}

机构：

[1] ENSIMAG Sch Comp Sci & Appl Math Grenoble, St Martin Dheres, France

[2] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA

来源：

2015 53RD ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON) | 2015年

关键词：

CONVEX-OPTIMIZATION;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We study convergence properties of distributed learning dynamics in repeated stochastic routing games. The game is stochastic in that each player observes a stochastic vector, the conditional expectation of which is equal to the true loss (almost surely). In particular, we propose a model in which every player m follows a stochastic mirror descent dynamics with Bregman divergence D psi(m) and learning rates eta(m)(t) = theta(m)t(-alpha m). We prove that if all players use the same sequence of learning rates, then their joint strategy converges almost surely to the equilibrium set. If the learning dynamics are heterogeneous, that is, different players use different learning rates, then the joint strategy converges to equilibrium in expectation, and we give upper bounds on the convergence rate. This result holds for general routing games (no smoothness or strong convexity assumptions are required). These results provide a distributed learning model that is robust to measurement noise and other stochastic perturbations, and allows flexibility in the choice of learning algorithm of each player. The results also provide estimates of convergence rates, which are confirmed in simulation.

引用

页码：480 / 487

页数：8

共 50 条

[21] Learning in Games: Robustness of Fast Convergence
Foster, Dylan J.
Li, Zhiyuan
Lykouris, Thodoris
Sridharan, Karthik
Tardos, Eva
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
[22] Privacy stochastic games in distributed constraint reasoning
Julien Savaux
Julien Vion
Sylvain Piechowiak
René Mandiau
Toshihiro Matsui
Katsutoshi Hirayama
Makoto Yokoo
Shakre Elmane
Marius Silaghi
Annals of Mathematics and Artificial Intelligence, 2020, 88 : 691 - 715
[23] Privacy stochastic games in distributed constraint reasoning
Savaux, Julien
Vion, Julien
Piechowiak, Sylvain
Mandiau, Rene
Matsui, Toshihiro
Hirayama, Katsutoshi
Yokoo, Makoto
Elmane, Shakre
Silaghi, Marius
ANNALS OF MATHEMATICS AND ARTIFICIAL INTELLIGENCE, 2020, 88 (07) : 691 - 715
[24] A convergence method for stochastic differential games with a small parameter
Ramachandran, KM
STOCHASTIC ANALYSIS AND APPLICATIONS, 1999, 17 (02) : 219 - 252
[25] Distributed Discontinuous Coupling for Convergence in Heterogeneous Networks
Coraggio, Marco
DeLellis, Pietro
di Bernardo, Mario
IEEE CONTROL SYSTEMS LETTERS, 2021, 5 (03): : 1037 - 1042
[26] CONVERGENCE AND STABILITY OF DISTRIBUTED STOCHASTIC ITERATIVE PROCESSES
LADDE, GS
SILJAK, DD
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1990, 35 (06) : 665 - 672
[27] Network aggregative games: Distributed convergence to Nash equilibria
Parise, Francesca
Gentile, Basilio
Grammatico, Sergio
Lygeros, John
2015 54TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2015, : 2295 - 2300
[28] Robustness of Learning in Games With Heterogeneous Players
Akbar, Aqsa Shehzadi
Jaleel, Hassan
Abbas, Waseem
Shamma, Jeff S.
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (03) : 1553 - 1567
[29] Distributed Adaptive Routing Convergence to Non-Blocking DCN Routing Assignments
Zahavi, Eitan
Keslassy, Isaac
Kolodny, Avinoam
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2014, 32 (01) : 88 - 101
[30] Online Reinforcement Learning in Stochastic Games
Wei, Chen-Yu
Hong, Yi-Te
Lu, Chi-Jen
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30

← 1 2 3 4 5 →