Stochastic learning in multi-agent optimization: Communication and payoff-based approaches

Cited by: 10
Authors
Tatarenko, Tatiana [1]
Affiliations
[1] Tech Univ Darmstadt, Control Methods & Robot Lab, Darmstadt, Germany
Keywords
Non-convex optimization; Multi-agent systems; Game theory; Learning algorithms; Stochastic approximation; DISTRIBUTED OPTIMIZATION; POWER-CONTROL; CONSENSUS; GAMES
DOI
10.1016/j.automatica.2018.10.001
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Game theory is a powerful tool for distributed optimization in multi-agent systems across a range of applications. In this paper we consider multi-agent systems that can be modeled by potential games whose potential function coincides with a global objective function to be maximized. In this approach, the agents are the strategic decision makers, and the optimization problem is equivalent to learning a maximizer of the potential function in the designed game. The paper deals with two information settings. In the first, agents have access to the gradients of their utility functions but do not have full information about the joint actions; to move along the gradient toward a local optimum, they must therefore exchange information with their neighbors through communication. The second setting is payoff-based: at each iteration, agents observe only their own played actions and experienced payoffs. In both cases, the paper studies unconstrained non-concave optimization with a differentiable objective function. To develop algorithms that guarantee convergence to a local maximum of the potential function in the absence of saddle points, we use the idea of the well-known Robbins-Monro procedure from the theory of stochastic approximation. (C) 2018 Elsevier Ltd. All rights reserved.
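The Robbins-Monro idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the paper's algorithm: the two-agent potential function, the noise model, the step-size rule 1/t, and all function names below are assumptions chosen for illustration, and the toy potential is concave, whereas the paper treats the non-concave case. Unlike either setting in the paper, the sketch also lets each agent see the full joint action. It only shows how noisy gradient steps with diminishing step sizes can settle at a maximizer of a potential function.

```python
import random


def own_partial_derivative(agent, x):
    """Partial derivative of a hypothetical potential with respect to one agent's action.

    Assumed potential (illustration only):
        phi(x) = -(x0 - 1)^2 - (x1 + 2)^2 - 0.5 * x0 * x1
    It is strictly concave with its unique maximizer at (1.6, -2.4).
    """
    if agent == 0:
        return -2.0 * (x[0] - 1.0) - 0.5 * x[1]
    return -2.0 * (x[1] + 2.0) - 0.5 * x[0]


def robbins_monro_ascent(steps=20000, noise_std=1.0, seed=0):
    """Noisy gradient ascent with Robbins-Monro step sizes a_t = 1/t.

    The step sizes satisfy sum(a_t) = infinity and sum(a_t^2) < infinity,
    the classical conditions under which the noise averages out.
    """
    rng = random.Random(seed)
    x = [0.0, 0.0]  # joint action of the two agents
    for t in range(1, steps + 1):
        step = 1.0 / t
        # Each agent observes a noisy version of its own partial derivative.
        noisy = [
            own_partial_derivative(i, x) + rng.gauss(0.0, noise_std)
            for i in range(2)
        ]
        # Simultaneous update: both agents move along their noisy gradients.
        x = [x[i] + step * noisy[i] for i in range(2)]
    return x


if __name__ == "__main__":
    print(robbins_monro_ascent())
```

In a potential game, each agent's partial derivative of its own utility equals the corresponding partial derivative of the potential, which is why per-agent updates of this kind climb the shared objective.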
Pages: 1-12
Number of pages: 12
Related papers
50 in total
  • [31] A note on the learning effect in multi-agent optimization
    Janiak, Adam
    Rudek, Radoslaw
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (05) : 5974 - 5980
  • [32] Reinforcement Learning for Multi-Agent Stochastic Resource Collection
    Strauss, Niklas
    Winkel, David
    Berrendorf, Max
    Schubert, Matthias
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT IV, 2023, 13716 : 200 - 215
  • [33] Distributed subgradient method for multi-agent optimization with quantized communication
    Li, Jueyou
    Chen, Guo
    Wu, Zhiyou
    He, Xing
    [J]. MATHEMATICAL METHODS IN THE APPLIED SCIENCES, 2017, 40 (04) : 1201 - 1213
  • [34] Logarithmic Communication for Distributed Optimization in Multi-Agent Systems
    London, Palma
    Vardi, Shai
    Wierman, Adam
    [J]. Performance Evaluation Review, 2020, 48 (01) : 97 - 98
  • [35] Multi-Agent Reinforcement Learning in Stochastic Networked Systems
    Lin, Yiheng
    Qu, Guannan
    Huang, Longbo
    Wierman, Adam
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [36] Flocking Control for Multi-agent Systems with Communication Optimization
    Li, Heng
    Peng, Jun
    Liu, Weirong
    Wang, Jing
    Liu, Jiangang
    Huang, Zhiwu
    [J]. 2013 AMERICAN CONTROL CONFERENCE (ACC), 2013, : 2056 - 2061
  • [37] Communication-robust multi-agent learning by adaptable auxiliary multi-agent adversary generation
    Lei Yuan
    Feng Chen
    Zongzhang Zhang
    Yang Yu
    [J]. Frontiers of Computer Science, 2024, 18
  • [38] Logarithmic Communication for Distributed Optimization in Multi-Agent Systems
    London, Palma
    Vardi, Shai
    Wierman, Adam
    [J]. PROCEEDINGS OF THE ACM ON MEASUREMENT AND ANALYSIS OF COMPUTING SYSTEMS, 2019, 3 (03)
  • [39] Communication-robust multi-agent learning by adaptable auxiliary multi-agent adversary generation
    Yuan, Lei
    Chen, Feng
    Zhang, Zongzhang
    Yu, Yang
    [J]. FRONTIERS OF COMPUTER SCIENCE, 2024, 18 (06)
  • [40] Communication-robust multi-agent learning by adaptable auxiliary multi-agent adversary generation
    YUAN Lei
    CHEN Feng
    ZHANG Zongzhang
    YU Yang
    [J]. Frontiers of Computer Science, 2024, 18 (06)