Stochastic learning in multi-agent optimization: Communication and payoff-based approaches

Cited by: 10
Authors
Tatarenko, Tatiana [1]
Affiliations
[1] Tech Univ Darmstadt, Control Methods & Robot Lab, Darmstadt, Germany
Keywords
Non-convex optimization; Multi-agent systems; Game theory; Learning algorithms; Stochastic approximation; Distributed optimization; Power control; Consensus; Games
DOI
10.1016/j.automatica.2018.10.001
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Game theory serves as a powerful tool for distributed optimization in multi-agent systems across a range of applications. In this paper we consider multi-agent systems that can be modeled by potential games whose potential function coincides with a global objective function to be maximized. In this approach, the agents correspond to the strategic decision makers, and the optimization problem is equivalent to the problem of learning a potential function maximizer in the designed game. The paper deals with two different information settings. First, we consider systems in which agents have access to the gradients of their utility functions but do not possess full information about the joint actions. Thus, to move along the gradient toward a local optimum, they need to exchange information with their neighbors by means of communication. The second setting refers to a payoff-based approach. Here, we assume that at each iteration agents can observe only their own played actions and experienced payoffs. In both cases, the paper studies unconstrained non-concave optimization with a differentiable objective function. To develop the corresponding algorithms, which guarantee convergence to a local maximum of the potential function in the absence of saddle points, we use the idea of the well-known Robbins-Monro procedure based on the theory of stochastic approximation. (C) 2018 Elsevier Ltd. All rights reserved.
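Illustrative note: the Robbins-Monro idea mentioned in the abstract can be sketched as stochastic gradient ascent with diminishing step sizes a_t satisfying sum a_t = infinity and sum a_t^2 < infinity. The sketch below is not the paper's algorithm. It assumes a hypothetical two-agent toy potential phi(x) = -(x1 - 1)^2 - (x2 + 2)^2 (concave, chosen only so the maximizer is known), Gaussian gradient noise, and the step size a_t = 1/t; the names `robbins_monro_ascent` and `noisy_grad` are invented for illustration.

```python
import random


def robbins_monro_ascent(noisy_grad, x0, steps=20000, seed=0):
    """Iterate x <- x + a_t * g_t, where g_t is a noisy gradient sample."""
    rng = random.Random(seed)
    x = list(x0)
    for t in range(1, steps + 1):
        a_t = 1.0 / t  # assumed step size: sum a_t diverges, sum a_t^2 converges
        g = noisy_grad(x, rng)
        x = [xi + a_t * gi for xi, gi in zip(x, g)]
    return x


def noisy_grad(x, rng):
    """Gradient of the assumed toy potential plus zero-mean Gaussian noise.

    Each coordinate plays the role of one agent's action; the maximizer
    of the toy potential is (1, -2).
    """
    return [
        -2.0 * (x[0] - 1.0) + rng.gauss(0.0, 1.0),
        -2.0 * (x[1] + 2.0) + rng.gauss(0.0, 1.0),
    ]


x_hat = robbins_monro_ascent(noisy_grad, [0.0, 0.0])
print(x_hat)  # expected to lie close to the maximizer (1, -2)
```

The diminishing step size is what averages out the noise: a constant step would leave the iterates fluctuating around the maximizer instead of settling on it.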
Pages: 1-12
Number of pages: 12
Related papers
50 in total
  • [21] Payoff-based learning explains the decline in cooperation in public goods games
    Burton-Chellew, Maxwell N.
    Nax, Heinrich H.
    West, Stuart A.
    [J]. PROCEEDINGS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2015, 282 (1801)
  • [22] Learning Attentional Communication for Multi-Agent Cooperation
    Jiang, Jiechuan
    Lu, Zongqing
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [23] Coordinating Multi-Agent Navigation by Learning Communication
    Hildreth, Dalto N.
    Guy, Stephen J.
    [J]. PROCEEDINGS OF THE ACM ON COMPUTER GRAPHICS AND INTERACTIVE TECHNIQUES, 2019, 2 (02)
  • [24] Learning to Ground Multi-Agent Communication with Autoencoders
    Lin, Toru
    Huh, Minyoung
    Stauffer, Chris
    Lim, Ser-Nam
    Isola, Phillip
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [25] A communication architecture for multi-agent learning systems
    Ireson, N
    Cao, YJ
    Bull, L
    Miles, R
    [J]. REAL-WORLD APPLICATIONS OF EVOLUTIONARY COMPUTING, PROCEEDINGS, 2000, 1803 : 255 - 266
  • [26] Multi-Agent Mirror Descent for Decentralized Stochastic Optimization
    Rabbat, Michael
    [J]. 2015 IEEE 6TH INTERNATIONAL WORKSHOP ON COMPUTATIONAL ADVANCES IN MULTI-SENSOR ADAPTIVE PROCESSING (CAMSAP), 2015, : 517 - 520
  • [27] Communication Optimization for Multi-agent Reinforcement Learning-based Traffic Control System with Explainable Protocol
    Wang, Han
    Wu, Haochen
    Lu, Juanwu
    Tang, Fang
    Delle Monache, Maria Laura
    [J]. 2023 IEEE 26TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, ITSC, 2023, : 6068 - 6073
  • [28] HyperComm: Hypergraph-based communication in multi-agent reinforcement learning
    Zhu, Tianyu
    Shi, Xinli
    Xu, Xiangping
    Gui, Jie
    Cao, Jinde
    [J]. NEURAL NETWORKS, 2024, 178
  • [29] Multi-Objective Optimization in Air-to-Air Communication System Based on Multi-Agent Deep Reinforcement Learning
    Lin, Shaofu
    Chen, Yingying
    Li, Shuopeng
    [J]. SENSORS, 2023, 23 (23)
  • [30] Multi-Agent Reinforcement Learning for Convex Optimization
    Morcos, Amir
    West, Aaron
    Maguire, Brian
    [J]. ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS III, 2021, 11746