Reinforcement Learning for Mean-Field Game

Cited by: 4
Authors
Agarwal, Mridul [1]
Aggarwal, Vaneet [1,2]
Ghosh, Arnob [3]
Tiwari, Nilay [4]
Affiliations
[1] Purdue Univ, Sch Elect & Comp Engn, W Lafayette, IN 47907 USA
[2] Purdue Univ, Sch Ind Engn, W Lafayette, IN 47907 USA
[3] Ohio State Univ, Dept Elect & Comp Engn, Columbus, OH 43210 USA
[4] IIT Kanpur, Dept Elect Engn, Kanpur 208016, Uttar Pradesh, India
Funding
National Science Foundation (USA)
Keywords
reinforcement learning; mean-field game; equilibrium; dynamic games
DOI
10.3390/a15030073
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Stochastic games provide a framework for interactions among multiple agents and enable a myriad of applications. In these games, agents decide on actions simultaneously. After taking an action, the state of every agent updates to the next state, and each agent receives a reward. However, finding an equilibrium (if one exists) in such a game is often difficult when the number of agents becomes large. This paper focuses on finding a mean-field equilibrium (MFE) in an action-coupled stochastic game in an episodic framework. It is assumed that an agent can approximate the impact of the other agents by the empirical distribution of the mean of the actions. All agents know the action distribution and employ lower-myopic best response dynamics to choose the optimal oblivious strategy. This paper proposes a posterior sampling-based approach for reinforcement learning in the mean-field game, where each agent samples a transition probability from the previous transitions. We show that the policy and action distributions converge to the optimal oblivious strategy and the limiting distribution, respectively, which constitute an MFE.
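The posterior-sampling step mentioned in the abstract can be illustrated with a minimal sketch. It is not the paper's algorithm: the Dirichlet prior, the tabular state and action spaces, the names `sample_transition_model` and `best_response`, and the assumption that the reward table already accounts for the population's action distribution are all choices made here for illustration.

```python
import random


def sample_transition_model(counts, prior=1.0, rng=random):
    """Draw one transition kernel from a Dirichlet posterior (assumed prior).

    counts[s][a][s2] is how often (s, a) was followed by s2 in past episodes.
    """
    model = []
    for per_state in counts:
        rows = []
        for next_counts in per_state:
            # A Dirichlet draw is a vector of independent Gamma draws, normalised.
            draws = [rng.gammavariate(prior + c, 1.0) for c in next_counts]
            total = sum(draws)
            rows.append([d / total for d in draws])
        model.append(rows)
    return model


def best_response(model, reward, horizon):
    """Backward induction on the sampled model for a fixed action distribution.

    reward[s][a] is assumed to already reflect the population's action
    distribution, so the agent faces an ordinary finite-horizon MDP.
    """
    n_states = len(model)
    value = [0.0] * n_states
    policy = []
    for _ in range(horizon):
        # Q-values for the current step, using the value of the following step.
        q = [
            [
                reward[s][a] + sum(p * v for p, v in zip(model[s][a], value))
                for a in range(len(model[s]))
            ]
            for s in range(n_states)
        ]
        policy.append([max(range(len(row)), key=row.__getitem__) for row in q])
        value = [max(row) for row in q]
    policy.reverse()  # policy[h][s] is the greedy action at step h in state s
    return policy, value
```

In an episodic loop, an agent would call `sample_transition_model` once per episode on its accumulated counts, compute `best_response` on the sample, act, and then update the counts with the observed transitions.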
Pages: 13