Unified reinforcement Q-learning for mean field game and control problems

被引:24
|
作者
Angiuli, Andrea [1 ]
Fouque, Jean-Pierre [1 ]
Lauriere, Mathieu [2 ]
机构
[1] Univ Calif Santa Barbara, Dept Stat & Appl Probabil, South Hall 5504, Santa Barbara, CA 93106 USA
[2] Princeton Univ, Dept Operat Res & Financial Engn, Princeton, NJ 08544 USA
关键词
Q-learning; Mean field game; Mean field control; Timescales; Linear-quadratic control;
D O I
10.1007/s00498-021-00310-1
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present a Reinforcement Learning (RL) algorithm to solve infinite horizon asymptotic Mean Field Game (MFG) and Mean Field Control (MFC) problems. Our approach can be described as a unified two-timescale Mean Field Q-learning: The same algorithm can learn either the MFG or the MFC solution by simply tuning the ratio of two learning parameters. The algorithm is in discrete time and space where the agent not only provides an action to the environment but also a distribution of the state in order to take into account the mean field feature of the problem. Importantly, we assume that the agent cannot observe the population's distribution and needs to estimate it in a model-free manner. The asymptotic MFG and MFC problems are also presented in continuous time and space, and compared with classical (non-asymptotic or stationary) MFG and MFC problems. They lead to explicit solutions in the linear-quadratic (LQ) case that are used as benchmarks for the results of our algorithm.
引用
收藏
页码:217 / 271
页数:55
相关论文
共 50 条
  • [1] Unified reinforcement Q-learning for mean field game and control problems
    Andrea Angiuli
    Jean-Pierre Fouque
    Mathieu Laurière
    Mathematics of Control, Signals, and Systems, 2022, 34 : 217 - 271
  • [2] Continuous Time q-Learning for Mean-Field Control Problems
    Wei, Xiaoli
    Yu, Xiang
    APPLIED MATHEMATICS AND OPTIMIZATION, 2025, 91 (01):
  • [3] MODEL-FREE MEAN-FIELD REINFORCEMENT LEARNING: MEAN-FIELD MDP AND MEAN-FIELD Q-LEARNING
    Carmona, Rene
    Lauriere, Mathieu
    Tan, Zongjun
    ANNALS OF APPLIED PROBABILITY, 2023, 33 (6B): : 5334 - 5381
  • [4] Reinforcement Learning for Mean-Field Game
    Agarwal, Mridul
    Aggarwal, Vaneet
    Ghosh, Arnob
    Tiwari, Nilay
    ALGORITHMS, 2022, 15 (03)
  • [5] Q-Learning in Regularized Mean-field Games
    Anahtarci, Berkay
    Kariksiz, Can Deha
    Saldi, Naci
    DYNAMIC GAMES AND APPLICATIONS, 2023, 13 (01) : 89 - 117
  • [6] Q-Learning in Regularized Mean-field Games
    Berkay Anahtarci
    Can Deha Kariksiz
    Naci Saldi
    Dynamic Games and Applications, 2023, 13 : 89 - 117
  • [7] Deep Reinforcement Learning: From Q-Learning to Deep Q-Learning
    Tan, Fuxiao
    Yan, Pengfei
    Guan, Xinping
    NEURAL INFORMATION PROCESSING (ICONIP 2017), PT IV, 2017, 10637 : 475 - 483
  • [8] Adaptive Drug Delivery to Control Mean Arterial Blood Pressure by Reinforcement Fuzzy Q-Learning
    Zhang, Rui
    Li, Zhichun
    Pan, Xingzheng
    Ma, Zejun
    Dai, Ying
    Mohammadzadeh, Ardashir
    Zhang, Chunwei
    IEEE SENSORS JOURNAL, 2024, 24 (19) : 30968 - 30977
  • [9] Fuzzy Q-Learning for generalization of reinforcement learning
    Berenji, HR
    FUZZ-IEEE '96 - PROCEEDINGS OF THE FIFTH IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-3, 1996, : 2208 - 2214
  • [10] Deep Reinforcement Learning with Double Q-Learning
    van Hasselt, Hado
    Guez, Arthur
    Silver, David
    THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 2094 - 2100