Q-Learning in Regularized Mean-field Games

被引:13
|
作者
Anahtarci, Berkay [1 ]
Kariksiz, Can Deha [1 ]
Saldi, Naci [2 ]
机构
[1] Ozyegin Univ, Istanbul, Turkey
[2] Bilkent Univ, Ankara, Turkey
关键词
Mean-field games; Q-learning; Regularized Markov decision processes; Discounted reward; NASH EQUILIBRIA; DYNAMIC-GAMES; ROBUSTNESS;
D O I
10.1007/s13235-022-00450-2
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
In this paper, we introduce a regularized mean-field game and study learning of this game under an infinite-horizon discounted reward function. Regularization is introduced by adding a strongly concave regularization function to the one-stage reward function in the classical mean-field game model. We establish a value iteration based learning algorithm to this regularized mean-field game using fitted Q-learning. The regularization term in general makes reinforcement learning algorithm more robust to the system components. Moreover, it enables us to establish error analysis of the learning algorithm without imposing restrictive convexity assumptions on the system components, which are needed in the absence of a regularization term.
引用
收藏
页码:89 / 117
页数:29
相关论文
共 50 条
  • [1] Q-Learning in Regularized Mean-field Games
    Berkay Anahtarci
    Can Deha Kariksiz
    Naci Saldi
    [J]. Dynamic Games and Applications, 2023, 13 : 89 - 117
  • [2] MODEL-FREE MEAN-FIELD REINFORCEMENT LEARNING: MEAN-FIELD MDP AND MEAN-FIELD Q-LEARNING
    Carmona, Rene
    Lauriere, Mathieu
    Tan, Zongjun
    [J]. ANNALS OF APPLIED PROBABILITY, 2023, 33 (6B): : 5334 - 5381
  • [3] Learning Regularized Monotone Graphon Mean-Field Games
    Zhang, Fengzhuo
    Tan, Vincent Y. F.
    Wang, Zhaoran
    Yang, Zhuoran
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [4] Learning Mean-Field Games
    Guo, Xin
    Hu, Anran
    Xu, Renyuan
    Zhang, Junzi
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [5] Learning in Mean-Field Games
    Yin, Huibing
    Mehta, Prashant G.
    Meyn, Sean P.
    Shanbhag, Uday V.
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2014, 59 (03) : 629 - 644
  • [6] Learning in Mean-Field Oscillator Games
    Yin, Huibing
    Mehta, Prashant G.
    Meyn, Sean P.
    Shanbhag, Uday V.
    [J]. 49TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2010, : 3125 - 3132
  • [7] Independent Learning and Subjectivity in Mean-Field Games
    Yongacoglu, Bora
    Arslan, Gürdal
    Yuksel, Serdar
    [J]. 2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022, : 2845 - 2850
  • [8] A General Framework for Learning Mean-Field Games
    Guo, Xin
    Hu, Anran
    Xu, Renyuan
    Zhang, Junzi
    [J]. MATHEMATICS OF OPERATIONS RESEARCH, 2023, 48 (02) : 656 - 686
  • [9] Reinforcement Learning in Stationary Mean-field Games
    Subramanian, Jayakumar
    Mahajan, Aditya
    [J]. AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 251 - 259
  • [10] Optimal control for unknown mean-field discrete-time system based on Q-Learning
    Ge, Yingying
    Liu, Xikui
    Li, Yan
    [J]. INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2021, 52 (15) : 3335 - 3349