Hierarchical Nash-Q Learning in Continuous Games

Cited by: 0
Authors
Sahraei-Ardakani, Mostafa [1]
Rahimi-Kian, Ashkan [1]
Nili-Ahmadabadi, Majid [1]
Affiliations
[1] Univ Tehran, Coll Engn, ECE Dept, CIPCE, Tehran, Iran
Keywords
DOI
10.1109/CIG.2008.5035652
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Multi-agent Reinforcement Learning (RL) algorithms usually work on repeated, extended, or stochastic games. Generally, RL is developed for systems that are discrete in both states and actions. In this paper, a hierarchical method for learning the equilibrium strategy in continuous games is developed. The hierarchy is used to break the continuous domain of strategies into discrete sets of hierarchical strategies. The algorithm is proved to converge to the Nash equilibrium in a specific class of games with dominant strategies. It is then applied to some other games, and its convergence is shown. This practice is common for RL algorithms, which are often applied to problems for which no proof of convergence exists.
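The hierarchical discretization mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm: the record gives no algorithmic details, so the sketch swaps learned Nash-Q values for direct payoff evaluation. It assumes a two-player game on [0, 1] in which each player has a dominant strategy and a payoff that is unimodal in that player's own action. The names `refine`, `u1`, `u2`, and `solve`, the payoff functions, and the parameters `levels` and `cells` are all hypothetical.

```python
from typing import Callable, Tuple


def refine(payoff: Callable[[float], float], lo: float, hi: float,
           levels: int = 12, cells: int = 4) -> float:
    """Hierarchically narrow [lo, hi] toward the maximizer of a unimodal payoff.

    Each level splits the current interval into `cells` discrete actions
    (the cell centers), picks the best one, and zooms into its neighborhood.
    """
    for _ in range(levels):
        width = (hi - lo) / cells
        centers = [lo + (i + 0.5) * width for i in range(cells)]
        best = max(centers, key=payoff)
        # Keep one cell width on each side so a maximizer lying between
        # two centers is not cut off; clip to the current interval.
        lo, hi = max(lo, best - width), min(hi, best + width)
    return (lo + hi) / 2


# Hypothetical payoffs: each player's best action (0.3 and 0.7) does not
# depend on the opponent's action, so it is a dominant strategy.
def u1(a1: float, a2: float) -> float:
    return -(a1 - 0.3) ** 2 + 0.5 * a2


def u2(a1: float, a2: float) -> float:
    return -(a2 - 0.7) ** 2 - 0.2 * a1


def solve(levels: int = 12) -> Tuple[float, float]:
    """Refine each player's action against the opponent's current estimate."""
    a1, a2 = 0.5, 0.5  # arbitrary starting guesses
    a1 = refine(lambda x: u1(x, a2), 0.0, 1.0, levels)
    a2 = refine(lambda y: u2(a1, y), 0.0, 1.0, levels)
    return a1, a2


if __name__ == "__main__":
    print(solve())
```

With `cells = 4`, each level halves the search interval, so a dozen levels locate each dominant strategy to within roughly 1e-4. A learning-based variant would estimate the payoff of each cell center from sampled play instead of calling the payoff function directly.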
Pages: 290 - 295
Page count: 6
Related papers
50 items in total
  • [31] Learning Generalized Nash Equilibria in a Class of Convex Games
    Tatarenko, Tatiana
    Kamgarpour, Maryam
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2019, 64 (04) : 1426 - 1439
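  • [32] Optimal Trading Decision-Making in Integrated Energy Markets Based on Multi-Agent Nash-Q Reinforcement Learning
    孙庆凯
    王小君
    王怡
    张义志
    刘曌
    和敬涵
    Automation of Electric Power Systems (电力系统自动化), 2021, 45 (16) : 124 - 133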
  • [33] Q-Learning for Feedback Nash Strategy of Finite-Horizon Nonzero-Sum Difference Games
    Zhang, Zhaorong
    Xu, Juanjuan
    Fu, Minyue
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (09) : 9170 - 9178
  • [34] Extending Q-learning to continuous and mixed strategy games based on spatial reciprocity
    Wang, Lu
    Zhang, Long
    Liu, Yang
    Wang, Zhen
    PROCEEDINGS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2023, 479 (2274):
  • [35] EXISTENCE AND APPROXIMATION OF CONTINUOUS BAYESIAN NASH EQUILIBRIA IN GAMES WITH CONTINUOUS TYPE AND ACTION SPACES
    Guo, Shaoyan
    Xu, Huifu
    Zhang, Liwei
    SIAM JOURNAL ON OPTIMIZATION, 2021, 31 (04) : 2481 - 2507
  • [36] Collaborative optimization strategy of source-grid-load-energy storage based on improved Nash-Q equilibrium transfer algorithm
    Huang H.
    Li Y.
    Liu H.
    Dianli Zidonghua Shebei/Electric Power Automation Equipment, 2023, 43 (08) : 71 - 77 and 104
  • [37] The calculation of the Stackelberg–Nash equilibrium as a fixed point problem in static hierarchical games
    Moya S.
    International Journal of Dynamics and Control, 2018, 6 (3) : 907 - 918
  • [38] ON LINEAR-QUADRATIC GAUSSIAN CONTINUOUS-TIME NASH GAMES
    PAPAVASSILOPOULOS, GP
    JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 1984, 42 (04) : 525 - 549
  • [39] Optimal Deception Asset Deployment in Cybersecurity: A Nash Q-Learning Approach in Multi-Agent Stochastic Games
    Kong, Guanhua
    Chen, Fucai
    Yang, Xiaohan
    Cheng, Guozhen
    Zhang, Shuai
    He, Weizhen
    APPLIED SCIENCES-BASEL, 2024, 14 (01):
  • [40] A Constructive Proof that Learning in Repeated Games Leads to Nash Equilibria
    Leoni, Patrick L.
    B E JOURNAL OF THEORETICAL ECONOMICS, 2008, 8 (01):