Adaptive strategy optimization in game-theoretic paradigm using reinforcement learning

被引:1
|
作者
Cheong, Kang Hao [1 ,2 ]
Zhao, Jie [1 ,3 ]
机构
[1] Nanyang Technol Univ, Sch Phys & Math Sci, Div Math Sci, S-639798 Singapore, Singapore
[2] Nanyang Technol Univ, Coll Comp & Data Sci, Singapore S-639798, Singapore
[3] Singapore Univ Technol & Design, Sci Math & Technol, Singapore S-487372, Singapore
来源
PHYSICAL REVIEW RESEARCH | 2024年 / 6卷 / 03期
关键词
D O I
10.1103/PhysRevResearch.6.L032009
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
Parrondo's paradox refers to the counterintuitive phenomenon whereby two losing strategies, when alternated in a certain manner, can result in a winning outcome. Understanding the optimal sequence in Parrondo's games is of significant importance for maximizing profits in various contexts. However, the current predefined sequences may not adapt well to changing environments, limiting their potential for achieving the best performance. We posit that the optimal strategy that determines which game to play should be learnable through experience. In this Letter, we propose an efficient and robust approach that leverages Q learning to adaptively learn the optimal sequence in Parrondo's games. Through extensive simulations of coin-tossing games, we demonstrate that the learned switching strategy in Parrondo's games outperforms other predefined sequences in terms of profit. Furthermore, the experimental results show that our proposed method can be easily adjusted to adapt to different cases of capital-dependent games and history-dependent games.
引用
收藏
页数:7
相关论文
共 50 条
  • [21] Game-Theoretic Feedback-Based Optimization
    Agarwal, Anurag
    Simpson-Porco, John W.
    Pavel, Lacra
    [J]. IFAC PAPERSONLINE, 2022, 55 (13): : 174 - 179
  • [22] Deep Reinforcement Learning Based Game-Theoretic Decision-Making for Autonomous Vehicles
    Yuan, Mingfeng
    Shan, Jinjun
    Mi, Kevin
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (02) : 818 - 825
  • [23] Game-theoretic interaction among stochastic learning networks for adaptive load balancing
    Yeung, D.Y.
    Bekey, G.A.
    [J]. Neural Networks, 1988, 1 (1 SUPPL)
  • [24] Game-Theoretic Inverse Reinforcement Learning: A Differential Pontryagin's Maximum Principle Approach
    Cao, Kun
    Xie, Lihua
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (11) : 9506 - 9513
  • [25] Game-theoretic Particle Swarm Optimization for WMNs
    Zhao, Liqiang
    Zhang, Hailin
    Ding, Wei
    Zhang, Jie
    [J]. 2008 IEEE 19TH INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS, 2008, : 12 - 16
  • [26] Game-Theoretic Receding-Horizon Reinforcement Learning for Lateral Control of Autonomous Vehicles
    Ma, Qingwen
    Yin, Xin
    Zhang, Xinglong
    Xu, Xin
    Yao, Xinxin
    [J]. IEEE Transactions on Vehicular Technology, 2024, 73 (10) : 14547 - 14562
  • [27] Generalization Analysis for Game-Theoretic Machine Learning
    Li, Haifang
    Tian, Fei
    Chen, Wei
    Qin, Tao
    Ma, Zhi-Ming
    Liu, Tie-Yan
    [J]. PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2015, : 2089 - 2095
  • [28] A Game-theoretic Approach for Robust Federated Learning
    Tahanian, E.
    Amouei, M.
    Fateh, H.
    Rezvani, M.
    [J]. INTERNATIONAL JOURNAL OF ENGINEERING, 2021, 34 (04): : 832 - 842
  • [29] Paradigm shift in language planning and policy: game-theoretic solutions
    Tan, Di
    Leung, Genevieve
    [J]. CURRENT ISSUES IN LANGUAGE PLANNING, 2014, 15 (01) : 106 - +
  • [30] The Game-Theoretic Approach to Machine Learning and Adaptation
    Cesa-Bianchi, Nicolo
    [J]. ADAPTIVE AND INTELLIGENT SYSTEMS, 2011, 6943 : 1 - 1