Adaptive strategy optimization in game-theoretic paradigm using reinforcement learning

被引：1

作者：

Cheong, Kang Hao ^{[1
,2
]}

Zhao, Jie ^{[1
,3
]}

机构：

[1] Nanyang Technol Univ, Sch Phys & Math Sci, Div Math Sci, S-639798 Singapore, Singapore

[2] Nanyang Technol Univ, Coll Comp & Data Sci, Singapore S-639798, Singapore

[3] Singapore Univ Technol & Design, Sci Math & Technol, Singapore S-487372, Singapore

来源：

PHYSICAL REVIEW RESEARCH | 2024年 / 6卷 / 03期

关键词：

D O I：

10.1103/PhysRevResearch.6.L032009

中图分类号：

O4 [物理学];

学科分类号：

0702 ;

摘要：

Parrondo's paradox refers to the counterintuitive phenomenon whereby two losing strategies, when alternated in a certain manner, can result in a winning outcome. Understanding the optimal sequence in Parrondo's games is of significant importance for maximizing profits in various contexts. However, the current predefined sequences may not adapt well to changing environments, limiting their potential for achieving the best performance. We posit that the optimal strategy that determines which game to play should be learnable through experience. In this Letter, we propose an efficient and robust approach that leverages Q learning to adaptively learn the optimal sequence in Parrondo's games. Through extensive simulations of coin-tossing games, we demonstrate that the learned switching strategy in Parrondo's games outperforms other predefined sequences in terms of profit. Furthermore, the experimental results show that our proposed method can be easily adjusted to adapt to different cases of capital-dependent games and history-dependent games.

引用

页数：7

共 50 条

[21] Game-Theoretic Feedback-Based Optimization
Agarwal, Anurag
Simpson-Porco, John W.
Pavel, Lacra
[J]. IFAC PAPERSONLINE, 2022, 55 (13): : 174 - 179
[22] Deep Reinforcement Learning Based Game-Theoretic Decision-Making for Autonomous Vehicles
Yuan, Mingfeng
Shan, Jinjun
Mi, Kevin
[J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (02) : 818 - 825
[23] Game-theoretic interaction among stochastic learning networks for adaptive load balancing
Yeung, D.Y.
Bekey, G.A.
[J]. Neural Networks, 1988, 1 (1 SUPPL)
[24] Game-Theoretic Inverse Reinforcement Learning: A Differential Pontryagin's Maximum Principle Approach
Cao, Kun
Xie, Lihua
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (11) : 9506 - 9513
[25] Game-theoretic Particle Swarm Optimization for WMNs
Zhao, Liqiang
Zhang, Hailin
Ding, Wei
Zhang, Jie
[J]. 2008 IEEE 19TH INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS, 2008, : 12 - 16
[26] Game-Theoretic Receding-Horizon Reinforcement Learning for Lateral Control of Autonomous Vehicles
Ma, Qingwen
Yin, Xin
Zhang, Xinglong
Xu, Xin
Yao, Xinxin
[J]. IEEE Transactions on Vehicular Technology, 2024, 73 (10) : 14547 - 14562
[27] Generalization Analysis for Game-Theoretic Machine Learning
Li, Haifang
Tian, Fei
Chen, Wei
Qin, Tao
Ma, Zhi-Ming
Liu, Tie-Yan
[J]. PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2015, : 2089 - 2095
[28] A Game-theoretic Approach for Robust Federated Learning
Tahanian, E.
Amouei, M.
Fateh, H.
Rezvani, M.
[J]. INTERNATIONAL JOURNAL OF ENGINEERING, 2021, 34 (04): : 832 - 842
[29] Paradigm shift in language planning and policy: game-theoretic solutions
Tan, Di
Leung, Genevieve
[J]. CURRENT ISSUES IN LANGUAGE PLANNING, 2014, 15 (01) : 106 - +
[30] The Game-Theoretic Approach to Machine Learning and Adaptation
Cesa-Bianchi, Nicolo
[J]. ADAPTIVE AND INTELLIGENT SYSTEMS, 2011, 6943 : 1 - 1

← 1 2 3 4 5 →