Adaptive strategy optimization in game-theoretic paradigm using reinforcement learning

被引:1
|
作者
Cheong, Kang Hao [1 ,2 ]
Zhao, Jie [1 ,3 ]
机构
[1] Nanyang Technol Univ, Sch Phys & Math Sci, Div Math Sci, S-639798 Singapore, Singapore
[2] Nanyang Technol Univ, Coll Comp & Data Sci, Singapore S-639798, Singapore
[3] Singapore Univ Technol & Design, Sci Math & Technol, Singapore S-487372, Singapore
来源
PHYSICAL REVIEW RESEARCH | 2024年 / 6卷 / 03期
关键词
D O I
10.1103/PhysRevResearch.6.L032009
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
Parrondo's paradox refers to the counterintuitive phenomenon whereby two losing strategies, when alternated in a certain manner, can result in a winning outcome. Understanding the optimal sequence in Parrondo's games is of significant importance for maximizing profits in various contexts. However, the current predefined sequences may not adapt well to changing environments, limiting their potential for achieving the best performance. We posit that the optimal strategy that determines which game to play should be learnable through experience. In this Letter, we propose an efficient and robust approach that leverages Q learning to adaptively learn the optimal sequence in Parrondo's games. Through extensive simulations of coin-tossing games, we demonstrate that the learned switching strategy in Parrondo's games outperforms other predefined sequences in terms of profit. Furthermore, the experimental results show that our proposed method can be easily adjusted to adapt to different cases of capital-dependent games and history-dependent games.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] A Game-Theoretic Reinforcement Learning Approach for Adaptive Interaction at Intersections
    Jin, Xinze
    Li, Kuo
    Jia, Qing-Shan
    Xia, Huaxia
    Bai, Yu
    Ren, Dongchun
    [J]. 2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 4451 - 4456
  • [2] Multiagent Reinforcement Learning and Game-Theoretic Optimization for Autonomous Sensor Control
    Ravier, Robert
    Garagic, Denis
    Galoppo, Travis
    Rhodes, Bradley J.
    Zulch, Peter
    [J]. 2024 IEEE AEROSPACE CONFERENCE, 2024,
  • [3] A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning
    Lanctot, Marc
    Zambaldi, Vinicius
    Gruslys, Audrunas
    Lazaridou, Angeliki
    Tuyls, Karl
    Perolat, Julien
    Silver, David
    Graepel, Thore
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [4] Cooperation in wireless networks: a game-theoretic framework with reinforcement learning
    Baidas, Mohammed Wael
    [J]. IET COMMUNICATIONS, 2014, 8 (05) : 740 - 753
  • [5] On the Combination of Game-Theoretic Learning and Multi Model Adaptive Filters
    Smyrnakis, Michalis
    Qu, Hongyang
    Bauso, Dario
    Veres, Sandor
    [J]. AGENTS AND ARTIFICIAL INTELLIGENCE, ICAART 2020, 2021, 12613 : 73 - 105
  • [6] On the role of reinforcement learning in experimental games: The cognitive game-theoretic approach
    Erev, I
    Roth, AE
    [J]. GAMES AND HUMAN BEHAVIOR: ESSAYS IN HONOR OF AMNON RAPOPORT, 1999, : 53 - 77
  • [7] Stackelberg Actor-Critic: Game-Theoretic Reinforcement Learning Algorithms
    Zheng, Liyuan
    Fiez, Tanner
    Alumbaugh, Zane
    Chasnov, Benjamin
    Ratliff, Lillian J.
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 9217 - 9224
  • [8] A Game-Theoretic Framework with Reinforcement Learning for Multinode Cooperation in Wireless Networks
    Baidas, Mohammed W.
    [J]. 2013 IEEE 24TH INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR, AND MOBILE RADIO COMMUNICATIONS (PIMRC), 2013, : 981 - 986
  • [9] Consciousness as an Evolutionary Game-Theoretic Strategy
    Arsiwalla, Xerxes D.
    Herreros, Ivan
    Moulin-Frier, Clement
    Verschure, Paul
    [J]. BIOMIMETIC AND BIOHYBRID SYSTEMS, LIVING MACHINES 2017, 2017, 10384
  • [10] Adaptive categorization of ART networks in robot behavior learning using game-theoretic formulation
    Fung, WK
    Liu, YH
    [J]. NEURAL NETWORKS, 2003, 16 (10) : 1403 - 1420