No-Regret Learning and Equilibrium Computation in Quantum Games

被引:0
|
作者
Lin, Wayne [1 ]
Piliouras, Georgios [1 ]
Sim, Ryann [1 ]
Varvitsiotis, Antonios [1 ]
机构
[1] Singapore Univ Technol & Design, Singapore, Singapore
来源
QUANTUM | 2024年 / 8卷
基金
新加坡国家研究基金会;
关键词
D O I
10.22331/q-2024-12-17-1569
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
As quantum processors advance, the emergence of large-scale decentralized systems involving interacting quantum-enabled agents is on the horizon. Recent research efforts have explored quantum versions of Nash and correlated equilibria as solution concepts of strategic quantum interactions, but these approaches did not directly connect to decentralized adaptive setups where agents possess limited information. This paper delves into the dynamics of quantum-enabled agents within decentralized systems that employ no-regret algorithms to update their behaviors over time. Specifically, we investigate two-player quantum zero-sum games and polymatrix quantum zero-sum games, showing that no-regret algorithms converge to separable quantum Nash equilibria in time- average. In the case of general multi-player quantum games, our work leads to a novel solution concept, that of the separable quantum coarse correlated equilibria (QCCE), as the convergent outcome of the time-averaged behavior no-regret algorithms, offering a natural solution concept for decentralized quantum systems. Finally, we show that computing QCCEs can be formulated as a semidefinite program and establish the existence of entangled (i.e., non-separable) QCCEs, which are unlearnable via the current paradigm of no-regret learning.
引用
收藏
页数:24
相关论文
共 50 条
  • [31] Simple Uncoupled No-regret Learning Dynamics for Extensive-form Correlated Equilibrium
    Farina, Gabriele
    Celli, Andrea
    Marchesi, Alberto
    Gatti, Nicola
    JOURNAL OF THE ACM, 2022, 69 (06)
  • [32] Unifying convergence and no-regret in multiagent learning
    Banerjee, Bikramjit
    Peng, Jing
    LEARNING AND ADAPTION IN MULTI-AGENT SYSTEMS, 2006, 3898 : 100 - 114
  • [33] Weighted Voting Via No-Regret Learning
    Haghtalab, Nika
    Noothigattu, Ritesh
    Procaccia, Ariel D.
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 1055 - 1062
  • [34] No-regret Exploration in Contextual Reinforcement Learning
    Modi, Aditya
    Tewari, Ambuj
    CONFERENCE ON UNCERTAINTY IN ARTIFICIAL INTELLIGENCE (UAI 2020), 2020, 124 : 829 - 838
  • [35] A Reduction from Reinforcement Learning to No-Regret Online Learning
    Cheng, Ching-An
    des Combes, Remi Tachet
    Boots, Byron
    Gordon, Geoff
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 3514 - 3523
  • [36] Distributive opportunistic spectrum access for cognitive radio using correlated equilibrium and no-regret learning
    Han, Zhu
    Pandana, Charles
    Liu, K. J. Ray
    2007 IEEE WIRELESS COMMUNICATIONS & NETWORKING CONFERENCE, VOLS 1-9, 2007, : 11 - +
  • [37] A Marriage between Adversarial Team Games and 2-player Games: Enabling Abstractions, No-regret Learning, and Subgame Solving
    Carminati, Luca
    Cacciamani, Federico
    Ciccone, Marco
    Gatti, Nicola
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [38] Tight last-iterate convergence rates for no-regret learning in multi-player games
    Golowich, Noah
    Pattathil, Sarath
    Daskalakis, Constantinos
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [39] No-Regret and Incentive-Compatible Online Learning
    Freeman, Rupert
    Pennock, David M.
    Podimata, Chara
    Vaughan, Jennifer Wortman
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
  • [40] No-Regret Learning in Partially-Informed Auctions
    Guo, Wenshuo
    Jordan, Michael I.
    Vitercik, Ellen
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,