No-Regret Learning and Equilibrium Computation in Quantum Games

被引：0

作者：

Lin, Wayne ^{[1
]}

Piliouras, Georgios ^{[1
]}

Sim, Ryann ^{[1
]}

Varvitsiotis, Antonios ^{[1
]}

机构：

[1] Singapore Univ Technol & Design, Singapore, Singapore

来源：

QUANTUM | 2024年 / 8卷

基金：

新加坡国家研究基金会;

关键词：

D O I：

10.22331/q-2024-12-17-1569

中图分类号：

O4 [物理学];

学科分类号：

0702 ;

摘要：

As quantum processors advance, the emergence of large-scale decentralized systems involving interacting quantum-enabled agents is on the horizon. Recent research efforts have explored quantum versions of Nash and correlated equilibria as solution concepts of strategic quantum interactions, but these approaches did not directly connect to decentralized adaptive setups where agents possess limited information. This paper delves into the dynamics of quantum-enabled agents within decentralized systems that employ no-regret algorithms to update their behaviors over time. Specifically, we investigate two-player quantum zero-sum games and polymatrix quantum zero-sum games, showing that no-regret algorithms converge to separable quantum Nash equilibria in time- average. In the case of general multi-player quantum games, our work leads to a novel solution concept, that of the separable quantum coarse correlated equilibria (QCCE), as the convergent outcome of the time-averaged behavior no-regret algorithms, offering a natural solution concept for decentralized quantum systems. Finally, we show that computing QCCEs can be formulated as a semidefinite program and establish the existence of entangled (i.e., non-separable) QCCEs, which are unlearnable via the current paradigm of no-regret learning.

引用

页数：24

共 50 条

[31] Simple Uncoupled No-regret Learning Dynamics for Extensive-form Correlated Equilibrium
Farina, Gabriele
Celli, Andrea
Marchesi, Alberto
Gatti, Nicola
JOURNAL OF THE ACM, 2022, 69 (06)
[32] Unifying convergence and no-regret in multiagent learning
Banerjee, Bikramjit
Peng, Jing
LEARNING AND ADAPTION IN MULTI-AGENT SYSTEMS, 2006, 3898 : 100 - 114
[33] Weighted Voting Via No-Regret Learning
Haghtalab, Nika
Noothigattu, Ritesh
Procaccia, Ariel D.
THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 1055 - 1062
[34] No-regret Exploration in Contextual Reinforcement Learning
Modi, Aditya
Tewari, Ambuj
CONFERENCE ON UNCERTAINTY IN ARTIFICIAL INTELLIGENCE (UAI 2020), 2020, 124 : 829 - 838
[35] A Reduction from Reinforcement Learning to No-Regret Online Learning
Cheng, Ching-An
des Combes, Remi Tachet
Boots, Byron
Gordon, Geoff
INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 3514 - 3523
[36] Distributive opportunistic spectrum access for cognitive radio using correlated equilibrium and no-regret learning
Han, Zhu
Pandana, Charles
Liu, K. J. Ray
2007 IEEE WIRELESS COMMUNICATIONS & NETWORKING CONFERENCE, VOLS 1-9, 2007, : 11 - +
[37] A Marriage between Adversarial Team Games and 2-player Games: Enabling Abstractions, No-regret Learning, and Subgame Solving
Carminati, Luca
Cacciamani, Federico
Ciccone, Marco
Gatti, Nicola
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
[38] Tight last-iterate convergence rates for no-regret learning in multi-player games
Golowich, Noah
Pattathil, Sarath
Daskalakis, Constantinos
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
[39] No-Regret and Incentive-Compatible Online Learning
Freeman, Rupert
Pennock, David M.
Podimata, Chara
Vaughan, Jennifer Wortman
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
[40] No-Regret Learning in Partially-Informed Auctions
Guo, Wenshuo
Jordan, Michael I.
Vitercik, Ellen
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,

← 1 2 3 4 5 →