No-Regret Learning and Equilibrium Computation in Quantum Games

被引：0

作者：

Lin, Wayne ^{[1
]}

Piliouras, Georgios ^{[1
]}

Sim, Ryann ^{[1
]}

Varvitsiotis, Antonios ^{[1
]}

机构：

[1] Singapore Univ Technol & Design, Singapore, Singapore

来源：

QUANTUM | 2024年 / 8卷

基金：

新加坡国家研究基金会;

关键词：

D O I：

10.22331/q-2024-12-17-1569

中图分类号：

O4 [物理学];

学科分类号：

0702 ;

摘要：

As quantum processors advance, the emergence of large-scale decentralized systems involving interacting quantum-enabled agents is on the horizon. Recent research efforts have explored quantum versions of Nash and correlated equilibria as solution concepts of strategic quantum interactions, but these approaches did not directly connect to decentralized adaptive setups where agents possess limited information. This paper delves into the dynamics of quantum-enabled agents within decentralized systems that employ no-regret algorithms to update their behaviors over time. Specifically, we investigate two-player quantum zero-sum games and polymatrix quantum zero-sum games, showing that no-regret algorithms converge to separable quantum Nash equilibria in time- average. In the case of general multi-player quantum games, our work leads to a novel solution concept, that of the separable quantum coarse correlated equilibria (QCCE), as the convergent outcome of the time-averaged behavior no-regret algorithms, offering a natural solution concept for decentralized quantum systems. Finally, we show that computing QCCEs can be formulated as a semidefinite program and establish the existence of entangled (i.e., non-separable) QCCEs, which are unlearnable via the current paradigm of no-regret learning.

引用

页数：24

共 50 条

[21] No-regret Reinforcement Learning
Gopalan, Aditya
2019 FIFTH INDIAN CONTROL CONFERENCE (ICC), 2019, : 16 - 16
[22] No-regret learning for repeated non-cooperative games with lossy bandits
Liu, Wenting
Lei, Jinlong
Yi, Peng
Hong, Yiguang
AUTOMATICA, 2024, 160
[23] On the Complexity of Computing Sparse Equilibria and Lower Bounds for No-Regret Learning in Games
Anagnostides, Ioannis
Kalavasis, Alkis
Sandholm, Tuomas
Zampetakis, Manolis
15TH INNOVATIONS IN THEORETICAL COMPUTER SCIENCE CONFERENCE, ITCS 2024, 2024,
[24] No-Regret Learning in Time-Varying Zero-Sum Games
Zhang, Mengxiao
Zhao, Peng
Luo, Haipeng
Zhou, Zhi-Hua
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
[25] Near-Optimal No-Regret Learning Dynamics for General Convex Games
Farina, Gabriele
Anagnostides, Ioannis
Luo, Haipeng
Lee, Chung-Wei
Kroer, Christian
Sandholm, Tuomas
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
[26] Doubly Optimal No-Regret Online Learning in Strongly Monotone Games with Bandit Feedback
Ba, Wenjia
Lin, Tianyi
Zhang, Jiawei
Zhou, Zhengyuan
OPERATIONS RESEARCH, 2025,
[27] No-Regret Distributed Learning in Two-Network Zero-Sum Games
Huang, Shijie
Lei, Jinlong
Hong, Yiguang
Shanbhag, Uday, V
2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 924 - 929
[28] No-regret learning in games with noisy feedback: Faster rates and adaptivity via learning rate separation
Hsieh, Yu-Guan
Antonakopoulos, Kimon
Cevher, Volkan
Mertikopoulos, Panayotis
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
[29] No-Regret Learning Supports Voters' Competence
Spelda, Petr
Stritecky, Vit
Symons, John
SOCIAL EPISTEMOLOGY, 2024, 38 (05) : 543 - 559
[30] On the convergence of no-regret learning in selfish routing
Krichene, Walid
Drighes, Benjamin
Bayen, Alexandre
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 32 (CYCLE 2), 2014, 32 : 163 - 171

← 1 2 3 4 5 →