No-Regret Learning and Equilibrium Computation in Quantum Games

被引:0
|
作者
Lin, Wayne [1 ]
Piliouras, Georgios [1 ]
Sim, Ryann [1 ]
Varvitsiotis, Antonios [1 ]
机构
[1] Singapore Univ Technol & Design, Singapore, Singapore
来源
QUANTUM | 2024年 / 8卷
基金
新加坡国家研究基金会;
关键词
D O I
10.22331/q-2024-12-17-1569
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
As quantum processors advance, the emergence of large-scale decentralized systems involving interacting quantum-enabled agents is on the horizon. Recent research efforts have explored quantum versions of Nash and correlated equilibria as solution concepts of strategic quantum interactions, but these approaches did not directly connect to decentralized adaptive setups where agents possess limited information. This paper delves into the dynamics of quantum-enabled agents within decentralized systems that employ no-regret algorithms to update their behaviors over time. Specifically, we investigate two-player quantum zero-sum games and polymatrix quantum zero-sum games, showing that no-regret algorithms converge to separable quantum Nash equilibria in time- average. In the case of general multi-player quantum games, our work leads to a novel solution concept, that of the separable quantum coarse correlated equilibria (QCCE), as the convergent outcome of the time-averaged behavior no-regret algorithms, offering a natural solution concept for decentralized quantum systems. Finally, we show that computing QCCEs can be formulated as a semidefinite program and establish the existence of entangled (i.e., non-separable) QCCEs, which are unlearnable via the current paradigm of no-regret learning.
引用
收藏
页数:24
相关论文
共 50 条
  • [21] No-regret Reinforcement Learning
    Gopalan, Aditya
    2019 FIFTH INDIAN CONTROL CONFERENCE (ICC), 2019, : 16 - 16
  • [22] No-regret learning for repeated non-cooperative games with lossy bandits
    Liu, Wenting
    Lei, Jinlong
    Yi, Peng
    Hong, Yiguang
    AUTOMATICA, 2024, 160
  • [23] On the Complexity of Computing Sparse Equilibria and Lower Bounds for No-Regret Learning in Games
    Anagnostides, Ioannis
    Kalavasis, Alkis
    Sandholm, Tuomas
    Zampetakis, Manolis
    15TH INNOVATIONS IN THEORETICAL COMPUTER SCIENCE CONFERENCE, ITCS 2024, 2024,
  • [24] No-Regret Learning in Time-Varying Zero-Sum Games
    Zhang, Mengxiao
    Zhao, Peng
    Luo, Haipeng
    Zhou, Zhi-Hua
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [25] Near-Optimal No-Regret Learning Dynamics for General Convex Games
    Farina, Gabriele
    Anagnostides, Ioannis
    Luo, Haipeng
    Lee, Chung-Wei
    Kroer, Christian
    Sandholm, Tuomas
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [26] Doubly Optimal No-Regret Online Learning in Strongly Monotone Games with Bandit Feedback
    Ba, Wenjia
    Lin, Tianyi
    Zhang, Jiawei
    Zhou, Zhengyuan
    OPERATIONS RESEARCH, 2025,
  • [27] No-Regret Distributed Learning in Two-Network Zero-Sum Games
    Huang, Shijie
    Lei, Jinlong
    Hong, Yiguang
    Shanbhag, Uday, V
    2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 924 - 929
  • [28] No-regret learning in games with noisy feedback: Faster rates and adaptivity via learning rate separation
    Hsieh, Yu-Guan
    Antonakopoulos, Kimon
    Cevher, Volkan
    Mertikopoulos, Panayotis
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [29] No-Regret Learning Supports Voters' Competence
    Spelda, Petr
    Stritecky, Vit
    Symons, John
    SOCIAL EPISTEMOLOGY, 2024, 38 (05) : 543 - 559
  • [30] On the convergence of no-regret learning in selfish routing
    Krichene, Walid
    Drighes, Benjamin
    Bayen, Alexandre
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 32 (CYCLE 2), 2014, 32 : 163 - 171