A Marriage between Adversarial Team Games and 2-player Games: Enabling Abstractions, No-regret Learning, and Subgame Solving

被引:0
|
作者
Carminati, Luca [1 ]
Cacciamani, Federico [1 ]
Ciccone, Marco [2 ]
Gatti, Nicola [1 ]
机构
[1] Politecn Milan, Milan, Italy
[2] Politecn Torino, Turin, Italy
关键词
LEVEL;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Ex ante correlation is becoming the mainstream approach for sequential adversarial team games, where a team of players faces another team in a zero-sum game. It is known that team members' asymmetric information makes both equilibrium computation APX-hard and team's strategies not directly representable on the game tree. This latter issue prevents the adoption of successful tools for huge 2-player zero-sum games such as, e.g., abstractions, no-regret learning, and subgame solving. This work shows that we can recover from this weakness by bridging the gap between sequential adversarial team games and 2player games. In particular, we propose a new, suitable game representation that we call teampublic-information, in which a team is represented as a single coordinator who only knows information common to the whole team and prescribes to each member an action for any possible private state. The resulting representation is highly explainable, being a 2-player tree in which the team's strategies are behavioral with a direct interpretation and more expressive than the original extensive form when designing abstractions. Furthermore, we prove payoff equivalence of our representation, and we provide techniques that, starting directly from the extensive form, generate dramatically more compact representations without information loss. Finally, we experimentally evaluate our techniques when applied to a standard testbed, comparing their performance with the current state of the art.
引用
收藏
页数:20
相关论文
共 6 条
  • [1] Subgame Solving in Adversarial Team Games
    Zhang, Brian Hu
    Carminati, Luca
    Cacciamani, Federico
    Farina, Gabriele
    Olivieri, Pierriccardo
    Gatti, Nicola
    Sandholm, Tuomas
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [2] THE ROLE OF AUTOSHAPING IN COOPERATIVE 2-PLAYER GAMES BETWEEN STARLINGS
    REBOREDA, JC
    KACELNIK, A
    [J]. JOURNAL OF THE EXPERIMENTAL ANALYSIS OF BEHAVIOR, 1993, 60 (01) : 67 - 83
  • [3] Tight last-iterate convergence rates for no-regret learning in multi-player games
    Golowich, Noah
    Pattathil, Sarath
    Daskalakis, Constantinos
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [4] Near-Optimal No-Regret Learning for Correlated Equilibria in Multi-player General-Sum Games
    Anagnostides, Ioannis
    Daskalakis, Constantinos
    Farina, Gabriele
    Fishelson, Maxwell
    Golowich, Noah
    Sandholm, Tuomas
    [J]. PROCEEDINGS OF THE 54TH ANNUAL ACM SIGACT SYMPOSIUM ON THEORY OF COMPUTING (STOC '22), 2022, : 736 - 749
  • [5] R2-B2: Recursive Reasoning-Based Bayesian Optimization for No-Regret Learning in Games
    Dai, Zhongxiang
    Chen, Yizhou
    Low, Bryan Kian Hsiang
    Jaillet, Patrick
    Ho, Teck-Hua
    [J]. 25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019), 2019,
  • [6] R2-B2: Recursive Reasoning-Based Bayesian Optimization for No-Regret Learning in Games
    Dai, Zhongxiang
    Chen, Yizhou
    Low, Bryan Kian Hsiang
    Jaillet, Patrick
    Ho, Teck-Hua
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119