CFR-MIX: Solving Imperfect Information Extensive-Form Games with Combinatorial Action Space

Cited by: 0

Authors:

Li, Shuxin [1]

Zhang, Youzhi [2]

Wang, Xinrun [1]

Xue, Wanqi [1]

An, Bo [1]

Affiliations:

[1] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore, Singapore

[2] Dartmouth Coll, Dept Comp Sci, Hanover, NH USA

Source:

PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021 | 2021

Keywords:

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In many real-world scenarios, a team of agents must coordinate with each other to compete against an opponent. The challenge of solving this type of game is that the team's joint action space grows exponentially with the number of agents, which results in the inefficiency of the existing algorithms, e.g., Counterfactual Regret Minimization (CFR). To address this problem, we propose a new framework of CFR: CFR-MIX. Firstly, we propose a new strategy representation that represents a joint action strategy using individual strategies of all agents and a consistency relationship to maintain the cooperation between agents. To compute the equilibrium with individual strategies under the CFR framework, we transform the consistency relationship between strategies to the consistency relationship between the cumulative regret values. Furthermore, we propose a novel decomposition method over cumulative regret values to guarantee the consistency relationship between the cumulative regret values. Finally, we introduce our new algorithm CFR-MIX which employs a mixing layer to estimate cumulative regret values of joint actions as a non-linear combination of cumulative regret values of individual actions. Experimental results show that CFR-MIX outperforms existing algorithms on various games significantly.
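To make the abstract's scaling argument concrete, here is a minimal Python sketch. It is not the paper's CFR-MIX algorithm: it shows only standard regret matching (the per-decision update used in CFR) applied to each agent separately, and then combines the individual strategies into a joint strategy by taking a product. The product form, the function names, and the regret values are assumptions made for illustration; the paper's consistency relationship and mixing layer may differ.

```python
from itertools import product


def regret_matching(cumulative_regrets):
    """Standard regret matching: play each action in proportion to its
    positive cumulative regret; fall back to uniform if none is positive."""
    positive = [max(r, 0.0) for r in cumulative_regrets]
    total = sum(positive)
    if total > 0:
        return [p / total for p in positive]
    n = len(cumulative_regrets)
    return [1.0 / n] * n


def joint_from_individual(individual_strategies):
    """Assumed product form: the probability of a joint action is the
    product of each agent's probability for its own component."""
    joint = {}
    action_ranges = (range(len(s)) for s in individual_strategies)
    for joint_action in product(*action_ranges):
        prob = 1.0
        for agent, action in enumerate(joint_action):
            prob *= individual_strategies[agent][action]
        joint[joint_action] = prob
    return joint


# Hypothetical cumulative regrets: 3 agents with 2 actions each.
team_regrets = [[2.0, -1.0], [1.0, 1.0], [0.0, 3.0]]
individual = [regret_matching(r) for r in team_regrets]
joint = joint_from_individual(individual)

# Individual storage grows linearly (3 * 2 entries here), while the
# joint table grows exponentially (2 ** 3 entries here).
print(sum(len(s) for s in individual), len(joint))
```

With n agents and k actions each, the individual tables hold n * k values while the joint table holds k ** n, which is the blow-up the abstract attributes to the team's joint action space.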

Pages: 3663-3669

Page count: 7

15 records in total

[1] An Efficient Deep Reinforcement Learning Algorithm for Solving Imperfect Information Extensive-Form Games
Meng, Linjian
Ge, Zhenxing
Tian, Pinzhuo
An, Bo
Gao, Yang
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 5, 2023: 5823-5831
[2] Near-Optimal Learning of Extensive-Form Games with Imperfect Information
Bai, Yu
Jin, Chi
Mei, Song
Yu, Tiancheng
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022
[3] An Algorithm for Constructing and Solving Imperfect Recall Abstractions of Large Extensive-Form Games
Cermak, Jiri
Bosansky, Branislav
Lisy, Viliam
PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017: 936-942
[4] Iterative Algorithm for Solving Two-player Zero-sum Extensive-form Games with Imperfect Information
Bosansky, Branislav
Kiekintveld, Christopher
Lisy, Viliam
Pechoucek, Michal
20TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE (ECAI 2012), 2012, 242: 193-+
[5] Solving Large Extensive-Form Games with Strategy Constraints
Davis, Trevor
Waugh, Kevin
Bowling, Michael
THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019: 1861-1868
[6] Discretization of Continuous Action Spaces in Extensive-Form Games
Kroer, Christian
Sandholm, Tuomas
PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS (AAMAS'15), 2015: 47-56
[7] An Exact Double-Oracle Algorithm for Zero-Sum Extensive-Form Games with Imperfect Information
Bosansky, Branislav
Kiekintveld, Christopher
Lisy, Viliam
Pechoucek, Michal
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2014, 51: 829-866
[8] Block-Coordinate Methods and Restarting for Solving Extensive-Form Games
Chakrabarti, Darshan
Diakonikolas, Jelena
Kroer, Christian
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023
[9] Computing Maxmin Strategies in Extensive-form Zero-sum Games with Imperfect Recall
Bosansky, Branislav
Cermak, Jiri
Horak, Karel
Pechoucek, Michal
ICAART: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 2, 2017: 63-74
[10] Evolving Action Abstractions for Real-Time Planning in Extensive-Form Games
Marino, Julian R. H.
Moraes, Rubens O.
Toledo, Claudio
Lelis, Levi H. S.
THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019: 2330-2337