CFR-MIX: Solving Imperfect Information Extensive-Form Games with Combinatorial Action Space

被引:0
|
作者
Li, Shuxin [1 ]
Zhang, Youzhi [2 ]
Wang, Xinrun [1 ]
Xue, Wanqi [1 ]
An, Bo [1 ]
机构
[1] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore, Singapore
[2] Dartmouth Coll, Dept Comp Sci, Hanover, NH USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In many real-world scenarios, a team of agents must coordinate with each other to compete against an opponent. The challenge of solving this type of game is that the team's joint action space grows exponentially with the number of agents, which results in the inefficiency of the existing algorithms, e.g., Counterfactual Regret Minimization (CFR). To address this problem, we propose a new framework of CFR: CFR-MIX. Firstly, we propose a new strategy representation that represents a joint action strategy using individual strategies of all agents and a consistency relationship to maintain the cooperation between agents. To compute the equilibrium with individual strategies under the CFR framework, we transform the consistency relationship between strategies to the consistency relationship between the cumulative regret values. Furthermore, we propose a novel decomposition method over cumulative regret values to guarantee the consistency relationship between the cumulative regret values. Finally, we introduce our new algorithm CFR-MIX which employs a mixing layer to estimate cumulative regret values of joint actions as a non-linear combination of cumulative regret values of individual actions. Experimental results show that CFR-MIX outperforms existing algorithms on various games significantly.
引用
收藏
页码:3663 / 3669
页数:7
相关论文
共 15 条
  • [1] An Efficient Deep Reinforcement Learning Algorithm for Solving Imperfect Information Extensive-Form Games
    Meng, Linjian
    Ge, Zhenxing
    Tian, Pinzhuo
    An, Bo
    Gao, Yang
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 5, 2023, : 5823 - 5831
  • [2] Near-Optimal Learning of Extensive-Form Games with Imperfect Information
    Bai, Yu
    Jin, Chi
    Mei, Song
    Yu, Tiancheng
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [3] An Algorithm for Constructing and Solving Imperfect Recall Abstractions of Large Extensive-Form Games
    Cermak, Jiri
    Bosansky, Branislav
    Lisy, Viliam
    PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 936 - 942
  • [4] Iterative Algorithm for Solving Two-player Zero-sum Extensive-form Games with Imperfect Information
    Bosansky, Branislav
    Kiekintveld, Christopher
    Lisy, Viliam
    Pechoucek, Michal
    20TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE (ECAI 2012), 2012, 242 : 193 - +
  • [5] Solving Large Extensive-Form Games with Strategy Constraints
    Davis, Trevor
    Waugh, Kevin
    Bowling, Michael
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 1861 - 1868
  • [6] Discretization of Continuous Action Spaces in Extensive-Form Games
    Kroer, Christian
    Sandholm, Tuomas
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS (AAMAS'15), 2015, : 47 - 56
  • [7] An Exact Double-Oracle Algorithm for Zero-Sum Extensive-Form Games with Imperfect Information
    Bosansky, Branislav
    Kiekintveld, Christopher
    Lisy, Viliam
    Pechoucek, Michal
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2014, 51 : 829 - 866
  • [8] Block-Coordinate Methods and Restarting for Solving Extensive-Form Games
    Chakrabarti, Darshan
    Diakonikolas, Jelena
    Kroer, Christian
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [9] Computing Maxmin Strategies in Extensive-form Zero-sum Games with Imperfect Recall
    Bosansky, Branislav
    Cermak, Jiri
    Horak, Karel
    Pechoucek, Michal
    ICAART: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 2, 2017, : 63 - 74
  • [10] Evolving Action Abstractions for Real-Time Planning in Extensive-Form Games
    Marino, Julian R. H.
    Moraes, Rubens O.
    Toledo, Claudio
    Lelis, Levi H. S.
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 2330 - 2337