Contextual Combinatorial Bandits in Real-Time Strategy Games

被引:0
|
作者
Yang, Zuozhi [1 ]
Ontanon, Santiago [1 ,2 ]
机构
[1] Drexel Univ, Philadelphia, PA 19104 USA
[2] Google, Mountain View, CA 94043 USA
关键词
D O I
10.1109/COG52621.2021.9619063
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The contextual bandit problem is a richer framework than stochastic bandits that has many applications since it allows the learner has access to additional information (the "context"). This additional information can help predict the expected utility of the different arms in many cases. Moreover, combinatorial bandits are a class of bandit problem where the space of possible arms to choose from has a combinatorial structure. In this paper, we investigate the bandit problem where we have both contextual information and there is a combinatorial arm structure, which we call contextual combinatorial bandits (CCMABs). We apply contextual combinatorial bandits to real-time strategy (RTS) games, and study different algorithms to solve CCMABs with different trade-offs of computational efficiency and learning biases. Specifically, we focus on the problem of determining map-specific game playing policies, and formulate it as a CCMABs.
引用
收藏
页码:735 / 743
页数:9
相关论文
共 50 条
  • [1] Combinatorial Multi-armed Bandits for Real-Time Strategy Games
    Ontanon, Santiago
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2017, 58 : 665 - 702
  • [2] Combinatorial multi-armed bandits for real-time strategy games
    Ontañón, Santiago (santi@cs.drexel.edu), 1600, AI Access Foundation (58):
  • [3] Learning in Real-time Strategy Games
    Padmanabhan, Vineet
    Goud, Pranay
    Pujari, Arun K.
    Sethy, Harshit
    2015 14TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY (ICIT 2015), 2015, : 165 - 170
  • [4] Contextual Combinatorial Cascading Bandits
    Li, Shuai
    Wang, Baoxiang
    Zhang, Shengyu
    Chen, Wei
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
  • [5] Interfacing Agents to Real-Time Strategy Games
    Jensen, Andreas Schmidt
    Kayso-Rordam, Christian
    Villadsen, Jorgen
    THIRTEENTH SCANDINAVIAN CONFERENCE ON ARTIFICIAL INTELLIGENCE (SCAI 2015), 2015, 278 : 68 - 77
  • [6] Programmatic Strategies for Real-Time Strategy Games
    Marino, Julian R. H.
    Moraes, Rubens O.
    Oliveira, Tassiana C.
    Toledo, Claudio
    Lelis, Levi H. S.
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 381 - 389
  • [7] Opponent modeling in real-time strategy games
    Schadd, Frederik
    Bakkes, Sander
    Sprouck, Pieter
    GAME-ON 2007: 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT GAMES AND SIMULATION, 2007, : 61 - 68
  • [8] Dynamic Formations in Real-Time Strategy Games
    van der Heijden, Marcel
    Bakkes, Sander
    Spronck, Pieter
    2008 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND GAMES, 2008, : 47 - 54
  • [9] Guest Editorial Real-Time Strategy Games
    Buro, Michael
    Ontanon, Santiago
    Preuss, Mike
    IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES, 2016, 8 (04) : 317 - 318
  • [10] An abstract model for real-time strategy games
    Keaveney, David
    O'Riordan, Colm
    PROCEEDINGS OF CGAMES'2007: 11TH INTERNATIONAL CONFERENCE ON COMPUTER GAMES: AI, ANIMATION, MOBILE, EDUCATIONAL AND SERIOUS GAMES, 2007, 2007, : 61 - 65