Contextual Combinatorial Bandits in Real-Time Strategy Games

被引：0

作者：

Yang, Zuozhi ^{[1
]}

Ontanon, Santiago ^{[1
,2
]}

机构：

[1] Drexel Univ, Philadelphia, PA 19104 USA

[2] Google, Mountain View, CA 94043 USA

来源：

2021 IEEE CONFERENCE ON GAMES (COG) | 2021年

关键词：

D O I：

10.1109/COG52621.2021.9619063

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The contextual bandit problem is a richer framework than stochastic bandits that has many applications since it allows the learner has access to additional information (the "context"). This additional information can help predict the expected utility of the different arms in many cases. Moreover, combinatorial bandits are a class of bandit problem where the space of possible arms to choose from has a combinatorial structure. In this paper, we investigate the bandit problem where we have both contextual information and there is a combinatorial arm structure, which we call contextual combinatorial bandits (CCMABs). We apply contextual combinatorial bandits to real-time strategy (RTS) games, and study different algorithms to solve CCMABs with different trade-offs of computational efficiency and learning biases. Specifically, we focus on the problem of determining map-specific game playing policies, and formulate it as a CCMABs.

引用

页码：735 / 743

页数：9

共 50 条

[1] Combinatorial Multi-armed Bandits for Real-Time Strategy Games
Ontanon, Santiago
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2017, 58 : 665 - 702
[2] Combinatorial multi-armed bandits for real-time strategy games
Ontañón, Santiago (santi@cs.drexel.edu), 1600, AI Access Foundation (58):
[3] Learning in Real-time Strategy Games
Padmanabhan, Vineet
Goud, Pranay
Pujari, Arun K.
Sethy, Harshit
2015 14TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY (ICIT 2015), 2015, : 165 - 170
[4] Contextual Combinatorial Cascading Bandits
Li, Shuai
Wang, Baoxiang
Zhang, Shengyu
Chen, Wei
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
[5] Interfacing Agents to Real-Time Strategy Games
Jensen, Andreas Schmidt
Kayso-Rordam, Christian
Villadsen, Jorgen
THIRTEENTH SCANDINAVIAN CONFERENCE ON ARTIFICIAL INTELLIGENCE (SCAI 2015), 2015, 278 : 68 - 77
[6] Programmatic Strategies for Real-Time Strategy Games
Marino, Julian R. H.
Moraes, Rubens O.
Oliveira, Tassiana C.
Toledo, Claudio
Lelis, Levi H. S.
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 381 - 389
[7] Opponent modeling in real-time strategy games
Schadd, Frederik
Bakkes, Sander
Sprouck, Pieter
GAME-ON 2007: 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT GAMES AND SIMULATION, 2007, : 61 - 68
[8] Dynamic Formations in Real-Time Strategy Games
van der Heijden, Marcel
Bakkes, Sander
Spronck, Pieter
2008 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND GAMES, 2008, : 47 - 54
[9] Guest Editorial Real-Time Strategy Games
Buro, Michael
Ontanon, Santiago
Preuss, Mike
IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES, 2016, 8 (04) : 317 - 318
[10] An abstract model for real-time strategy games
Keaveney, David
O'Riordan, Colm
PROCEEDINGS OF CGAMES'2007: 11TH INTERNATIONAL CONFERENCE ON COMPUTER GAMES: AI, ANIMATION, MOBILE, EDUCATIONAL AND SERIOUS GAMES, 2007, 2007, : 61 - 65

← 1 2 3 4 5 →