Heuristic Search Value Iteration for One-Sided Partially Observable Stochastic Games

被引：0

作者：

Horak, Karel ^{[1
]}

Bosansky, Branislav ^{[1
]}

Pechoucek, Michal ^{[1
]}

机构：

[1] Czech Tech Univ, Fac Elect Engn, Dept Comp Sci, Prague, Czech Republic

来源：

THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE | 2017年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Security problems can be modeled as two-player partially observable stochastic games with one-sided partial observability and infinite horizon (one-sided POSGs). We seek for optimal strategies of player 1 that correspond to robust strategies against the worst-case opponent (player 2) that is assumed to have a perfect information about the game. We present a novel algorithm for approximately solving one-sided POSGs based on the heuristic search value iteration (HSVI) for POMDPs. Our results include (1) theoretical properties of one-sided POSGs and their value functions, (2) guarantees showing the convergence of our algorithm to optimal strategies, and (3) practical demonstration of applicability and scalability of our algorithm on three different domains: pursuit-evasion, patrolling, and search games.

引用

页码：558 / 564

页数：7

共 50 条

[1] Solving zero-sum one-sided partially observable stochastic games
Horak, Karel
Bosansky, Branislav
Kovarik, Vojtech
Kiekintveld, Christopher
[J]. ARTIFICIAL INTELLIGENCE, 2023, 316
[2] Iterative algorithms for solving one-sided partially observable stochastic shortest path games
Tomášek, Petr
Horák, Karel
Bošanský, Branislav
[J]. International Journal of Approximate Reasoning, 2024, 175
[3] The Stackelberg equilibrium for one-sided zero-sum partially observable stochastic games
Zheng, Wei
Jung, Taeho
Lin, Hai
[J]. AUTOMATICA, 2022, 140
[4] Dynamic Programming for One-sided Partially Observable Pursuit-evasion Games
Horak, Karel
Bosansky, Branislav
[J]. ICAART: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 2, 2017, : 503 - 510
[5] Heuristic Search Value Iteration for Zero-Sum Stochastic Games
Buffet, Olivier
Dibangoye, Jilles
Saffidine, Abdallah
Thomas, Vincent
[J]. IEEE TRANSACTIONS ON GAMES, 2021, 13 (03) : 239 - 248
[6] ON ONE-SIDED STOCHASTIC GAMES AND THEIR APPLICATIONS TO FINANCE
Dshalalow, Jewgeni H.
Robinson, Randy
[J]. STOCHASTIC MODELS, 2012, 28 (01) : 1 - 14
[7] A Point-Based Approximate Algorithm for One-Sided Partially Observable Pursuit-Evasion Games
Horak, Karel
Bosansky, Branislav
[J]. DECISION AND GAME THEORY FOR SECURITY, (GAMESEC 2016), 2016, 9996 : 435 - 454
[8] Compact Representation of Value Function in Partially Observable Stochastic Games
Horak, Karel
Bosansky, Branislav
Kiekintveld, Christopher
Kamhoua, Charles
[J]. PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 350 - 356
[9] Dynamic programming for partially observable stochastic games
Hansen, EA
Bernstein, DS
Zilberstein, S
[J]. PROCEEDING OF THE NINETEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND THE SIXTEENTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2004, : 709 - 715
[10] One-Sided Games in a War of Attrition
Asako, Yasushi
[J]. B E JOURNAL OF THEORETICAL ECONOMICS, 2015, 15 (02): : 313 - 331

← 1 2 3 4 5 →