Heuristic Search Value Iteration for One-Sided Partially Observable Stochastic Games

被引:0
|
作者
Horak, Karel [1 ]
Bosansky, Branislav [1 ]
Pechoucek, Michal [1 ]
机构
[1] Czech Tech Univ, Fac Elect Engn, Dept Comp Sci, Prague, Czech Republic
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Security problems can be modeled as two-player partially observable stochastic games with one-sided partial observability and infinite horizon (one-sided POSGs). We seek for optimal strategies of player 1 that correspond to robust strategies against the worst-case opponent (player 2) that is assumed to have a perfect information about the game. We present a novel algorithm for approximately solving one-sided POSGs based on the heuristic search value iteration (HSVI) for POMDPs. Our results include (1) theoretical properties of one-sided POSGs and their value functions, (2) guarantees showing the convergence of our algorithm to optimal strategies, and (3) practical demonstration of applicability and scalability of our algorithm on three different domains: pursuit-evasion, patrolling, and search games.
引用
收藏
页码:558 / 564
页数:7
相关论文
共 50 条
  • [1] Solving zero-sum one-sided partially observable stochastic games
    Horak, Karel
    Bosansky, Branislav
    Kovarik, Vojtech
    Kiekintveld, Christopher
    [J]. ARTIFICIAL INTELLIGENCE, 2023, 316
  • [2] Iterative algorithms for solving one-sided partially observable stochastic shortest path games
    Tomášek, Petr
    Horák, Karel
    Bošanský, Branislav
    [J]. International Journal of Approximate Reasoning, 2024, 175
  • [3] The Stackelberg equilibrium for one-sided zero-sum partially observable stochastic games
    Zheng, Wei
    Jung, Taeho
    Lin, Hai
    [J]. AUTOMATICA, 2022, 140
  • [4] Dynamic Programming for One-sided Partially Observable Pursuit-evasion Games
    Horak, Karel
    Bosansky, Branislav
    [J]. ICAART: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 2, 2017, : 503 - 510
  • [5] Heuristic Search Value Iteration for Zero-Sum Stochastic Games
    Buffet, Olivier
    Dibangoye, Jilles
    Saffidine, Abdallah
    Thomas, Vincent
    [J]. IEEE TRANSACTIONS ON GAMES, 2021, 13 (03) : 239 - 248
  • [6] ON ONE-SIDED STOCHASTIC GAMES AND THEIR APPLICATIONS TO FINANCE
    Dshalalow, Jewgeni H.
    Robinson, Randy
    [J]. STOCHASTIC MODELS, 2012, 28 (01) : 1 - 14
  • [7] A Point-Based Approximate Algorithm for One-Sided Partially Observable Pursuit-Evasion Games
    Horak, Karel
    Bosansky, Branislav
    [J]. DECISION AND GAME THEORY FOR SECURITY, (GAMESEC 2016), 2016, 9996 : 435 - 454
  • [8] Compact Representation of Value Function in Partially Observable Stochastic Games
    Horak, Karel
    Bosansky, Branislav
    Kiekintveld, Christopher
    Kamhoua, Charles
    [J]. PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 350 - 356
  • [9] Dynamic programming for partially observable stochastic games
    Hansen, EA
    Bernstein, DS
    Zilberstein, S
    [J]. PROCEEDING OF THE NINETEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND THE SIXTEENTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2004, : 709 - 715
  • [10] One-Sided Games in a War of Attrition
    Asako, Yasushi
    [J]. B E JOURNAL OF THEORETICAL ECONOMICS, 2015, 15 (02): : 313 - 331