Dynamic Programming for One-sided Partially Observable Pursuit-evasion Games

被引:1
|
作者
Horak, Karel [1 ]
Bosansky, Branislav [1 ]
机构
[1] Czech Tech Univ, Dept Comp Sci, Fac Elect Engn, Prague, Czech Republic
关键词
Pursuit-evasion Games; One-sided Partial Observability; Infinite Horizon; Value Iteration; Concurrent Moves;
D O I
10.5220/0006190605030510
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Pursuit-evasion scenarios appear widely in robotics, security domains, and many other real-world situations. We focus on two-player pursuit-evasion games with concurrent moves, infinite horizon, and discounted rewards. We assume that the players have partial observability, however, the evader has an advantage of knowing the current position of pursuer's units. This setting is particularly interesting for security domains where a robust strategy, maximizing the utility in the worst-case scenario, is often desirable. We provide, to the best of our knowledge, the first algorithm that provably converges to the value of a partially observable pursuit-evasion game with infinite horizon. Our algorithm extends well-known value iteration algorithm by exploiting that (1) value functions of our game depend only on the position of the pursuer and the belief he has about the position of the evader, and (2) that these functions are piecewise linear and convex in the belief space.
引用
收藏
页码:503 / 510
页数:8
相关论文
共 50 条
  • [1] A Point-Based Approximate Algorithm for One-Sided Partially Observable Pursuit-Evasion Games
    Horak, Karel
    Bosansky, Branislav
    [J]. DECISION AND GAME THEORY FOR SECURITY, (GAMESEC 2016), 2016, 9996 : 435 - 454
  • [2] A GENERALIZED VALUE OF DYNAMIC PURSUIT-EVASION GAMES
    TOMSKI, GV
    [J]. VESTNIK LENINGRADSKOGO UNIVERSITETA SERIYA MATEMATIKA MEKHANIKA ASTRONOMIYA, 1981, (04): : 40 - 44
  • [3] On applied nonlinear and bilevel programming for pursuit-evasion games
    Ehtamo, H
    Raivio, T
    [J]. JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2001, 108 (01) : 65 - 96
  • [4] On Applied Nonlinear and Bilevel Programming or Pursuit-Evasion Games
    H. Ehtamo
    T. Raivio
    [J]. Journal of Optimization Theory and Applications, 2001, 108 : 65 - 96
  • [5] Heuristic Search Value Iteration for One-Sided Partially Observable Stochastic Games
    Horak, Karel
    Bosansky, Branislav
    Pechoucek, Michal
    [J]. THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 558 - 564
  • [6] Solving zero-sum one-sided partially observable stochastic games
    Horak, Karel
    Bosansky, Branislav
    Kovarik, Vojtech
    Kiekintveld, Christopher
    [J]. ARTIFICIAL INTELLIGENCE, 2023, 316
  • [7] PARTIALLY OBSERVABLE LINEAR QUADRATIC STOCHASTIC PURSUIT EVASION GAMES
    CHAN, WL
    NG, SK
    [J]. COMPUTERS & MATHEMATICS WITH APPLICATIONS, 1987, 13 (1-3) : 181 - 189
  • [8] PURSUIT-EVASION GAMES ON GRAPHS
    CHUNG, FRK
    COHEN, JE
    GRAHAM, RL
    [J]. JOURNAL OF GRAPH THEORY, 1988, 12 (02) : 159 - 167
  • [9] ON A CLASS OF PURSUIT-EVASION GAMES
    CHYUNG, DH
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1970, AC15 (04) : 458 - &
  • [10] Some Practical Approaches to Pursuit-Evasion Dynamic Games
    F. Imado
    [J]. Cybernetics and Systems Analysis, 2002, 38 (2) : 276 - 291