Strategy synthesis for zero-sum neuro-symbolic concurrent stochastic games

被引：0

作者：

Yan, Rui ^{[1
]}

Santos, Gabriel ^{[1
]}

Norman, Gethin ^{[1
,2
]}

Parker, David ^{[1
]}

Kwiatkowska, Marta ^{[1
]}

机构：

[1] Univ Oxford, Dept Comp Sci, Oxford OX1 2JD, England

[2] Univ Glasgow, Sch Comp Sci, Glasgow G12 8QQ, Scotland

来源：

INFORMATION AND COMPUTATION | 2024年 / 300卷

基金：

欧盟地平线“2020”;

关键词：

Stochastic games; Neuro-symbolic systems; Value iteration; Policy iteration; Borel state spaces; POLICY ITERATION; MARKOV GAMES;

D O I：

10.1016/j.ic.2024.105193

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Neuro-symbolic approaches to artificial intelligence, which combine neural networks with classical symbolic techniques, are growing in prominence, necessitating formal approaches to reason about their correctness. We propose a novel modelling formalism called neurosymbolic concurrent stochastic games (NS-CSGs), which comprise two probabilistic finitestate agents interacting in a shared continuous-state environment. Each agent observes the environment using a neural perception mechanism, which converts inputs such as images into symbolic percepts, and makes decisions symbolically. We focus on the class of NS-CSGs with Borel state spaces and prove the existence and measurability of the value function for zero-sum discounted cumulative rewards under piecewise-constant restrictions. To compute values and synthesise strategies, we first introduce a Borel measurable piecewiseconstant (B-PWC) representation of value functions and propose a B-PWC value iteration. Second, we introduce two novel representations for the value functions and strategies, and propose a minimax-action-free policy iteration based on alternating player choices. (c) 2024 The Author(s). Published by Elsevier Inc. This is an open access article under the CC BY license (http://creativecommons .org /licenses /by /4 .0/).

引用

页数：28

共 50 条

[31] New Algorithms for Solving Zero-Sum Stochastic Games
Oliu-Barton, Miquel
MATHEMATICS OF OPERATIONS RESEARCH, 2021, 46 (01) : 255 - 267
[32] TWO-PERSON ZERO-SUM STOCHASTIC GAMES
Baykal-Guersoy, Melike
ANNALS OF OPERATIONS RESEARCH, 1991, 28 (01) : 135 - 152
[33] Limit Optimal Trajectories in Zero-Sum Stochastic Games
Sorin, Sylvain
Vigeral, Guillaume
DYNAMIC GAMES AND APPLICATIONS, 2020, 10 (02) : 555 - 572
[34] LP Formulation of Asymmetric Zero-Sum Stochastic Games
Li, Lichun
Shamma, Jeff
2014 IEEE 53RD ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2014, : 1930 - 1935
[35] APPROXIMATION THEOREMS FOR ZERO-SUM NONSTATIONARY STOCHASTIC GAMES
NOWAK, AS
PROCEEDINGS OF THE AMERICAN MATHEMATICAL SOCIETY, 1984, 92 (03) : 418 - 424
[36] Almost stationary ε-equilibria in zero-sum stochastic games
Flesch, J
Thuijsman, F
Vrieze, OJ
JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2000, 105 (02) : 371 - 389
[37] Stochastic zero-sum differential games and backward stochastic differential equations
Oufdil, Khalid
RANDOM OPERATORS AND STOCHASTIC EQUATIONS, 2023, 31 (01) : 65 - 86
[38] Zero-sum constrained stochastic games with independent state processes
Altman, E
Avrachenkov, K
Marquez, R
Miller, G
MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2005, 62 (03) : 375 - 386
[39] AN ACCRETIVE OPERATOR APPROACH TO ERGODIC ZERO-SUM STOCHASTIC GAMES
Hochart, Antoine
JOURNAL OF DYNAMICS AND GAMES, 2019, 6 (01): : 27 - 51
[40] Zero-Sum Risk-Sensitive Stochastic Differential Games
Basu, Arnab
Ghosh, Mrinal K.
MATHEMATICS OF OPERATIONS RESEARCH, 2012, 37 (03) : 437 - 449

← 1 2 3 4 5 →