Strategy synthesis for zero-sum neuro-symbolic concurrent stochastic games

被引：0

作者：

Yan, Rui ^{[1
]}

Santos, Gabriel ^{[1
]}

Norman, Gethin ^{[1
,2
]}

Parker, David ^{[1
]}

Kwiatkowska, Marta ^{[1
]}

机构：

[1] Univ Oxford, Dept Comp Sci, Oxford OX1 2JD, England

[2] Univ Glasgow, Sch Comp Sci, Glasgow G12 8QQ, Scotland

来源：

INFORMATION AND COMPUTATION | 2024年 / 300卷

基金：

欧盟地平线“2020”;

关键词：

Stochastic games; Neuro-symbolic systems; Value iteration; Policy iteration; Borel state spaces; POLICY ITERATION; MARKOV GAMES;

D O I：

10.1016/j.ic.2024.105193

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Neuro-symbolic approaches to artificial intelligence, which combine neural networks with classical symbolic techniques, are growing in prominence, necessitating formal approaches to reason about their correctness. We propose a novel modelling formalism called neurosymbolic concurrent stochastic games (NS-CSGs), which comprise two probabilistic finitestate agents interacting in a shared continuous-state environment. Each agent observes the environment using a neural perception mechanism, which converts inputs such as images into symbolic percepts, and makes decisions symbolically. We focus on the class of NS-CSGs with Borel state spaces and prove the existence and measurability of the value function for zero-sum discounted cumulative rewards under piecewise-constant restrictions. To compute values and synthesise strategies, we first introduce a Borel measurable piecewiseconstant (B-PWC) representation of value functions and propose a B-PWC value iteration. Second, we introduce two novel representations for the value functions and strategies, and propose a minimax-action-free policy iteration based on alternating player choices. (c) 2024 The Author(s). Published by Elsevier Inc. This is an open access article under the CC BY license (http://creativecommons .org /licenses /by /4 .0/).

引用

页数：28

共 50 条

[1] Finite-horizon Equilibria for Neuro-symbolic Concurrent Stochastic Games
Yan, Rui
Santos, Gabriel
Duan, Xiaoming
Parker, David
Kwiatkowska, Marta
UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, VOL 180, 2022, 180 : 2170 - 2180
[2] Optimality in different strategy classes in zero-sum stochastic games
J. Flesch
F. Thuijsman
O. J. Vrieze
Mathematical Methods of Operations Research, 2002, 56 : 315 - 322
[3] Optimality in different strategy classes in zero-sum stochastic games
Flesch, J
Thuijsman, F
Vrieze, OJ
MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2002, 56 (02) : 315 - 322
[4] Zero-sum ergodic stochastic games
Jaskiewicz, Anna
Nowak, Andrzej S.
2005 44TH IEEE CONFERENCE ON DECISION AND CONTROL & EUROPEAN CONTROL CONFERENCE, VOLS 1-8, 2005, : 1741 - 1746
[5] Definable Zero-Sum Stochastic Games
Bolte, Jerome
Gaubert, Stephane
Vigeral, Guillaume
MATHEMATICS OF OPERATIONS RESEARCH, 2015, 40 (01) : 171 - 191
[6] Zero-Sum Stochastic Stackelberg Games
Goktas, Denizalp
Zhao, Jiayi
Greenwald, Amy
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[7] Strategy investments in zero-sum games
Garcia, Raul
Hosseinian, Seyedmohammadhossein
Pai, Mallesh
Schaefer, Andrew J.
OPTIMIZATION LETTERS, 2024, 18 (08) : 1771 - 1789
[8] Zero-sum Stochastic Games with Asymmetric Information
Kartik, Dhruva
Nayyar, Ashutosh
2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019, : 4061 - 4066
[9] Zero-sum stochastic games with partial information
Ghosh, MK
McDonald, D
Sinha, S
JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2004, 121 (01) : 99 - 118
[10] Zero-Sum Stochastic Games with Partial Information
M. K. Ghosh
D. McDonald
S. Sinha
Journal of Optimization Theory and Applications, 2004, 121 : 99 - 118

← 1 2 3 4 5 →