Optimal strategies for adaptive zero-sum average Markov games

被引:5
|
作者
Adolfo Minjarez-Sosa, J. [1 ]
Vega-Amaya, Oscar [1 ]
机构
[1] Univ Sonora, Dept Matemat, Hermosillo 83000, Sonora, Mexico
关键词
Zero-sum Markov games; Average payoff; Adaptive strategies; STOCHASTIC GAMES; EQUATION;
D O I
10.1016/j.jmaa.2012.12.011
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
We consider a class of discrete-time two person zero-sum Markov games with Borel state and action spaces, and possibly unbounded payoffs. The game evolves according to the recursive equation x(n+1) = F (x(n), a(n), b(n), xi(n)), n = 0, 1, . . . , where the disturbance process {xi(n)} is formed by independent and identically distributed R-k-valued random vectors, which are observable but their common density rho* is unknown for both players. Combining suitable methods of statistical estimation of rho* with optimization procedures, we construct a pair of average optimal strategies. (C) 2013 Elsevier Inc. AB rights reserved.
引用
收藏
页码:44 / 56
页数:13
相关论文
共 50 条