Adaptive policies for time-varying stochastic systems under discounted criterion

被引:9
|
作者
Hilgert, N
Minjárez-Sosa, JA
机构
[1] ENSAM, INRA, Lab Biometrie, F-34060 Montpellier 1, France
[2] Univ Sonora, Dept Matemat, Hermosillo 83000, Sonora, Mexico
关键词
non-homogeneous Markov control processes; discrete-time stochastic systems; discounted cost criterion; optimal adaptive policy;
D O I
10.1007/s001860100170
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
We consider a class of time-varying stochastic control systems, with Borel state and action spaces, and possibly unbounded costs. The processes evolve according to a discrete-time equation x(n+1) = G(n)(x(n), a(n), xi(n)), n = 0, 1,..., where the xi(n) are i.i.d. R-k-valued random vectors whose common density is unknown, and the G, are given functions converging, in a restricted way, to some function Ginfinity as n --> infinity. Assuming observability of xi(n), we construct an adaptive policy which is asymptotically discounted cost optimal for the limiting control system x(n+1) = Ginfinity(x(n), a(n), xi(n)).
引用
收藏
页码:491 / 505
页数:15
相关论文
共 50 条
  • [31] Multicriteria adaptive paths in stochastic, time-varying networks
    Opasanon, Sathaporn
    Miller-Hooks, Elise
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2006, 173 (01) : 72 - 91
  • [32] IDENTIFICATION OF STOCHASTIC TIME-VARYING SYSTEMS.
    Moustafa, K.A.F.
    1600, (13 O):
  • [33] On Stabilization of Ito Stochastic Time-Varying Systems
    Gao Rong
    Zhang Huanshui
    JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY, 2017, 30 (04) : 818 - 827
  • [34] A Popov criterion for systems with slowly time-varying parameters
    Jönsson, U
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1999, 44 (04) : 844 - 846
  • [35] A STABILITY-CRITERION FOR LINEAR TIME-VARYING SYSTEMS
    MORI, T
    FUKUMA, N
    KUWAHARA, M
    INTERNATIONAL JOURNAL OF CONTROL, 1981, 34 (03) : 585 - 591
  • [36] Stability criterion and stabilization of linear time-varying systems
    Tan, Feng
    Duan, Guangren
    PROCEEDINGS OF THE 48TH IEEE CONFERENCE ON DECISION AND CONTROL, 2009 HELD JOINTLY WITH THE 2009 28TH CHINESE CONTROL CONFERENCE (CDC/CCC 2009), 2009, : 3238 - 3243
  • [37] A Popov criterion for systems with slowly time-varying parameters
    Jonsson, U
    PROCEEDINGS OF THE 1997 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 1997, : 2504 - 2505
  • [38] APPROXIMATION, ESTIMATION AND CONTROL OF STOCHASTIC SYSTEMS UNDER A RANDOMIZED DISCOUNTED COST CRITERION
    Gonzalez-Hernandez, Juan
    Lopez-Martinez, Raquiel R.
    Adolfo Minjarez-Sosa, J.
    KYBERNETIKA, 2009, 45 (05) : 737 - 754
  • [39] Infinite horizon production scheduling in time-varying systems under stochastic demand
    Cheevaprawatdomrong, T
    Smith, RL
    OPERATIONS RESEARCH, 2004, 52 (01) : 105 - 115
  • [40] A simple algebraic criterion for stability of Bilateral Teleoperation Systems under time-varying delays
    de Lima, Matheus, V
    Mozelli, Leonardo A.
    Alves Neto, Armando
    Souza, Fernando O.
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2020, 137