On the adaptive control of a class of partially observed Markov decision processes

被引:0
|
作者
Hsu, Shun-Pin [1 ]
Arapostathis, Ari [2 ]
机构
[1] Natl Chung Hsing Univ, Dept Elect Engn, Taichung 402, Taiwan
[2] Univ Texas Austin, Dept Elect & Comp Engn, Austin, TX 78712 USA
关键词
Adaptive control; Markov decision processes; Average-cost optimality; AVERAGE COST CRITERION; CHAINS;
D O I
10.1016/j.jmaa.2011.03.004
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
This paper is concerned with the adaptive control problem, over the infinite horizon, for partially observable Markov decision processes whose transition functions are parameterized by an unknown vector. We treat finite models and impose relatively mild assumptions on the transition function. Provided that a sequence of parameter estimates converging in probability to the true parameter value is available, we show that the certainty equivalence adaptive policy is optimal in the long-run average sense. (C) 2011 Elsevier Inc. All rights reserved.
引用
收藏
页码:1 / 9
页数:9
相关论文
共 50 条
  • [1] On the Adaptive Control of a Class of Partially Observed Markov Decision Processes
    Hsu, Shun-Pin
    Chuang, Dong-Ming
    Arapostathis, Ari
    2009 AMERICAN CONTROL CONFERENCE, VOLS 1-9, 2009, : 5635 - +
  • [2] On the complexity of partially observed Markov decision processes
    Burago, D
    deRougemont, M
    Slissenko, A
    THEORETICAL COMPUTER SCIENCE, 1996, 157 (02) : 161 - 183
  • [3] Bayesian adaptive control of discrete time partially observed Markov processes
    Stettner, L
    STOCHASTIC THEORY AND CONTROL, PROCEEDINGS, 2002, 280 : 435 - 446
  • [4] Experimental Design for Partially Observed Markov Decision Processes
    Thorbergsson, Leifur
    Hooker, Giles
    SIAM-ASA JOURNAL ON UNCERTAINTY QUANTIFICATION, 2018, 6 (02): : 549 - 567
  • [5] Partially observed Markov decision processes with binomial observations
    Ben-Zvi, Tal
    Grosfeld-Nir, Abraham
    OPERATIONS RESEARCH LETTERS, 2013, 41 (02) : 201 - 206
  • [6] PARTIALLY OBSERVED CONTROL OF MARKOV-PROCESSES
    HIJAB, O
    LECTURE NOTES IN CONTROL AND INFORMATION SCIENCES, 1992, 184 : 248 - 255
  • [7] Constrained Partially Observed Markov Decision Processes With Probabilistic Criteria for Adaptive Sequential Detection
    Chen, Richard C.
    Wagner, Kevin
    Blankenship, Gilmer L.
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2013, 58 (02) : 487 - 493
  • [8] PARTIALLY OBSERVED CONTROL OF MARKOV-PROCESSES
    HIJAB, O
    LECTURE NOTES IN CONTROL AND INFORMATION SCIENCES, 1992, 177 : 151 - 158
  • [9] Information Relaxation Bounds for Partially Observed Markov Decision Processes
    Haugh, Martin B.
    Lacedelli, Octavio Ruiz
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2020, 65 (08) : 3256 - 3271
  • [10] Entropy-Regularized Partially Observed Markov Decision Processes
    Molloy, Timothy L.
    Nair, Girish N.
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2024, 69 (01) : 379 - 386