An Argument for the Bayesian Control of Partially Observable Markov Decision Processes

Cited by: 5

Authors:

Vargo, Erik [1]

Cogill, Randy [1]

Affiliations:

[1] Univ Virginia, Dept Syst & Informat Engn, Charlottesville, VA 22903 USA

Source:

IEEE TRANSACTIONS ON AUTOMATIC CONTROL | 2014 / Vol. 59 / No. 10

Funding:

National Science Foundation (USA);

Keywords:

Adaptive control; Markov processes; stochastic optimal control; uncertain systems;

DOI:

10.1109/TAC.2014.2314527

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

This technical note concerns the control of partially observable Markov decision processes characterized by a prior distribution over the underlying hidden Markov model parameters. In such instances, the control problem is commonly simplified by first choosing a point estimate from the model prior, and then selecting the control policy that is optimal with respect to the point estimate. Our contribution is to demonstrate, through a tractable yet nontrivial example, that even the best control policies constructed in this manner can significantly underperform the Bayes optimal policy. While this is an operative assumption in the Bayes-adaptive Markov decision process literature, to our knowledge no such illustrative example has been formally proposed.
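
The gap between a point-estimate policy and the Bayes optimal policy can be illustrated with a minimal sketch. This is not the example from the technical note; it is a hypothetical two-model problem chosen only because it can be solved exactly. The assumptions are: the hidden parameter `theta` is 0 or 1, the prior puts probability `p0` on `theta = 0`, the reward is 1 when the action matches `theta` and 0 otherwise, and the reward is observed after every step. All function names are illustrative.

```python
from fractions import Fraction
from functools import lru_cache

ACTIONS = (0, 1)


def reward(theta, action):
    # Hypothetical reward: 1 if the action matches the hidden parameter.
    return 1 if action == theta else 0


def point_estimate_value(p0, horizon):
    """Best expected return among policies that commit to one estimate
    theta_hat and always play the action that is optimal for it."""
    best = Fraction(0)
    for theta_hat in (0, 1):
        action = theta_hat  # optimal action if theta_hat were the truth
        per_step = p0 * reward(0, action) + (1 - p0) * reward(1, action)
        best = max(best, per_step * horizon)
    return best


@lru_cache(maxsize=None)
def bayes_value(p0, steps_left):
    """Bayes optimal expected return, by dynamic programming over the
    belief p0 = P(theta = 0)."""
    if steps_left == 0:
        return Fraction(0)
    best = Fraction(0)
    for action in ACTIONS:
        total = Fraction(0)
        for theta, prob in ((0, p0), (1, 1 - p0)):
            if prob == 0:
                continue
            # In this toy model the observed reward identifies theta,
            # so the posterior is a point mass on the true value.
            posterior = Fraction(1) if theta == 0 else Fraction(0)
            total += prob * (reward(theta, action)
                             + bayes_value(posterior, steps_left - 1))
        best = max(best, total)
    return best


if __name__ == "__main__":
    prior = Fraction(1, 2)
    print(point_estimate_value(prior, 10))  # 5
    print(bayes_value(prior, 10))           # 19/2
```

Under these assumptions, with a uniform prior and a horizon of 10 steps, the best point-estimate policy earns 5 in expectation, while the Bayes optimal policy earns 9.5 because it uses the first observation to resolve the model uncertainty.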


Pages: 2796-2800

Page count: 5

50 items in total

[1] Decentralized Control of Partially Observable Markov Decision Processes
Amato, Christopher
Chowdhary, Girish
Geramifard, Alborz
Uere, N. Kemal
Kochenderfer, Mykel J.
[J]. 2013 IEEE 52ND ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2013 : 2398 - 2405
[2] A Bayesian Approach for Learning and Planning in Partially Observable Markov Decision Processes
Ross, Stephane
Pineau, Joelle
Chaib-draa, Brahim
Kreitmann, Pierre
[J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2011, 12 : 1729 - 1770
[3] Partially Observable Markov Decision Processes and Robotics
Kurniawati, Hanna
[J]. ANNUAL REVIEW OF CONTROL ROBOTICS AND AUTONOMOUS SYSTEMS, 2022, 5 : 253 - 277
[4] A tutorial on partially observable Markov decision processes
Littman, Michael L.
[J]. JOURNAL OF MATHEMATICAL PSYCHOLOGY, 2009, 53 (03) : 119 - 125
[5] Quantum partially observable Markov decision processes
Barry, Jennifer
Barry, Daniel T.
Aaronson, Scott
[J]. PHYSICAL REVIEW A, 2014, 90 (03)
[6] PARTIALLY OBSERVABLE MARKOV DECISION PROCESSES WITH PARTIALLY OBSERVABLE RANDOM DISCOUNT FACTORS
Martinez-Garcia, E. Everardo
Minjarez-Sosa, J. Adolfo
Vega-Amaya, Oscar
[J]. KYBERNETIKA, 2022, 58 (06) : 960 - 983
[7] Recursive learning automata for control of partially observable Markov decision processes
Chang, Hyeong Soo
Fu, Michael C.
Marcus, Steven I.
[J]. 2005 44TH IEEE CONFERENCE ON DECISION AND CONTROL & EUROPEAN CONTROL CONFERENCE, VOLS 1-8, 2005 : 6091 - 6096
[8] Active learning in partially observable Markov decision processes
Jaulmes, R
Pineau, J
Precup, D
[J]. MACHINE LEARNING: ECML 2005, PROCEEDINGS, 2005, 3720 : 601 - 608
[9] Structural Estimation of Partially Observable Markov Decision Processes
Chang, Yanling
Garcia, Alfredo
Wang, Zhide
Sun, Lu
[J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (08) : 5135 - 5141
[10] Entropy Maximization for Partially Observable Markov Decision Processes
Savas, Yagiz
Hibbard, Michael
Wu, Bo
Tanaka, Takashi
Topcu, Ufuk
[J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2022, 67 (12) : 6948 - 6955
