Quasi-Deterministic Partially Observable Markov Decision Processes

被引：0

作者：

Besse, Camille ^{[1
]}

Chaib-draa, Brahim ^{[1
]}

机构：

[1] Univ Laval, Dept Comp Sci, Quebec City, PQ, Canada

来源：

NEURAL INFORMATION PROCESSING, PT 1, PROCEEDINGS | 2009年 / 5863卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We study a subclass of POMDPs, called quasi-deterministic POMDPs (QDET-POMDPs), characterized by deterministic actions and stochastic observations. While this framework does not model the same general problems as POMDPs, they still capture a number of interesting and challenging problems and, in some cases, have interesting properties. By studying the observability available in this subclass, we show that QDET-POMDPs may fall many steps in the complexity classes of polynomial hierarchy.

引用

页码：237 / 246

页数：10

共 50 条

[1] Learning deterministic policies in partially observable Markov decision processes
Miyazaki, K
Kobayashi, S
[J]. INTELLIGENT AUTONOMOUS SYSTEMS: IAS-5, 1998, : 250 - 257
[2] QUASI-DETERMINISTIC STATES FOR MARKOV SYSTEMS
DAWSON, DA
[J]. ADVANCES IN APPLIED PROBABILITY, 1975, 7 (02) : 231 - 232
[3] Partially Observable Markov Decision Processes and Robotics
Kurniawati, Hanna
[J]. ANNUAL REVIEW OF CONTROL ROBOTICS AND AUTONOMOUS SYSTEMS, 2022, 5 : 253 - 277
[4] A tutorial on partially observable Markov decision processes
Littman, Michael L.
[J]. JOURNAL OF MATHEMATICAL PSYCHOLOGY, 2009, 53 (03) : 119 - 125
[5] Quantum partially observable Markov decision processes
Barry, Jennifer
Barry, Daniel T.
Aaronson, Scott
[J]. PHYSICAL REVIEW A, 2014, 90 (03):
[6] PARTIALLY OBSERVABLE MARKOV DECISION PROCESSES WITH PARTIALLY OBSERVABLE RANDOM DISCOUNT FACTORS
Martinez-Garcia, E. Everardo
Minjarez-Sosa, J. Adolfo
Vega-Amaya, Oscar
[J]. KYBERNETIKA, 2022, 58 (06) : 960 - 983
[7] OPTIMAL CONTROL OF PARTIALLY OBSERVABLE PIECEWISE DETERMINISTIC MARKOV PROCESSES
Bauerle, Nicole
Lange, Dirk
[J]. SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2018, 56 (02) : 1441 - 1462
[8] Active learning in partially observable Markov decision processes
Jaulmes, R
Pineau, J
Precup, D
[J]. MACHINE LEARNING: ECML 2005, PROCEEDINGS, 2005, 3720 : 601 - 608
[9] Structural Estimation of Partially Observable Markov Decision Processes
Chang, Yanling
Garcia, Alfredo
Wang, Zhide
Sun, Lu
[J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (08) : 5135 - 5141
[10] Entropy Maximization for Partially Observable Markov Decision Processes
Savas, Yagiz
Hibbard, Michael
Wu, Bo
Tanaka, Takashi
Topcu, Ufuk
[J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2022, 67 (12) : 6948 - 6955

← 1 2 3 4 5 →