Optimization in distributed controlled Markov chains

被引：0

作者：

Wang, JJ ^{[1
]}

Cao, XR ^{[1
]}

机构：

[1] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China

来源：

1998 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5 | 1998年

关键词：

D O I：

暂无

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

The performance potential theory has proved to be a promising tool in optimizing the infinite-horizon Markov decision problem (MDP) [1]-[2] So far, the research in this area is implicitly focused on a simple system with a single controller In this paper, we consider the distributed controlled Markov chain, where the system consists of several individual control units and it evolves under the combined control of these nodes. Motivated by practical background, we investigate a structure of MDP with event-dependent decisions. We explore a notion of expanded Markov chain to map this problem to a traditional MDP model. In particular, we address ourselves to the complexity-reduction techniques to deal with the enlarged state space. For the distributed system where a particular node can only access partial system information, we develop some algorithms for decentralized potential estimation and policy iteration.

引用

页码：2501 / 2506

页数：6

共 50 条

[1] Algorithms for optimization and stabilization of controlled Markov chains
Sean Meyn
[J]. Sadhana, 1999, 24 : 339 - 367
[2] Algorithms for optimization and stabilization of controlled Markov chains
Meyn, S
[J]. SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 1999, 24 (4-5): : 339 - 367
[3] Distributed Markov Chains
Saha, Ratul
Esparza, Javier
Jha, Sumit Kumar
Mukund, Madhavan
Thiagarajan, P. S.
[J]. VERIFICATION, MODEL CHECKING, AND ABSTRACT INTERPRETATION (VMCAI 2015), 2015, 8931 : 117 - 134
[4] CONTROLLED MARKOV CHAINS
KESTEN, H
SPITZER, F
[J]. ANNALS OF PROBABILITY, 1975, 3 (01): : 32 - 40
[5] A Theory of Distributed Markov Chains
Thiagarajan, P. S.
Yangt, Shaofa
[J]. FUNDAMENTA INFORMATICAE, 2020, 175 (1-4) : 301 - 325
[6] A Class of Distributed Markov Chains
Thiagarajan, P. S.
[J]. ELECTRONIC PROCEEDINGS IN THEORETICAL COMPUTER SCIENCE, 2014, (156): : 3 - 3
[7] CONTROLLED MARKOV-CHAINS WITH CONSTRAINTS
BORKAR, VS
[J]. SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 1990, 15 : 405 - 413
[8] Controlled Markov chains with utility functions
Iwamoto, S
Ueno, T
Fujita, T
[J]. MARKOV PROCESSES AND CONTROLLED MARKOV CHAINS, 2002, : 135 - 149
[9] Controlled Markov chains and safety criteria
Arapostathis, A
Kumar, R
Tangirala, S
[J]. PROCEEDINGS OF THE 40TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-5, 2001, : 1675 - 1680
[10] Distributed Averaging Via Lifted Markov Chains
Jung, Kyomin
Shah, Devavrat
Shin, Jinwoo
[J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 2010, 56 (01) : 634 - 647

← 1 2 3 4 5 →