Optimization in distributed controlled Markov chains

被引:0
|
作者
Wang, JJ [1 ]
Cao, XR [1 ]
机构
[1] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The performance potential theory has proved to be a promising tool in optimizing the infinite-horizon Markov decision problem (MDP) [1]-[2] So far, the research in this area is implicitly focused on a simple system with a single controller In this paper, we consider the distributed controlled Markov chain, where the system consists of several individual control units and it evolves under the combined control of these nodes. Motivated by practical background, we investigate a structure of MDP with event-dependent decisions. We explore a notion of expanded Markov chain to map this problem to a traditional MDP model. In particular, we address ourselves to the complexity-reduction techniques to deal with the enlarged state space. For the distributed system where a particular node can only access partial system information, we develop some algorithms for decentralized potential estimation and policy iteration.
引用
收藏
页码:2501 / 2506
页数:6
相关论文
共 50 条
  • [1] Algorithms for optimization and stabilization of controlled Markov chains
    Sean Meyn
    [J]. Sadhana, 1999, 24 : 339 - 367
  • [2] Algorithms for optimization and stabilization of controlled Markov chains
    Meyn, S
    [J]. SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 1999, 24 (4-5): : 339 - 367
  • [3] Distributed Markov Chains
    Saha, Ratul
    Esparza, Javier
    Jha, Sumit Kumar
    Mukund, Madhavan
    Thiagarajan, P. S.
    [J]. VERIFICATION, MODEL CHECKING, AND ABSTRACT INTERPRETATION (VMCAI 2015), 2015, 8931 : 117 - 134
  • [4] CONTROLLED MARKOV CHAINS
    KESTEN, H
    SPITZER, F
    [J]. ANNALS OF PROBABILITY, 1975, 3 (01): : 32 - 40
  • [5] A Theory of Distributed Markov Chains
    Thiagarajan, P. S.
    Yangt, Shaofa
    [J]. FUNDAMENTA INFORMATICAE, 2020, 175 (1-4) : 301 - 325
  • [6] A Class of Distributed Markov Chains
    Thiagarajan, P. S.
    [J]. ELECTRONIC PROCEEDINGS IN THEORETICAL COMPUTER SCIENCE, 2014, (156): : 3 - 3
  • [7] CONTROLLED MARKOV-CHAINS WITH CONSTRAINTS
    BORKAR, VS
    [J]. SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 1990, 15 : 405 - 413
  • [8] Controlled Markov chains with utility functions
    Iwamoto, S
    Ueno, T
    Fujita, T
    [J]. MARKOV PROCESSES AND CONTROLLED MARKOV CHAINS, 2002, : 135 - 149
  • [9] Controlled Markov chains and safety criteria
    Arapostathis, A
    Kumar, R
    Tangirala, S
    [J]. PROCEEDINGS OF THE 40TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-5, 2001, : 1675 - 1680
  • [10] Distributed Averaging Via Lifted Markov Chains
    Jung, Kyomin
    Shah, Devavrat
    Shin, Jinwoo
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 2010, 56 (01) : 634 - 647