Relaxation for Constrained Decentralized Markov Decision Processes

被引:0
|
作者
Xu, Jie [1 ]
机构
[1] Univ Miami, 1251 Mem Dr, Coral Gables, FL 33146 USA
关键词
decentralized MDP; Lagrangian relaxation; COMPLEXITY;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper studies a class of decentralized multi-agent stochastic optimization problems. In these problems, each agent has only a partial view of the world state, and a partial control of the actions but must cooperatively maximize the long-term system reward. The state that an agent observe consists of two parts-a common public component and an agent-specific private component. Importantly, taking actions incurs costs and the actions that the agents can take are subject to an overall cost constraint in each interaction period. We formulate this problem as an infinite time horizon Decentralized Markov Decision Process (DEC-MDP) with resource constraints and develop efficient approximate algorithms that allow decentralized computation of the agent policy based on Lagrangian relaxation.
引用
收藏
页码:1313 / 1314
页数:2
相关论文
共 50 条
  • [1] On constrained Markov decision processes
    Haviv, M
    [J]. OPERATIONS RESEARCH LETTERS, 1996, 19 (01) : 25 - 28
  • [2] Learning in Constrained Markov Decision Processes
    Singh, Rahul
    Gupta, Abhishek
    Shroff, Ness B.
    [J]. IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2023, 10 (01): : 441 - 453
  • [3] The complexity of decentralized control of Markov decision processes
    Bernstein, DS
    Givan, R
    Immerman, N
    Zilberstein, S
    [J]. MATHEMATICS OF OPERATIONS RESEARCH, 2002, 27 (04) : 819 - 840
  • [4] Dynamic programming in constrained Markov decision processes
    Piunovskiy, A. B.
    [J]. CONTROL AND CYBERNETICS, 2006, 35 (03): : 645 - 660
  • [5] Robustness of policies in constrained Markov decision processes
    Zadorojniy, A
    Shwartz, A
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2006, 51 (04) : 635 - 638
  • [6] Markov decision processes with constrained stopping times
    Horiguchi, M
    Kurano, M
    Yasuda, M
    [J]. PROCEEDINGS OF THE 39TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-5, 2000, : 706 - 710
  • [7] Reinforcement Learning for Constrained Markov Decision Processes
    Gattami, Ather
    Bai, Qinbo
    Aggarwal, Vaneet
    [J]. 24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
  • [8] Risk-Constrained Markov Decision Processes
    Borkar, Vivek
    Jain, Rahul
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2014, 59 (09) : 2574 - 2579
  • [9] Risk-constrained Markov Decision Processes
    Borkar, Vivek
    Jain, Rahul
    [J]. 49TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2010, : 2664 - 2669
  • [10] Entropy Maximization for Constrained Markov Decision Processes
    Savas, Yagiz
    Ornik, Melkior
    Cubuktepe, Murat
    Topcu, Ufuk
    [J]. 2018 56TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2018, : 911 - 918