Mode-matching control policies for multi-mode Markov decision processes

Cited by: 0
Authors
Ren, ZY [1]
Krogh, BH [1]
Affiliations
[1] Carnegie Mellon Univ, Dept Elect & Comp Engn, Pittsburgh, PA 15213 USA
Keywords
multi-mode Markov decision processes; mode-matching control policy
DOI
Not available
Chinese Library Classification
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
We consider a Markov decision process (MDP) with a two-dimensional state vector, (S, D), where S is interpreted as the system state and D is interpreted as the probability distribution of the system operating mode, denoted by M. The mode M determines the transition probabilities and reward structure for S. If the mode were known and constant, a constant-mode optimal controller for the evolution of S could be computed offline. We ask if and when the set of constant-mode optimal controllers can be used to control the system effectively when the mode evolves stochastically. We propose a mode-matching control policy under which the controller applied to the system at each epoch is the constant-mode optimal controller for the current most likely mode. We consider the case in which the current mode is directly observable (that is, D is the trivial distribution) as well as the case in which only the probability distribution of the current mode is available at each control epoch. Sufficient conditions under which the mode-matching control policies are optimal are derived. We also derive bounds on the performance degradation from the optimum when non-optimal mode-matching control policies are used. The problem formulation, sufficient conditions, and performance bounds are illustrated by a numerical example.
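The mode-matching rule described in the abstract can be illustrated with a minimal Python sketch. It is not taken from the paper: the discounted infinite-horizon criterion, the value-iteration solver, the function names `constant_mode_policy` and `mode_matching_action`, and the toy two-mode data are all assumptions made for illustration. The sketch computes one optimal policy per mode offline, then at each epoch applies the policy of the mode with the largest probability in D.

```python
import numpy as np

def constant_mode_policy(P, R, gamma=0.9, tol=1e-8, max_iter=10_000):
    """Value iteration for a single fixed mode (assumed discounted criterion).

    P: array (A, S, S), P[a, s, t] = Pr(next state t | state s, action a)
    R: array (S, A), immediate reward for taking action a in state s
    Returns the greedy policy as an integer array of length S.
    """
    n_states = P.shape[1]
    V = np.zeros(n_states)
    for _ in range(max_iter):
        # Q[s, a] = R[s, a] + gamma * sum_t P[a, s, t] * V[t]
        Q = R + gamma * np.einsum("ast,t->sa", P, V)
        V_new = Q.max(axis=1)
        converged = np.max(np.abs(V_new - V)) < tol
        V = V_new
        if converged:
            break
    Q = R + gamma * np.einsum("ast,t->sa", P, V)
    return Q.argmax(axis=1)

def mode_matching_action(s, d, policies):
    """Apply the constant-mode policy of the currently most likely mode.

    s: current system state index
    d: probability distribution over modes (one-hot if the mode is observable)
    policies: list of per-mode policies from constant_mode_policy
    """
    most_likely_mode = int(np.argmax(d))
    return int(policies[most_likely_mode][s])

# Hypothetical toy problem: 2 modes, 2 states, 2 actions.
# Transitions are uniform in both modes; only the rewards differ.
P = np.full((2, 2, 2), 0.5)
R_mode0 = np.array([[1.0, 0.0], [1.0, 0.0]])  # mode 0 rewards action 0
R_mode1 = np.array([[0.0, 1.0], [0.0, 1.0]])  # mode 1 rewards action 1

# Offline step: one constant-mode optimal policy per mode.
policies = [constant_mode_policy(P, R_mode0), constant_mode_policy(P, R_mode1)]

# Online step: pick the action for state 0 when mode 1 is most likely.
action = mode_matching_action(0, np.array([0.3, 0.7]), policies)
```

When the mode is directly observable, `d` reduces to a one-hot vector and the rule simply selects the policy of the true current mode. The sketch says nothing about when this rule is optimal; that is the subject of the sufficient conditions and bounds mentioned in the abstract.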
Pages: 95-100
Number of pages: 6
Related papers
50 in total
  • [31] MODE-MATCHING METHOD AS APPLIED TO RIDGED WAVEGUIDES
    PAREKH, SV
    [J]. INTERNATIONAL JOURNAL OF ELECTRONICS, 1973, 34 (02) : 285 - 287
  • [32] Design and control of a multi-mode drive system
    Gilbert, JM
    Abu Hassan, AH
    [J]. 1998 5TH INTERNATIONAL WORKSHOP ON ADVANCED MOTION CONTROL - PROCEEDINGS: AMC '98 - COIMBRA, 1998, : 611 - 616
  • [33] Adaptive policies for multi-mode project scheduling under uncertainty
    Godinho, Pedro
    Branco, Fernando G.
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2012, 216 (03) : 553 - 562
  • [34] Multi-mode Switching Control of Networked Control System Based on Hidden semi-Markov Model
    Yan, Ying
    Wang, Likun
    Wang, Yan
    [J]. AUTOMATIC MANUFACTURING SYSTEMS II, PTS 1 AND 2, 2012, 542-543 : 147 - +
  • [35] An improved mode-matching method for large cavities
    Bao, G
    Zhang, WW
    [J]. IEEE ANTENNAS AND WIRELESS PROPAGATION LETTERS, 2005, 4 : 393 - 396
  • [36] Data preprocessing for generalized mode-matching method
    Kirilenko, A
    Kulik, D
    [J]. MATHEMATICAL METHODS IN ELECTROMAGNETIC THEORY, CONFERENCE PROCEEDINGS, VOLS 1 AND 2, 2002, : 535 - 539
  • [37] The multi-mode gyrotron
    Savilov, A. V.
    Glyavin, M. Yu.
    Philippov, V. N.
    [J]. PHYSICS OF PLASMAS, 2011, 18 (10)
  • [38] Optimization of Joint Decision of Transport Mode and Path in Multi-Mode Freight Transportation Network
    Lu, Yang
    Wang, Shuaiqi
    [J]. SENSORS, 2022, 22 (13)
  • [39] Synthesis and analysis of multi-mode profile horn using mode matching technique and evolutionary algorithm
    Dey, Ranajit
    Chakrabarty, Soumyabrata
    Jyoti, Rajeev
    Kurian, Thomas
    [J]. IET MICROWAVES ANTENNAS & PROPAGATION, 2016, 10 (03) : 276 - 282
  • [40] A Multi-mode Waveguide with Mode Selective Effect
    Ning, Ken
    Li, Xiao-Chun
    Mao, Jun-Fa
    [J]. 2019 49TH EUROPEAN MICROWAVE CONFERENCE (EUMC), 2019, : 694 - 697