Mode-matching control policies for multi-mode Markov decision processes

被引:0
|
作者
Ren, ZY [1 ]
Krogh, BH [1 ]
机构
[1] Carnegie Mellon Univ, Dept Elect & Comp Engn, Pittsburgh, PA 15213 USA
关键词
multi-mode Markov decision processes; mode-matching control policy;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider a Markov decision process (MDP) with a two-dimensional state vector, (S, D), where S is interpreted as the system state, and D is interpreted as the probability distribution of the system operating mode, denoted by M. The mode M determines the probability transition and reward structures for S. If the mode were known and constant, a constant-mode optimal controller for controlling the evolution of S could be computed off line. We are interested in knowing if and when the set of constant-mode optimal controllers can be used to control the system effectively when the mode evolves stochastically. We propose a mode-matching control policy under which the controller applied to the system at each epoch is the constant-mode optimal controller for the current most likely mode. We consider the case when the current mode is directly observable that is, D is the trivial distribution as well as the case when only the probability distribution of the current mode is available at each control epoch. Sufficient conditions under which the mode-matching control policies are optimal are derived. We also derive bounds on the performance degradation from the optimum when the non-optimal mode-matching control policies are used. The problem formulation, sufficient conditions and performance bounds are illustrated by a numerical example.
引用
收藏
页码:95 / 100
页数:6
相关论文
共 50 条
  • [1] Switching control in multi-mode Markov decision processes
    Ren, ZY
    Krogh, BH
    [J]. PROCEEDINGS OF THE 40TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-5, 2001, : 2095 - 2101
  • [2] Development of Mode-matching Techniques to Efficiently Model Multi-mode Horns with Non-PEC Walls
    Brennan, Joseph
    Gradziel, Marcin
    Trappe, Neil
    [J]. 2019 PHOTONICS & ELECTROMAGNETICS RESEARCH SYMPOSIUM - SPRING (PIERS-SPRING), 2019, : 2308 - 2316
  • [3] MODE-MATCHING IN TIME DOMAIN
    Legenkiy, M. N.
    Yu, Zheng
    Yu, Butrym A.
    [J]. 2008 4TH INTERNATIONAL CONFERENCE ON ULTRAWIDEBAND AND ULTRASHORT IMPULSE SIGNALS, PROCEEDINGS, 2008, : 225 - 227
  • [4] Application of non-PEC walled mode-matching techniques to a prototype SAFARI M-band multi-mode receiver.
    Brennan, Joseph
    Gradziel, Marcin
    Trappe, Neil
    Ade, Peter
    [J]. 2020 14TH EUROPEAN CONFERENCE ON ANTENNAS AND PROPAGATION (EUCAP 2020), 2020,
  • [5] A mode-matching technique for mode coupling in a gyrotron cavity
    Kasibhotla, V
    McCurdy, AH
    [J]. IEEE TRANSACTIONS ON MICROWAVE THEORY AND TECHNIQUES, 1996, 44 (02) : 225 - 232
  • [6] Multi-mode Model Predictive Control and Estimation for Uncertain Biotechnological Processes
    Morabito, Bruno
    Kienle, Achim
    Findeisen, Rolf
    Carius, Lisa
    [J]. IFAC PAPERSONLINE, 2019, 52 (01): : 709 - 714
  • [7] Mode-matching in vibrating microgyros using extremum seeking control
    Antonello, Riccardo
    Oboe, Roberto
    [J]. IECON 2007: 33RD ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, VOLS 1-3, CONFERENCE PROCEEDINGS, 2007, : 2301 - 2306
  • [8] Analysis of Oversized Sliding Waveguide by Mode Matching and Multi-Mode Network Theory
    K. Ohkubo
    S. Kubo
    H. Idei
    T. Shimozuma
    Y. Yoshimura
    F. Leuterer
    M. Sato
    Y. Takita
    [J]. International Journal of Infrared and Millimeter Waves, 2000, 21 : 1463 - 1477
  • [9] Modeling and monitoring of nonlinear multi-mode processes
    Zhang, Yingwei
    Li, Shuai
    [J]. CONTROL ENGINEERING PRACTICE, 2014, 22 : 194 - 204
  • [10] Pedestrian Localization by Appearance Matching and Multi-mode Filtering
    Wu, Shunguang
    Bansal, Mayank
    Eledath, Jayan
    [J]. 2009 IEEE INTELLIGENT VEHICLES SYMPOSIUM, VOLS 1 AND 2, 2009, : 172 - 178