Online Policy Iteration Algorithm for Semi-Markov Switching State-Space Control Processes

被引：0

作者：

Jiang, Qi ^{[1
]}

Xi, Hong-Sheng ^{[2
]}

Yin, Bao-Qin ^{[2
]}

机构：

[1] Hefei Univ Technol, Dept Automat, Hefei 230009, Peoples R China

[2] Univ Sci & Technol China, Dept Automat, Hefei 230027, Peoples R China

来源：

PROCEEDINGS OF THE 48TH IEEE CONFERENCE ON DECISION AND CONTROL, 2009 HELD JOINTLY WITH THE 2009 28TH CHINESE CONTROL CONFERENCE (CDC/CCC 2009) | 2009年

基金：

中国国家自然科学基金; 中国博士后科学基金;

关键词：

DECISION-PROCESSES; SENSITIVITY-ANALYSIS; OPTIMIZATION; CONVERGENCE; POTENTIALS; SYSTEMS;

D O I：

10.1109/CDC.2009.5400958

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

An event-based online policy iteration algorithm is presented for addressing hierarchical optimization problems. First, an event-driven analytical model with dynamic hierarchy called semi-Markov switching state-space control processes is introduced. Then, by exploiting the structure of dynamic hierarchy and the features of event-driven policy, an online adaptive optimization algorithm that combines potentials estimation and policy iteration is proposed. The convergence of this algorithm is also proved. Finally, as an illustrative example, the dynamic service composition in a service overlay network is formulated and addressed. Simulation results demonstrate the effectiveness of the presented algorithm.

引用

页码：2298 / 2303

页数：6

共 50 条

[1] Event-driven semi-Markov switching state-space control processes
Jiang, Q.
Xi, H. -S.
Yin, B. -Q.
[J]. IET CONTROL THEORY AND APPLICATIONS, 2012, 6 (12): : 1861 - 1869
[2] Optimization of semi-Markov switching state-space control processes for network communication systems
Jiang Qi
Xi Hongsheng
Yin Baoqun
[J]. PROCEEDINGS OF THE 26TH CHINESE CONTROL CONFERENCE, VOL 2, 2007, : 707 - +
[3] SEMI-MARKOV PROCESSES - COUNTABLE STATE-SPACE
PYKE, R
[J]. ANNALS OF MATHEMATICAL STATISTICS, 1960, 31 (01): : 245 - 246
[4] Approximate Policy Iteration for Semi-Markov Control Revisited
Gosavi, Abhijit
[J]. COMPLEX ADAPTIVE SYSTEMS, 2011, 6
[5] AN IMPROVED POLICY ITERATION ALGORITHM FOR SEMI-MARKOV MAINTENANCE PROBLEMS
VALDEZFLORES, C
FELDMAN, RM
[J]. IIE TRANSACTIONS, 1992, 24 (01) : 55 - 63
[6] Online adaptive-optimization algorithm for semi-Markov control processes
Jiang Qi
Xi Hongsheng
Yin Baoqun
[J]. 2006 CHINESE CONTROL CONFERENCE, VOLS 1-5, 2006, : 76 - +
[7] INVARIANCE-PRINCIPLE FOR THE PROCESSES WITH SEMI-MARKOV SWITCH-OVERS WITH AN ARBITRARY STATE-SPACE
SILVESTROV, DS
[J]. LECTURE NOTES IN MATHEMATICS, 1983, 1021 : 617 - 628
[8] The policy iteration algorithm for average reward Markov decision processes with general state space
Meyn, SP
[J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1997, 42 (12) : 1663 - 1680
[9] The optimal robust control policy for uncertain semi-Markov control processes
Tang, H
Xi, HS
Yin, BQ
[J]. INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2005, 36 (13) : 791 - 800
[10] ASYMPTOTIC ENLARGEMENT OF NON-HOMOGENEOUS MARKOV AND SEMI-MARKOV SYSTEMS WITH AN ARBITRARY STATE-SPACE
ANISIMOV, VV
[J]. DOPOVIDI AKADEMII NAUK UKRAINSKOI RSR SERIYA A-FIZIKO-MATEMATICHNI TA TECHNICHNI NAUKI, 1981, (12): : 3 - 6

← 1 2 3 4 5 →