Online Policy Iteration Algorithm for Semi-Markov Switching State-Space Control Processes

被引:0
|
作者
Jiang, Qi [1 ]
Xi, Hong-Sheng [2 ]
Yin, Bao-Qin [2 ]
机构
[1] Hefei Univ Technol, Dept Automat, Hefei 230009, Peoples R China
[2] Univ Sci & Technol China, Dept Automat, Hefei 230027, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
DECISION-PROCESSES; SENSITIVITY-ANALYSIS; OPTIMIZATION; CONVERGENCE; POTENTIALS; SYSTEMS;
D O I
10.1109/CDC.2009.5400958
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
An event-based online policy iteration algorithm is presented for addressing hierarchical optimization problems. First, an event-driven analytical model with dynamic hierarchy called semi-Markov switching state-space control processes is introduced. Then, by exploiting the structure of dynamic hierarchy and the features of event-driven policy, an online adaptive optimization algorithm that combines potentials estimation and policy iteration is proposed. The convergence of this algorithm is also proved. Finally, as an illustrative example, the dynamic service composition in a service overlay network is formulated and addressed. Simulation results demonstrate the effectiveness of the presented algorithm.
引用
收藏
页码:2298 / 2303
页数:6
相关论文
共 50 条