Stochastic Optimal Control of Unknown Linear Networked Control System using Q-Learning Methodology

被引：0

作者：

Xu, Hao ^{[1
]}

Jagannathan, S. ^{[1
]}

机构：

[1] Missouri Univ Sci & Technol, Dept Elect & Comp Engn, Rolla, MO 65409 USA

来源：

2011 AMERICAN CONTROL CONFERENCE | 2011年

关键词：

Networked Control System (NCS); Q-function; Adaptive Estimator (AE); Optimal Control;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, the Bellman equation is utilized forward-in-time for the stochastic optimal control of Networked Control System (NCS) with unknown system dynamics in the presence of random delays and packet losses which are unknown. The proposed stochastic optimal control approach, referred normally as adaptive dynamic programming, uses an adaptive estimator (AE) and ideas from Q-learning to solve the infinite horizon optimal regulation control of NCS with unknown system dynamics. Update laws for tuning the unknown parameters of the adaptive estimator (AE) online to obtain the time-based Q-function are derived. Lyapunov theory is used to show that all signals are asymptotically stable (AS) and that the approximated control signals converge to optimal control inputs. Simulation results are included to show the effectiveness of the proposed scheme.

引用

页码：2819 / 2824

页数：6

共 50 条

[41] Output Feedback Reinforcement Q-learning for Optimal Quadratic Tracking Control of Unknown Discrete-Time Linear Systems and Its Application
Zhao, Guangyue
Sun, Weijie
Cai, He
Peng, Yunjian
2018 15TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV), 2018, : 750 - 755
[42] Reinforcement Q-Learning Algorithm for H∞ Tracking Control of Unknown Discrete-Time Linear Systems
Peng, Yunjian
Chen, Qian
Sun, Weijie
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 50 (11): : 4109 - 4122
[43] Optimal Trajectory Output Tracking Control with a Q-learning Algorithm
Vamvoudakis, Kyriakos G.
2016 AMERICAN CONTROL CONFERENCE (ACC), 2016, : 5752 - 5757
[44] OPTIMAL STOCHASTIC CONTROL OF DISCRETE LINEAR SYSTEMS WITH UNKNOWN GAIN
MURPHY, WJ
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1968, AC13 (04) : 338 - &
[45] Stochastic optimal linear control of wireless networked control systems with delays and packet losses
Wang, Zhuwei
Wang, Xiaodong
Liu, Lihan
IET CONTROL THEORY AND APPLICATIONS, 2016, 10 (07): : 742 - 751
[46] Linear quadratic optimal control method based on output feedback inverse reinforcement Q-learning
Liu, Wen
Fan, Jia-Lu
Xue, Wen-Qian
Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2024, 41 (08): : 1469 - 1479
[47] Non-linear control based on Q-learning algorithms
Yang, Dong
Yin, Chang-Ming
Chen, Huan-Wen
Wu, Bo-Sen
Changsha Dianli Xueyuan Xuebao/Journal of Changsha University of Electric Power, 2003, 18 (01):
[48] Optimal Tracking Control of a Nonlinear Multiagent System Using Q-Learning via Event-Triggered Reinforcement Learning
Wang, Ziwei
Wang, Xin
Tang, Yijie
Liu, Ying
Hu, Jun
ENTROPY, 2023, 25 (02)
[49] OPTIMAL-CONTROL OF AN UNKNOWN LINEAR PROCESS WITH LEARNING
KIEFER, NM
NYARKO, Y
INTERNATIONAL ECONOMIC REVIEW, 1989, 30 (03) : 571 - 586
[50] SUBOPTIMAL CONTROL OF A LINEAR STOCHASTIC SYSTEM WITH UNKNOWN PARAMETERS
PERELMUT.VM
AUTOMATION AND REMOTE CONTROL, 1974, 35 (06) : 875 - 882

← 1 2 3 4 5 →