Stochastic Optimal Control of Unknown Linear Networked Control System using Q-Learning Methodology

被引:0
|
作者
Xu, Hao [1 ]
Jagannathan, S. [1 ]
机构
[1] Missouri Univ Sci & Technol, Dept Elect & Comp Engn, Rolla, MO 65409 USA
关键词
Networked Control System (NCS); Q-function; Adaptive Estimator (AE); Optimal Control;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, the Bellman equation is utilized forward-in-time for the stochastic optimal control of Networked Control System (NCS) with unknown system dynamics in the presence of random delays and packet losses which are unknown. The proposed stochastic optimal control approach, referred normally as adaptive dynamic programming, uses an adaptive estimator (AE) and ideas from Q-learning to solve the infinite horizon optimal regulation control of NCS with unknown system dynamics. Update laws for tuning the unknown parameters of the adaptive estimator (AE) online to obtain the time-based Q-function are derived. Lyapunov theory is used to show that all signals are asymptotically stable (AS) and that the approximated control signals converge to optimal control inputs. Simulation results are included to show the effectiveness of the proposed scheme.
引用
收藏
页码:2819 / 2824
页数:6
相关论文
共 50 条
  • [41] Output Feedback Reinforcement Q-learning for Optimal Quadratic Tracking Control of Unknown Discrete-Time Linear Systems and Its Application
    Zhao, Guangyue
    Sun, Weijie
    Cai, He
    Peng, Yunjian
    2018 15TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV), 2018, : 750 - 755
  • [42] Reinforcement Q-Learning Algorithm for H∞ Tracking Control of Unknown Discrete-Time Linear Systems
    Peng, Yunjian
    Chen, Qian
    Sun, Weijie
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 50 (11): : 4109 - 4122
  • [43] Optimal Trajectory Output Tracking Control with a Q-learning Algorithm
    Vamvoudakis, Kyriakos G.
    2016 AMERICAN CONTROL CONFERENCE (ACC), 2016, : 5752 - 5757
  • [45] Stochastic optimal linear control of wireless networked control systems with delays and packet losses
    Wang, Zhuwei
    Wang, Xiaodong
    Liu, Lihan
    IET CONTROL THEORY AND APPLICATIONS, 2016, 10 (07): : 742 - 751
  • [46] Linear quadratic optimal control method based on output feedback inverse reinforcement Q-learning
    Liu, Wen
    Fan, Jia-Lu
    Xue, Wen-Qian
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2024, 41 (08): : 1469 - 1479
  • [47] Non-linear control based on Q-learning algorithms
    Yang, Dong
    Yin, Chang-Ming
    Chen, Huan-Wen
    Wu, Bo-Sen
    Changsha Dianli Xueyuan Xuebao/Journal of Changsha University of Electric Power, 2003, 18 (01):
  • [48] Optimal Tracking Control of a Nonlinear Multiagent System Using Q-Learning via Event-Triggered Reinforcement Learning
    Wang, Ziwei
    Wang, Xin
    Tang, Yijie
    Liu, Ying
    Hu, Jun
    ENTROPY, 2023, 25 (02)
  • [49] OPTIMAL-CONTROL OF AN UNKNOWN LINEAR PROCESS WITH LEARNING
    KIEFER, NM
    NYARKO, Y
    INTERNATIONAL ECONOMIC REVIEW, 1989, 30 (03) : 571 - 586