Data-Driven Optimal Controller Design for Maglev Train: Q-Learning Method

被引:1
|
作者
Xin, Liang [1 ]
Jiang, Hongwei [2 ]
Wen, Tao [1 ]
Long, Zhiqiang [1 ]
机构
[1] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha 410073, Peoples R China
[2] CRRC Zhuzhou Locomot Co Ltd, Zhuzhou 412001, Hunan, Peoples R China
基金
国家重点研发计划;
关键词
Maglev train; Data-Ddriven Optimal Controller; Q-learning; TRACKING CONTROL; REINFORCEMENT; SYSTEMS;
D O I
10.1109/CCDC55256.2022.10033516
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The maglev train is an open-loop and unstable complex nonlinear system. Generally, design of offline controllers based on a single operating state. However, the system of maglev train will influence by various complex factors in actual operation. When the system model changes, the controller is designed and tuned offline will suffer severe performance degradation that will affect the system's stability. This paper proposes a Data-Ddriven Optimal Controller (DDOC) based on the Q-learning theory in reinforcement learning in response to this problem. The controller does not need to know the model information of the controlled object, only calculates iteratively based on the system's real-time input, output data, which has the advantages of fewer tuning parameters and fast convergence speed. For the problem that system model change during operation, the method proposed in this paper makes the system accurately track the given reference signal by dynamically and rapidly changing the parameters of feedback gain matrix through the real-time data of the system, thus ensuring the stability and reliability of the control system.
引用
收藏
页码:1289 / 1294
页数:6
相关论文
共 50 条
  • [41] Data-driven H∞ Optimal Controller Design using the Koopman Operator: Case Study
    Ganz, Felix
    Datar, Adwait
    Goettsch, Patrick
    Werner, Herbert
    2021 EUROPEAN CONTROL CONFERENCE (ECC), 2021, : 594 - 599
  • [42] A Class of Data-driven Based Controller Design and Its Parameters Tuning Method
    Wang Weihong
    Hou Zhongsheng
    Huo Haibo
    Jin Shangtai
    PROCEEDINGS OF THE 29TH CHINESE CONTROL CONFERENCE, 2010, : 5090 - 5095
  • [43] Design of a Data-driven GMV Controller Using the Nelder-Mead Method
    Shi, LiYing
    Guan, Zhe
    Yamamoto, Toru
    PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON ARTIFICIAL LIFE AND ROBOTICS (ICAROB 2021), 2021, : P60 - P60
  • [44] Optimal batch trajectory design based on an intelligent data-driven method
    Chen, JH
    Sheui, RG
    INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2003, 42 (07) : 1363 - 1378
  • [45] Explainable data-driven Q-learning control for a class of discrete-time linear autonomous systems
    Perrusquia, Adolfo
    Zou, Mengbang
    Guo, Weisi
    INFORMATION SCIENCES, 2024, 682
  • [46] Design of a Data-driven Predictive-PI Controller
    Ashida, Yoichiro
    Wakitani, Shin
    Yamamoto, Toru
    ICAROB 2019: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON ARTIFICIAL LIFE AND ROBOTICS, 2019, : 451 - 454
  • [47] Data-Driven LPV Controller Design for Islanded Microgrids
    Madani, Seyed Sohail
    Karimi, Alireza
    IFAC PAPERSONLINE, 2021, 54 (07): : 433 - 438
  • [48] Reactive fuzzy controller design by Q-learning for mobile robot navigation
    张文志
    吕恬生
    Journal of Harbin Institute of Technology, 2005, (03) : 319 - 324
  • [49] Data-Driven Fuzzy Controller Design for Hypersonic Vehicle
    Bai, Jian-Ming
    Zhao, Guang-She
    Rong, Hai-Jun
    IEEE 20TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS / IEEE 16TH INTERNATIONAL CONFERENCE ON SMART CITY / IEEE 4TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS), 2018, : 1698 - 1703
  • [50] Reactive fuzzy controller design by Q-learning for mobile robot navigation
    Zhang, Wen-Zhi
    Lu, Tian-Sheng
    Journal of Harbin Institute of Technology (New Series), 2005, 12 (03) : 319 - 324