Real-time measurement-driven reinforcement learning control approach for uncertain nonlinear systems

被引:5
|
作者
Abouheaf, Mohamed [1 ]
Boase, Derek [2 ]
Gueaieb, Wail [2 ]
Spinello, Davide [3 ]
Al-Sharhan, Salah [4 ]
机构
[1] Bowling Green State Univ, Coll Technol Architecture & Appl Engn, Bowling Green, OH 43403 USA
[2] Univ Ottawa, Sch Elect Engn & Comp Sci, 800 King Edward Ave, Ottawa, ON K1N 6N5, Canada
[3] Univ Ottawa, Dept Mech Engn, 161 Louis Pasteur, Ottawa, ON K1N 6N5, Canada
[4] Int Univ Sci & Technol, Comp Engn Dept, Ardiya, Kuwait
基金
加拿大自然科学与工程研究理事会;
关键词
Optimal control; Adaptive control; Reinforcement learning; Adaptive critics; Model-reference adaptive systems;
D O I
10.1016/j.engappai.2023.106029
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The paper introduces an interactive machine learning mechanism to process the measurements of an uncertain, nonlinear dynamic process and hence advise an actuation strategy in real-time. For concept demonstration, a trajectory-following optimization problem of a Kinova robotic arm is solved using an integral reinforcement learning approach with guaranteed stability for slowly varying dynamics. The solution is implemented using a model-free value iteration process to solve the integral temporal difference equations of the problem. The performance of the proposed technique is benchmarked against that of another model-free high-order approach and is validated for dynamic payload and disturbances. Unlike its benchmark, the proposed adaptive strategy is capable of handling extreme process variations. This is experimentally demonstrated by introducing static and time-varying payloads close to the rated maximum payload capacity of the manipulator arm. The comparison algorithm exhibited up to a seven-fold percent overshoot compared to the proposed integral reinforcement learning solution. The robustness of the algorithm is further validated by disturbing the real-time adapted strategy gains with a white noise of a standard deviation as high as 5%.
引用
收藏
页数:18
相关论文
共 50 条
  • [31] Finite-horizon optimal control for continuous-time uncertain nonlinear systems using reinforcement learning
    Zhao, Jingang
    Gan, Minggang
    [J]. INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2020, 51 (13) : 2429 - 2440
  • [32] Adaptive Fixed-Time Optimal Formation Control for Uncertain Nonlinear Multiagent Systems Using Reinforcement Learning
    Wang, Ping
    Yu, Chengpu
    Lv, Maolong
    Cao, Jinde
    [J]. IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2024, 11 (02): : 1729 - 1743
  • [33] Iterative learning control schemes for a class of nonlinear systems: Theory and real-time implementation
    Ibrir, Salim
    Ramlal, Craig
    [J]. 2014 12TH IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS (INDIN), 2014, : 338 - +
  • [34] Real-time scheduling for a smart factory using a reinforcement learning approach
    Shiue, Yeou-Ren
    Lee, Ken-Chuan
    Su, Chao-Ton
    [J]. COMPUTERS & INDUSTRIAL ENGINEERING, 2018, 125 : 604 - 614
  • [35] Real-time scheduling of multiple uncertain receding horizon control systems
    Gholami, Behnood
    Gordon, Brandon W.
    Rabbath, C. A.
    [J]. OPTIMAL CONTROL APPLICATIONS & METHODS, 2009, 30 (02): : 179 - 195
  • [36] Considerations of Reinforcement Learning within Real-Time Wireless Communication Systems
    Jones, Alyse M.
    Headley, William C.
    [J]. 2022 IEEE MILITARY COMMUNICATIONS CONFERENCE (MILCOM), 2022,
  • [37] Hybrid DVFS Scheduling for Real-Time Systems Based on Reinforcement Learning
    Muhammad, Fakhruddin
    ul Islam, Mahbub
    Lin, Man
    [J]. IEEE SYSTEMS JOURNAL, 2017, 11 (02): : 931 - 940
  • [38] Real-time security margin control using deep reinforcement learning
    Hagmar, Hannes
    Eriksson, Robert
    Tuan, Le Anh
    [J]. ENERGY AND AI, 2023, 13
  • [39] Adaptive Runtime Response Time Control in PLC-based Real-Time Systems using Reinforcement Learning
    Moghadam, Mahshid Helali
    Saadatmand, Mehrdad
    Borg, Markus
    Bohlin, Markus
    Lisper, Bjorn
    [J]. 2018 IEEE/ACM 13TH INTERNATIONAL SYMPOSIUM ON SOFTWARE ENGINEERING FOR ADAPTIVE AND SELF-MANAGING SYSTEMS (SEAMS), 2018, : 217 - 223
  • [40] Reinforcement learning to achieve real-time control of triple inverted pendulum
    Baek, Jongchan
    Lee, Changhyeon
    Lee, Young Sam
    Jeon, Soo
    Han, Soohee
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 128