Real-time measurement-driven reinforcement learning control approach for uncertain nonlinear systems

被引:5
|
作者
Abouheaf, Mohamed [1 ]
Boase, Derek [2 ]
Gueaieb, Wail [2 ]
Spinello, Davide [3 ]
Al-Sharhan, Salah [4 ]
机构
[1] Bowling Green State Univ, Coll Technol Architecture & Appl Engn, Bowling Green, OH 43403 USA
[2] Univ Ottawa, Sch Elect Engn & Comp Sci, 800 King Edward Ave, Ottawa, ON K1N 6N5, Canada
[3] Univ Ottawa, Dept Mech Engn, 161 Louis Pasteur, Ottawa, ON K1N 6N5, Canada
[4] Int Univ Sci & Technol, Comp Engn Dept, Ardiya, Kuwait
基金
加拿大自然科学与工程研究理事会;
关键词
Optimal control; Adaptive control; Reinforcement learning; Adaptive critics; Model-reference adaptive systems;
D O I
10.1016/j.engappai.2023.106029
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The paper introduces an interactive machine learning mechanism to process the measurements of an uncertain, nonlinear dynamic process and hence advise an actuation strategy in real-time. For concept demonstration, a trajectory-following optimization problem of a Kinova robotic arm is solved using an integral reinforcement learning approach with guaranteed stability for slowly varying dynamics. The solution is implemented using a model-free value iteration process to solve the integral temporal difference equations of the problem. The performance of the proposed technique is benchmarked against that of another model-free high-order approach and is validated for dynamic payload and disturbances. Unlike its benchmark, the proposed adaptive strategy is capable of handling extreme process variations. This is experimentally demonstrated by introducing static and time-varying payloads close to the rated maximum payload capacity of the manipulator arm. The comparison algorithm exhibited up to a seven-fold percent overshoot compared to the proposed integral reinforcement learning solution. The robustness of the algorithm is further validated by disturbing the real-time adapted strategy gains with a white noise of a standard deviation as high as 5%.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Control Delay in Reinforcement Learning for Real-Time Dynamic Systems: A Memoryless Approach
    Schuitema, Erik
    Busoniu, Lucian
    Babuska, Robert
    Jonker, Pieter
    [J]. IEEE/RSJ 2010 INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2010), 2010, : 3226 - 3231
  • [2] Exploiting Parallel Computing to Control Uncertain Nonlinear Systems in Real-Time
    Condori, J.
    Maghareh, A.
    Orr, J.
    Li, H. -W.
    Montoya, H.
    Dyke, S.
    Gill, C.
    Prakash, A.
    [J]. EXPERIMENTAL TECHNIQUES, 2020, 44 (06) : 735 - 749
  • [3] Exploiting Parallel Computing to Control Uncertain Nonlinear Systems in Real-Time
    J. Condori
    A. Maghareh
    J. Orr
    H.-W. Li
    H. Montoya
    S. Dyke
    C. Gill
    A. Prakash
    [J]. Experimental Techniques, 2020, 44 : 735 - 749
  • [4] Estimation-based approach for real-time optimisation of uncertain nonlinear systems
    Moshksar, E.
    Guay, M.
    [J]. INTERNATIONAL JOURNAL OF CONTROL, 2017, 90 (09) : 2005 - 2019
  • [5] Integration of Adaptive Control and Reinforcement Learning for Real-Time Control and Learning
    Annaswamy, Anuradha M.
    Guha, Anubhav
    Cui, Yingnan
    Tang, Sunbochen
    Fisher, Peter A.
    Gaudio, Joseph E.
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (12) : 7740 - 7755
  • [6] Deep Reinforcement Learning with Uncertain Data for Real-Time Stormwater System Control and Flood Mitigation
    Saliba, Sami M.
    Bowes, Benjamin D.
    Adams, Stephen
    Beling, Peter A.
    Goodall, Jonathan L.
    [J]. WATER, 2020, 12 (11) : 1 - 19
  • [7] Measurement-driven Quality Assessment of Nonlinear Systems by Exponential Replacement
    Stein, Manuel
    Nossek, Josef A.
    Barbe, Kurt
    [J]. 2016 IEEE INTERNATIONAL INSTRUMENTATION AND MEASUREMENT TECHNOLOGY CONFERENCE PROCEEDINGS, 2016, : 258 - 262
  • [8] Adaptive Critic Control With Knowledge Transfer for Uncertain Nonlinear Dynamical Systems: A Reinforcement Learning Approach
    Zhang, Liangju
    Zhang, Kun
    Xie, Xiang Peng
    Chadli, Mohammed
    [J]. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024,
  • [9] A Suboptimal Approach to Real-time Model Predictive Control of Nonlinear Systems
    Graichen, Knut
    Egretzberger, Markus
    Kugi, Andreas
    [J]. AT-AUTOMATISIERUNGSTECHNIK, 2010, 58 (08) : 447 - 456
  • [10] Experience Replay for Real-Time Reinforcement Learning Control
    Adam, Sander
    Busoniu, Lucian
    Babuska, Robert
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2012, 42 (02): : 201 - 212