Real-time measurement-driven reinforcement learning control approach for uncertain nonlinear systems

被引：5

作者：

Abouheaf, Mohamed ^{[1
]}

Boase, Derek ^{[2
]}

Gueaieb, Wail ^{[2
]}

Spinello, Davide ^{[3
]}

Al-Sharhan, Salah ^{[4
]}

机构：

[1] Bowling Green State Univ, Coll Technol Architecture & Appl Engn, Bowling Green, OH 43403 USA

[2] Univ Ottawa, Sch Elect Engn & Comp Sci, 800 King Edward Ave, Ottawa, ON K1N 6N5, Canada

[3] Univ Ottawa, Dept Mech Engn, 161 Louis Pasteur, Ottawa, ON K1N 6N5, Canada

[4] Int Univ Sci & Technol, Comp Engn Dept, Ardiya, Kuwait

来源：

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE | 2023年 / 122卷

基金：

加拿大自然科学与工程研究理事会;

关键词：

Optimal control; Adaptive control; Reinforcement learning; Adaptive critics; Model-reference adaptive systems;

D O I：

10.1016/j.engappai.2023.106029

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The paper introduces an interactive machine learning mechanism to process the measurements of an uncertain, nonlinear dynamic process and hence advise an actuation strategy in real-time. For concept demonstration, a trajectory-following optimization problem of a Kinova robotic arm is solved using an integral reinforcement learning approach with guaranteed stability for slowly varying dynamics. The solution is implemented using a model-free value iteration process to solve the integral temporal difference equations of the problem. The performance of the proposed technique is benchmarked against that of another model-free high-order approach and is validated for dynamic payload and disturbances. Unlike its benchmark, the proposed adaptive strategy is capable of handling extreme process variations. This is experimentally demonstrated by introducing static and time-varying payloads close to the rated maximum payload capacity of the manipulator arm. The comparison algorithm exhibited up to a seven-fold percent overshoot compared to the proposed integral reinforcement learning solution. The robustness of the algorithm is further validated by disturbing the real-time adapted strategy gains with a white noise of a standard deviation as high as 5%.

引用

页数：18

共 50 条

[31] Finite-horizon optimal control for continuous-time uncertain nonlinear systems using reinforcement learning
Zhao, Jingang
Gan, Minggang
[J]. INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2020, 51 (13) : 2429 - 2440
[32] Adaptive Fixed-Time Optimal Formation Control for Uncertain Nonlinear Multiagent Systems Using Reinforcement Learning
Wang, Ping
Yu, Chengpu
Lv, Maolong
Cao, Jinde
[J]. IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2024, 11 (02): : 1729 - 1743
[33] Iterative learning control schemes for a class of nonlinear systems: Theory and real-time implementation
Ibrir, Salim
Ramlal, Craig
[J]. 2014 12TH IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS (INDIN), 2014, : 338 - +
[34] Real-time scheduling for a smart factory using a reinforcement learning approach
Shiue, Yeou-Ren
Lee, Ken-Chuan
Su, Chao-Ton
[J]. COMPUTERS & INDUSTRIAL ENGINEERING, 2018, 125 : 604 - 614
[35] Real-time scheduling of multiple uncertain receding horizon control systems
Gholami, Behnood
Gordon, Brandon W.
Rabbath, C. A.
[J]. OPTIMAL CONTROL APPLICATIONS & METHODS, 2009, 30 (02): : 179 - 195
[36] Considerations of Reinforcement Learning within Real-Time Wireless Communication Systems
Jones, Alyse M.
Headley, William C.
[J]. 2022 IEEE MILITARY COMMUNICATIONS CONFERENCE (MILCOM), 2022,
[37] Hybrid DVFS Scheduling for Real-Time Systems Based on Reinforcement Learning
Muhammad, Fakhruddin
ul Islam, Mahbub
Lin, Man
[J]. IEEE SYSTEMS JOURNAL, 2017, 11 (02): : 931 - 940
[38] Real-time security margin control using deep reinforcement learning
Hagmar, Hannes
Eriksson, Robert
Tuan, Le Anh
[J]. ENERGY AND AI, 2023, 13
[39] Adaptive Runtime Response Time Control in PLC-based Real-Time Systems using Reinforcement Learning
Moghadam, Mahshid Helali
Saadatmand, Mehrdad
Borg, Markus
Bohlin, Markus
Lisper, Bjorn
[J]. 2018 IEEE/ACM 13TH INTERNATIONAL SYMPOSIUM ON SOFTWARE ENGINEERING FOR ADAPTIVE AND SELF-MANAGING SYSTEMS (SEAMS), 2018, : 217 - 223
[40] Reinforcement learning to achieve real-time control of triple inverted pendulum
Baek, Jongchan
Lee, Changhyeon
Lee, Young Sam
Jeon, Soo
Han, Soohee
[J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 128

← 1 2 3 4 5 →