Reinforcement learning and optimal adaptive control: An overview and implementation examples

被引:156
|
作者
Khan, Said G. [1 ]
Herrmann, Guido [2 ,3 ]
Lewis, Frank L. [4 ]
Pipe, Tony [1 ]
Melhuish, Chris [5 ]
机构
[1] Univ W England, Bristol Robot Lab, Bristol BS16 1QY, Avon, England
[2] Univ Bristol, Bristol Robot Lab, Bristol, Avon, England
[3] Univ Bristol, Dept Mech Engn, Bristol, Avon, England
[4] Univ Texas Arlington, Automat & Robot Res Inst, Arlington, TX USA
[5] Univ Bristol, Bristol Robot Lab, Bristol, Avon, England
基金
美国国家科学基金会;
关键词
Reinforcement learning; ADP; Q-learning; Optimal adaptive control; ADP; SYSTEMS;
D O I
10.1016/j.arcontrol.2012.03.004
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper provides an overview of the reinforcement learning and optimal adaptive control literature and its application to robotics. Reinforcement learning is bridging the gap between traditional optimal control, adaptive control and bio-inspired learning techniques borrowed from animals. This work is highlighting some of the key techniques presented by well known researchers from the combined areas of reinforcement learning and optimal control theory. At the end, an example of an implementation of a novel model-free Q-learning based discrete optimal adaptive controller for a humanoid robot arm is presented. The controller uses a novel adaptive dynamic programming (ADP) reinforcement learning (RI) approach to develop an optimal policy on-line. The RI joint space tracking controller was implemented for two links (shoulder flexion and elbow flexion joints) of the arm of the humanoid Bristol-Elumotion-Robotic-Torso II (BERT II) torso. The constrained case (joint limits) of the RL scheme was tested for a single link (elbow flexion) of the BERT II arm by modifying the cost function to deal with the extra nonlinearity due to the joint constraints. (C) 2012 Elsevier Ltd. All rights reserved.
引用
收藏
页码:42 / 59
页数:18
相关论文
共 50 条
  • [21] Data efficient reinforcement learning and adaptive optimal perimeter control of network traffic dynamics
    Chen, C.
    Huang, Y. P.
    Lam, W. H. K.
    Pan, T. L.
    Hsu, S. C.
    Sumalee, A.
    Zhong, R. X.
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2022, 142
  • [22] A reinforcement learning-based scheme for adaptive optimal control of linear stochastic systems
    Wong, Wee Chin
    Lee, Jay H.
    2008 AMERICAN CONTROL CONFERENCE, VOLS 1-12, 2008, : 57 - 62
  • [23] Reinforcement learning based computational adaptive optimal control and system identification for linear systems
    Subbarao, Kamesh
    Nuthi, Pavan
    Atmeh, Ghassan
    ANNUAL REVIEWS IN CONTROL, 2016, 42 : 319 - 331
  • [24] Reinforcement Learning-Based Adaptive Optimal Control for Nonlinear Systems With Asymmetric Hysteresis
    Zheng, Licheng
    Liu, Zhi
    Wang, Yaonan
    Chen, C. L. Philip
    Zhang, Yun
    Wu, Zongze
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (11) : 15800 - 15809
  • [25] Reinforcement learning for adaptive optimal control of continuous-time linear periodic systems
    Pang, Bo
    Jiang, Zhong-Ping
    Mareels, Iven
    AUTOMATICA, 2020, 118
  • [26] Adaptive Optimal Control via Reinforcement Learning for Omni-Directional Wheeled Robots
    Sheikhlar, Arash
    Fakharian, Ahmad
    2016 4TH INTERNATIONAL CONFERENCE ON CONTROL, INSTRUMENTATION, AND AUTOMATION (ICCIA), 2016, : 208 - 213
  • [27] Identification and optimal control of nonlinear systems using recurrent neural networks and reinforcement learning: An overview
    Perrusquia, Adolfo
    Yu, Wen
    NEUROCOMPUTING, 2021, 438 : 145 - 154
  • [28] Training Drift Counteraction Optimal Control Policies Using Reinforcement Learning: An Adaptive Cruise Control Example
    Li, Zhaojian
    Chu, Tianshu
    Kolmanovsky, Ilya, V
    Yin, Xiang
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2018, 19 (09) : 2903 - 2912
  • [29] ADAPTIVE CONTROL BY REINFORCEMENT LEARNING FOR SPACECRAFT ATTITUDE CONTROL
    Ramadan, Mohammad
    Younes, Ahmad Bani
    SPACEFLIGHT MECHANICS 2019, VOL 168, PTS I-IV, 2019, 168 : 1805 - 1815
  • [30] Reinforcement Learning for Model Problems of Optimal Control
    Semenov, S. S.
    Tsurkov, V. I.
    JOURNAL OF COMPUTER AND SYSTEMS SCIENCES INTERNATIONAL, 2023, 62 (03) : 508 - 521