Reinforcement learning and optimal adaptive control: An overview and implementation examples

Cited by: 156
Authors
Khan, Said G. [1]
Herrmann, Guido [2, 3]
Lewis, Frank L. [4]
Pipe, Tony [1]
Melhuish, Chris [5]
Affiliations
[1] Univ W England, Bristol Robot Lab, Bristol BS16 1QY, Avon, England
[2] Univ Bristol, Bristol Robot Lab, Bristol, Avon, England
[3] Univ Bristol, Dept Mech Engn, Bristol, Avon, England
[4] Univ Texas Arlington, Automat & Robot Res Inst, Arlington, TX USA
[5] Univ Bristol, Bristol Robot Lab, Bristol, Avon, England
Funding
National Science Foundation (USA)
Keywords
Reinforcement learning; ADP; Q-learning; Optimal adaptive control; SYSTEMS
DOI
10.1016/j.arcontrol.2012.03.004
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This paper provides an overview of the reinforcement learning and optimal adaptive control literature and its application to robotics. Reinforcement learning bridges the gap between traditional optimal control, adaptive control, and bio-inspired learning techniques borrowed from animals. This work highlights some of the key techniques presented by well-known researchers from the combined areas of reinforcement learning and optimal control theory. Finally, an implementation example is presented: a novel model-free, Q-learning-based discrete optimal adaptive controller for a humanoid robot arm. The controller uses a novel adaptive dynamic programming (ADP) reinforcement learning (RL) approach to develop an optimal policy online. The RL joint-space tracking controller was implemented for two links (shoulder flexion and elbow flexion joints) of the arm of the humanoid Bristol-Elumotion-Robotic-Torso II (BERT II) torso. The constrained case (joint limits) of the RL scheme was tested for a single link (elbow flexion) of the BERT II arm by modifying the cost function to deal with the extra nonlinearity due to the joint constraints. (C) 2012 Elsevier Ltd. All rights reserved.
Pages: 42-59
Page count: 18
Related papers
50 in total
  • [41] Optimality and convergence of adaptive optimal control by reinforcement synthesis
    Lin, Wei-Song
    AUTOMATICA, 2011, 47 (05) : 1047 - 1052
  • [42] Adaptive Optimal Consensus Control of Multiagent Systems With Unknown Dynamics and Disturbances via Reinforcement Learning
    Chen L.
    Dong C.
    Dai S.-L.
    IEEE Transactions on Artificial Intelligence, 2024, 5 (05): 2193 - 2203
  • [43] Adaptive Duty Cycle Control for Optimal Battery Energy Storage System Charging by Reinforcement Learning
    Wiencek, Richard
    Ghosh, Sagnika
    2023 IEEE CONFERENCE ON ARTIFICIAL INTELLIGENCE, CAI, 2023: 8 - 10
  • [44] Reinforcement-Learning-Based Adaptive Optimal Flight Control with Output Feedback and Input Constraints
    Sun, Bo
    van Kampen, Erik-Jan
    JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2021, 44 (09) : 1685 - 1691
  • [45] Resilient adaptive optimal control of distributed multi-agent systems using reinforcement learning
    Moghadam, Rohollah
    Modares, Hamidreza
    IET CONTROL THEORY AND APPLICATIONS, 2018, 12 (16): 2165 - 2174
  • [46] Reinforcement Learning and Feedback Control USING NATURAL DECISION METHODS TO DESIGN OPTIMAL ADAPTIVE CONTROLLERS
    Lewis, Frank L.
    Vrabie, Draguna
    Vamvoudakis, Kyriakos G.
    IEEE CONTROL SYSTEMS MAGAZINE, 2012, 32 (06): 76 - 105
  • [47] Reinforcement Learning-Based Adaptive Optimal Control for Partially Unknown Systems Using Differentiator
    Guo, Xinxin
    Yan, Weisheng
    Cui, Rongxin
    2018 ANNUAL AMERICAN CONTROL CONFERENCE (ACC), 2018: 1039 - 1044
  • [48] Reinforcement learning and feedback control: Using natural decision methods to design optimal adaptive controllers
    Lewis, Frank L.
    Vrabie, Draguna
    Vamvoudakis, Kyriakos G.
    IEEE Control Systems, 2012, 32 (06) : 76 - 105
  • [49] A reinforcement learning-based scheme for direct adaptive optimal control of linear stochastic systems
    Wong, Wee Chin
    Lee, Jay H.
    OPTIMAL CONTROL APPLICATIONS & METHODS, 2010, 31 (04): 365 - 374
  • [50] Enhancing the Performance of Adaptive Iterative Learning Control with Reinforcement Learning
    Nemec, Bojan
    Simonic, Mihael
    Likar, Nejc
    Ude, Ales
    2017 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2017: 2192 - 2199