Near-optimal Trajectory Tracking in Quadcopters using Reinforcement Learning

被引:0
|
作者
Engelhardt, Randal [1 ]
Velazquez, Alberto [2 ]
Sardarmehni, Tohid [1 ]
机构
[1] Calif State Univ Northridge, Mech Engn, Northridge, CA 91330 USA
[2] Univ Texas Rio Grande Valley, Mech Engn, Edinburg, TX 78539 USA
来源
IFAC PAPERSONLINE | 2024年 / 58卷 / 28期
关键词
Optimal Control; Reinforcement Learning; Quadcopter; MODEL-PREDICTIVE CONTROL; QUADROTOR;
D O I
10.1016/j.ifacol.2024.12.011
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The control of quadcopters poses significant challenges due to their complex dynamics characterized by highly nonlinear couplings, high system order, and under-actuation. This paper presents a novel control solution aimed at achieving near-optimal trajectory tracking for quadcopters. A near-optimal solution based on approximate dynamic programming is proposed to address the curse of dimensionality inherent in traditional dynamic programming, employing a single network adaptive critic. Extensive simulations validate the effectiveness and robustness of the proposed solution.
引用
收藏
页码:61 / 65
页数:5
相关论文
共 50 条
  • [1] Near-Optimal Reinforcement Learning in Polynomial Time
    Michael Kearns
    Satinder Singh
    Machine Learning, 2002, 49 : 209 - 232
  • [2] Near-optimal reinforcement learning in polynomial time
    Kearns, M
    Singh, S
    MACHINE LEARNING, 2002, 49 (2-3) : 209 - 232
  • [3] Near-optimal Regret Bounds for Reinforcement Learning
    Jaksch, Thomas
    Ortner, Ronald
    Auer, Peter
    JOURNAL OF MACHINE LEARNING RESEARCH, 2010, 11 : 1563 - 1600
  • [4] Near-optimal regret bounds for reinforcement learning
    Jaksch, Thomas
    Ortner, Ronald
    Auer, Peter
    Journal of Machine Learning Research, 2010, 11 : 1563 - 1600
  • [5] Near-optimal Reinforcement Learning in Factored MDPs
    Osband, Ian
    Van Roy, Benjamin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
  • [6] Near-Optimal Reinforcement Learning in Dynamic Treatment Regimes
    Zhang, Junzhe
    Bareinboim, Elias
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [7] Near-Optimal Reinforcement Learning with Self-Play
    Bai, Yu
    Jin, Chi
    Yu, Tiancheng
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [8] Polynomial-time reinforcement learning of near-optimal policies
    Pivazyan, K
    Shoham, Y
    EIGHTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-02)/FOURTEENTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE (IAAI-02), PROCEEDINGS, 2002, : 205 - 210
  • [9] Selecting Near-Optimal Approximate State Representations in Reinforcement Learning
    Ortner, Ronald
    Maillard, Odalric-Ambrym
    Ryabko, Daniil
    Algorithmic Learning Theory (ALT 2014), 2014, 8776 : 140 - 154
  • [10] Near-Optimal Location Tracking Using Sensor Networks
    Sharma, Gokarna
    Krishnan, Hari
    Busch, Costas
    Brandt, Steven R.
    PROCEEDINGS OF 2014 IEEE INTERNATIONAL PARALLEL & DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2014, : 738 - 747