Near-optimal Trajectory Tracking in Quadcopters using Reinforcement Learning

被引：0

作者：

Engelhardt, Randal ^{[1
]}

Velazquez, Alberto ^{[2
]}

Sardarmehni, Tohid ^{[1
]}

机构：

[1] Calif State Univ Northridge, Mech Engn, Northridge, CA 91330 USA

[2] Univ Texas Rio Grande Valley, Mech Engn, Edinburg, TX 78539 USA

来源：

IFAC PAPERSONLINE | 2024年 / 58卷 / 28期

关键词：

Optimal Control; Reinforcement Learning; Quadcopter; MODEL-PREDICTIVE CONTROL; QUADROTOR;

D O I：

10.1016/j.ifacol.2024.12.011

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The control of quadcopters poses significant challenges due to their complex dynamics characterized by highly nonlinear couplings, high system order, and under-actuation. This paper presents a novel control solution aimed at achieving near-optimal trajectory tracking for quadcopters. A near-optimal solution based on approximate dynamic programming is proposed to address the curse of dimensionality inherent in traditional dynamic programming, employing a single network adaptive critic. Extensive simulations validate the effectiveness and robustness of the proposed solution.

引用

页码：61 / 65

页数：5

共 50 条

[1] Near-Optimal Reinforcement Learning in Polynomial Time
Michael Kearns
Satinder Singh
Machine Learning, 2002, 49 : 209 - 232
[2] Near-optimal reinforcement learning in polynomial time
Kearns, M
Singh, S
MACHINE LEARNING, 2002, 49 (2-3) : 209 - 232
[3] Near-optimal Regret Bounds for Reinforcement Learning
Jaksch, Thomas
Ortner, Ronald
Auer, Peter
JOURNAL OF MACHINE LEARNING RESEARCH, 2010, 11 : 1563 - 1600
[4] Near-optimal regret bounds for reinforcement learning
Jaksch, Thomas
Ortner, Ronald
Auer, Peter
Journal of Machine Learning Research, 2010, 11 : 1563 - 1600
[5] Near-optimal Reinforcement Learning in Factored MDPs
Osband, Ian
Van Roy, Benjamin
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
[6] Near-Optimal Reinforcement Learning in Dynamic Treatment Regimes
Zhang, Junzhe
Bareinboim, Elias
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
[7] Near-Optimal Reinforcement Learning with Self-Play
Bai, Yu
Jin, Chi
Yu, Tiancheng
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
[8] Polynomial-time reinforcement learning of near-optimal policies
Pivazyan, K
Shoham, Y
EIGHTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-02)/FOURTEENTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE (IAAI-02), PROCEEDINGS, 2002, : 205 - 210
[9] Selecting Near-Optimal Approximate State Representations in Reinforcement Learning
Ortner, Ronald
Maillard, Odalric-Ambrym
Ryabko, Daniil
Algorithmic Learning Theory (ALT 2014), 2014, 8776 : 140 - 154
[10] Near-Optimal Location Tracking Using Sensor Networks
Sharma, Gokarna
Krishnan, Hari
Busch, Costas
Brandt, Steven R.
PROCEEDINGS OF 2014 IEEE INTERNATIONAL PARALLEL & DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2014, : 738 - 747

← 1 2 3 4 5 →