A Novel Learning-Based Trajectory Generation Strategy for a Quadrotor

Cited by: 3
Authors
Hua, Hean [1 ,2 ]
Fang, Yongchun [3 ,4 ]
Affiliations
[1] Hunan Univ, Coll Elect & Informat Engn, Changsha 410082, Peoples R China
[2] Hunan Univ, Natl Engn Res Ctr Robot Visual Percept & Control, Changsha 410082, Peoples R China
[3] Nankai Univ, Coll Artificial Intelligence, Inst Robot & Automat Informat Syst, Tianjin 300353, Peoples R China
[4] Nankai Univ, Tianjin Key Lab Intelligent Robot, Tianjin 300353, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Trajectory; Navigation; Real-time systems; Optimization; Planning; Vehicle dynamics; Decision making; Local trajectory generation; quadrotor; real-time motion planning; real-world validation; reinforcement learning (RL); OBSTACLE AVOIDANCE; ROBUST; FLIGHT; UAV;
DOI
10.1109/TNNLS.2022.3217814
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This article proposes a learning-based trajectory generation framework for quadrotors that guarantees real-time, efficient, and practically reliable navigation by making human-like decisions online via reinforcement learning (RL) and imitation learning (IL). Specifically, inspired by human driving behavior and the perception range of sensors, a real-time local planner is designed by combining learning and optimization techniques, so that smooth and flexible trajectories are planned online and efficiently within the observable area. In particular, the two key problems in the framework, temporal optimality (time allocation) and spatial optimality (trajectory distribution), are solved by an RL policy that issues human-like commands in real time (e.g., slower or faster) to achieve better navigation, instead of generating traditional low-level motions. Real-time trajectories are then computed by convex optimization according to the efficient and accurate decisions of the RL policy. In addition, an expert policy and IL are employed in the framework to improve generalization and to accelerate training. Compared with existing works, the core contribution is a real-time, practice-oriented intelligent trajectory generation framework for quadrotors that integrates human-like decision-making with model-based optimization to plan high-quality trajectories. Comparative experiments in known and unknown environments illustrate the superior performance of the proposed trajectory generation strategy in terms of efficiency, smoothness, and flexibility.
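The abstract does not give the formulation, so the following is only a simplified sketch of the general idea and not the authors' method. It assumes a single one-dimensional rest-to-rest segment, for which the minimum-jerk problem (a convex quadratic program) has a well-known closed-form quintic solution. The command names, the scale factors in `COMMAND_SCALE`, and all function names are hypothetical and chosen for illustration.

```python
# Illustrative sketch only: the command set, scale factors, and function
# names are assumptions and do not come from the paper.
COMMAND_SCALE = {"slower": 1.25, "keep": 1.0, "faster": 0.8}


def apply_command(duration, command):
    """Rescale a segment duration according to a high-level command."""
    return duration * COMMAND_SCALE[command]


def min_jerk_position(p0, p1, duration, t):
    """Position at time t on a rest-to-rest minimum-jerk segment.

    Zero velocity and acceleration are assumed at both endpoints, which
    gives the closed-form quintic blend 10 s^3 - 15 s^4 + 6 s^5.
    """
    s = min(max(t / duration, 0.0), 1.0)  # normalized time, clamped to [0, 1]
    blend = 10 * s**3 - 15 * s**4 + 6 * s**5
    return p0 + (p1 - p0) * blend


def min_jerk_peak_speed(p0, p1, duration):
    """Peak speed of the same segment, reached at the midpoint (s = 0.5)."""
    return 1.875 * abs(p1 - p0) / duration


if __name__ == "__main__":
    nominal = 4.0  # seconds, an arbitrary example value
    for command in ("slower", "keep", "faster"):
        T = apply_command(nominal, command)
        print(command, T, min_jerk_peak_speed(0.0, 2.0, T))
```

In this toy setting the high-level command changes only the time allocation; the trajectory shape is then fixed by the optimization, so a "faster" command shortens the segment and raises the peak speed.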
Pages: 9068-9079
Number of pages: 12
Related papers
50 in total
  • [1] A Novel Reinforcement Learning-Based Robust Control Strategy for a Quadrotor
    Hua, Hean
    Fang, Yongchun
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2023, 70 (03) : 2812 - 2821
  • [2] Learning-based Trajectory Generation for Intelligent Vehicles in Urban Environment
    Guo, Chunzhao
    Kidono, Kiyosumi
    Ogawa, Masaru
    [J]. 2016 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2016, : 1236 - 1241
  • [3] Control based motion primitives for quadrotor trajectory generation
    Lai, Shupeng
    Lan, Menglu
    Chen, Ben M.
    [J]. PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 8323 - 8329
  • [4] Optimal Trajectory Generation of a Quadrotor Based on the Differential Flatness
    Yu, Jing
    Cai, Zhihao
    Wang, Yingxun
    [J]. PROCEEDINGS OF THE 28TH CHINESE CONTROL AND DECISION CONFERENCE (2016 CCDC), 2016, : 678 - 683
  • [5] Minimum Jerk Trajectory Generation of a Quadrotor Based on the Differential Flatness
    Yu, Jing
    Cai, Zhihao
    Wang, Yingxun
    [J]. 2014 IEEE CHINESE GUIDANCE, NAVIGATION AND CONTROL CONFERENCE (CGNCC), 2014, : 832 - 837
  • [6] Alternating Minimization Based Trajectory Generation for Quadrotor Aggressive Flight
    Wang, Zhepei
    Zhou, Xin
    Xu, Chao
    Chu, Jian
    Gao, Fei
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (03) : 4836 - 4843
  • [7] Deep learning-based privacy-preserving framework for synthetic trajectory generation
    Kim, Jong Wook
    Jang, Beakcheol
    [J]. JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2022, 206
  • [8] A deep reinforcement learning-based approach to onboard trajectory generation for hypersonic vehicles
    Bao, C. Y.
    Zhou, X.
    Wang, P.
    He, R. Z.
    Tang, G. J.
    [J]. AERONAUTICAL JOURNAL, 2023, 127 (1315) : 1638 - 1658
  • [9] The Minimum Jerk Trajectory Generation of a Quadrotor Based on the Differential Flatness
    Yu, Jing
    Cai, Zhihao
    Wang, Yingxun
    [J]. 2016 IEEE CHINESE GUIDANCE, NAVIGATION AND CONTROL CONFERENCE (CGNCC), 2016, : 1061 - 1066
  • [10] Trajectory tracking strategy of quadrotor with output delay
    Qi, Jiaming
    Lv, Yueyong
    Gao, Duozhi
    Zhang, Zhaodi
    Li, Chenxing
    [J]. 2018 EIGHTH INTERNATIONAL CONFERENCE ON INSTRUMENTATION AND MEASUREMENT, COMPUTER, COMMUNICATION AND CONTROL (IMCCC 2018), 2018, : 1303 - 1308