Unmanned Aerial Vehicle Pitch Control Using Deep Reinforcement Learning with Discrete Actions in Wind Tunnel Test

被引:14
|
作者
Wada, Daichi [1 ]
Araujo-Estrada, Sergio A. [2 ]
Windsor, Shane [2 ]
机构
[1] Japan Aerosp Explorat Agcy, Aeronaut Technol Directorate, Tokyo 1810015, Japan
[2] Univ Bristol, Dept Aerosp Engn, Bristol BS8 1TR, Avon, England
基金
欧洲研究理事会;
关键词
attitude control; deep reinforcement learning; fixed-wing aircraft; unmanned aerial vehicle; wind tunnel test;
D O I
10.3390/aerospace8010018
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
Deep reinforcement learning is a promising method for training a nonlinear attitude controller for fixed-wing unmanned aerial vehicles. Until now, proof-of-concept studies have demonstrated successful attitude control in simulation. However, detailed experimental investigations have not yet been conducted. This study applied deep reinforcement learning for one-degree-of-freedom pitch control in wind tunnel tests with the aim of gaining practical understandings of attitude control application. Three controllers with different discrete action choices, that is, elevator angles, were designed. The controllers with larger action rates exhibited better performance in terms of following angle-of-attack commands. The root mean square errors for tracking angle-of-attack commands decreased from 3.42 degrees to 1.99 degrees as the maximum action rate increased from 10 degrees/s to 50 degrees/s. The comparison between experimental and simulation results showed that the controller with a smaller action rate experienced the friction effect, and the controllers with larger action rates experienced fluctuating behaviors in elevator maneuvers owing to delay. The investigation of the effect of friction and delay on pitch control highlighted the importance of conducting experiments to understand actual control performances, specifically when the controllers were trained with a low-fidelity model.
引用
下载
收藏
页码:1 / 16
页数:16
相关论文
共 50 条
  • [1] Unmanned Aerial Vehicle Pitch Control under Delay Using Deep Reinforcement Learning with Continuous Action in Wind Tunnel Test
    Wada, Daichi
    Araujo-Estrada, Sergio A.
    Windsor, Shane
    AEROSPACE, 2021, 8 (09)
  • [2] Wind tunnel test of an unmanned aerial vehicle (UAV)
    Chung Jindeog
    Lee Jangyeon
    Sung Bongzoo
    Koo Samok
    KSME International Journal, 2003, 17 : 776 - 783
  • [3] Wind tunnel test of an unmanned aerial vehicle (UAV)
    Jindeog, C
    Jangyeon, L
    Bongzoo, S
    Samok, K
    KSME INTERNATIONAL JOURNAL, 2003, 17 (05): : 776 - 783
  • [4] Autonomous control of unmanned aerial vehicle for chemical detection using deep reinforcement learning
    Byun, Hyung Joon
    Nam, Hyunwoo
    ELECTRONICS LETTERS, 2022, 58 (11) : 423 - 425
  • [5] WIND-TUNNEL TEST TECHNIQUES FOR UNMANNED AERIAL VEHICLE SEPARATION INVESTIGATIONS
    MOYER, SA
    TALBOT, MD
    JOURNAL OF AIRCRAFT, 1994, 31 (03): : 585 - 590
  • [6] Trajectory tracking control of an unmanned aerial vehicle with deep reinforcement learning for tasks inside the EAST
    Yu, Chao
    Yang, Yang
    Cheng, Yong
    Wang, Zheng
    Shi, Mingming
    FUSION ENGINEERING AND DESIGN, 2023, 194
  • [7] Wind tunnel investigation of the propellers for unmanned aerial vehicle
    Czyz, Zbigniew
    Karpinski, Pawel
    Skiba, Krzysztof
    2021 IEEE 8TH INTERNATIONAL WORKSHOP ON METROLOGY FOR AEROSPACE (IEEE METROAEROSPACE), 2021, : 672 - 676
  • [8] Multi-target tracking for unmanned aerial vehicle swarms using deep reinforcement learning
    Zhou, Wenhong
    Liu, Zhihong
    Li, Jie
    Xu, Xin
    Shen, Lincheng
    NEUROCOMPUTING, 2021, 466 : 285 - 297
  • [9] Speed and heading control of an unmanned surface vehicle using deep reinforcement learning
    Wu, Ting
    Ye, Hui
    Xiang, Zhengrong
    Yang, Xiaofei
    2023 IEEE 12TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE, DDCLS, 2023, : 573 - 578
  • [10] Deep Reinforcement Learning Based Unmanned Aerial Vehicle (UAV) Control Using 3D Hand Gestures
    Khan, Fawad Salam
    Mohd, Mohd Norzali Haji
    Zulkifli, Saiful Azrin B. M.
    Abro, Ghulam E. Mustafa
    Kazi, Suhail
    Soomro, Dur Muhammad
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 72 (03): : 5741 - 5759