Application of Reinforcement Learning in Controlling Quadrotor UAV Flight Actions

被引:2
|
作者
Shen, Shang-En [1 ]
Huang, Yi-Cheng [1 ]
机构
[1] Natl Chung Hsing Univ, Dept Mech Engn, Taichung 40227, Taiwan
关键词
quadrotor UAV; reinforcement learning; logic control; target recognition; action decision making;
D O I
10.3390/drones8110660
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
Most literature has extensively discussed reinforcement learning (RL) for controlling rotorcraft drones during flight for traversal tasks. However, most studies lack adequate details regarding the design of reward and punishment mechanisms, and there is a limited exploration of the feasibility of applying reinforcement learning in actual flight control following simulation experiments. Consequently, this study focuses on the exploration of reward and punishment design and state input for RL. The simulation environment is constructed using AirSim and Unreal Engine, with onboard camera footage serving as the state input for reinforcement learning. The research investigates three RL algorithms suitable for discrete action training. The Deep Q Network (DQN), Advantage Actor-Critic (A2C), and Proximal Policy Optimization (PPO) were combined with three different reward and punishment design mechanisms for training and testing. The results indicate that employing the PPO algorithm along with a continuous return method as the reward mechanism allows for effective convergence during the training process, achieving a target traversal rate of 71% in the testing environment. Furthermore, this study proposes integrating the YOLOv7-tiny object detection (OD) system to assess the applicability of reinforcement learning in real-world settings. Unifying the state inputs of simulated and OD environments and replacing the original simulated image inputs with a maximum dual-target approach, the experimental simulation achieved a target traversal rate of 52% ultimately. In summary, this research formulates a set of logical frameworks for an RL reward and punishment design deployed with real-time Yolo's OD implementation synergized as a useful aid for related RL studies.
引用
收藏
页数:25
相关论文
共 50 条
  • [41] How to Train Your Quadrotor: A Framework for Consistently Smooth and Responsive Flight Control via Reinforcement Learning
    Mysore, Siddharth
    Mabsout, Bassel
    Saenko, Kate
    Mancuso, Renato
    ACM TRANSACTIONS ON CYBER-PHYSICAL SYSTEMS, 2021, 5 (04)
  • [42] Design and implementation of a low-complexity flight controller for a quadrotor UAV
    Mirtaba, Mohammad
    Jeddi, Mohammad
    Nikoofard, Amirhossein
    Shirmohammadi, Zahra
    INTERNATIONAL JOURNAL OF DYNAMICS AND CONTROL, 2023, 11 (02) : 689 - 700
  • [43] Design and implementation of a low-complexity flight controller for a quadrotor UAV
    Mohammad Mirtaba
    Mohammad Jeddi
    Amirhossein Nikoofard
    Zahra Shirmohammadi
    International Journal of Dynamics and Control, 2023, 11 : 689 - 700
  • [44] Deep Reinforcement Learning with Omnidirectional Images: application to UAV Navigation in Forests
    Artizzu, Charles-Olivier
    Allibert, Guillaume
    Demonceaux, Cedric
    2022 17TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV), 2022, : 229 - 234
  • [45] Reinforcement Learning with Multiple Actions
    Nishiyama, Riku
    Yamada, Satoshi
    PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON INTELLIGENT TECHNOLOGIES AND ENGINEERING SYSTEMS (ICITES2014), 2016, 345 : 207 - 213
  • [46] Reinforcement Learning with Parameterized Actions
    Masson, Warwick
    Ranchod, Pravesh
    Konidaris, George
    THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 1934 - 1940
  • [47] Dynamic Spectrum Interaction of UAV Flight Formation Communication With Priority: A Deep Reinforcement Learning Approach
    Lin, Yun
    Wang, Meiyu
    Zhou, Xianglong
    Ding, Guoru
    Mao, Shiwen
    IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2020, 6 (03) : 892 - 903
  • [48] Coupled PID-SDRE Controller of a Quadrotor: Positioning and Stabilization of UAV Flight
    Chodnicki, Marcin
    Stecz, Wojciech
    Giernacki, Wojciech
    Stepien, Slawomir
    PROCEEDINGS OF THE 19TH INTERNATIONAL CONFERENCE ON INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS (ICINCO), 2022, : 524 - 530
  • [49] Biplane-Quadrotor Tail-Sitter UAV: Flight Dynamics and Control
    Swarnkar, Swati
    Parwana, Hardik
    Kothari, Mangal
    Abhishek, Abhishek
    JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2018, 41 (05) : 1049 - 1067
  • [50] Quadrotor UAV flight control via a novel saturation integral backstepping controller
    Zhou, Laihong
    Zhang, Juqian
    She, Houxin
    Jin, Hong
    AUTOMATIKA, 2019, 60 (02) : 193 - 206