Towards better generalization in quadrotor landing using deep reinforcement learning

被引:0
|
作者
Wang, Jiawei [1 ,2 ]
Wang, Teng [3 ]
He, Zichen [1 ]
Cai, Wenzhe [3 ]
Sun, Changyin [2 ,3 ]
机构
[1] Tongji Univ, Coll Elect & Informat Engn, Shanghai 201804, Peoples R China
[2] Peng Cheng Lab, Shenzhen 518055, Peoples R China
[3] Southeast Univ, Dept Automat, Nanjing 210018, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep reinforcement learning; Auxiliary task; Partitioned experience; Quadrotor landing; Generalization; UNMANNED AIRCRAFT SYSTEMS;
D O I
10.1007/s10489-022-03503-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, the autonomous landing of unmanned aerial vehicles (UAVs) has attracted extensive attention due to the widespread applications of UAVs. With the rapid improvements in machine learning and artificial intelligence, recent research has begun to explore deep reinforcement learning (DRL) to learn the landing policy directly from raw observation data. However, current DRL-based solutions tend to suffer poor generalization to unseen environments. To deal with this issue, we formulate the landing problem as a two-stage DRL problem and bootstrap the DRL procedures by augmenting regular DRL loss with an auxiliary localization task. The auxiliary localization task provides dense supervision signals that aid in landing-relevant representation learning. In particular, two marker localization approaches are delicately designed based on deep classification and regression models, and differences between the two configurations are explored, aiming to answer the fundamental question of how to exploit localization better for representation learning. Furthermore, we propose a novel and flexible sampling strategy called Dynamic Partitioned Experience Replay to stabilize and accelerate the training procedure. Experimental results show that the auxiliary localization tasks combined with the improved sampling strategy aid the trained model to generalize in unseen environments. In addition, the trained model can be seamlessly transferred to the real-world quadrotors and has achieved outstanding landing performances.
引用
收藏
页码:6195 / 6213
页数:19
相关论文
共 50 条
  • [1] Towards better generalization in quadrotor landing using deep reinforcement learning
    Jiawei Wang
    Teng Wang
    Zichen He
    Wenzhe Cai
    Changyin Sun
    [J]. Applied Intelligence, 2023, 53 : 6195 - 6213
  • [2] Inclined Quadrotor Landing using Deep Reinforcement Learning
    Kooi, Jacob E.
    Babuska, Robert
    [J]. 2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 2361 - 2368
  • [3] Monocular Vision based Autonomous Landing of Quadrotor through Deep Reinforcement Learning
    Xu, Yinbo
    Liu, Zhihong
    Wang, Xiangke
    [J]. 2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 10014 - 10019
  • [4] Quadrotor motion control using deep reinforcement learning
    Jiang, Zifei
    Lynch, Alan F.
    [J]. JOURNAL OF UNMANNED VEHICLE SYSTEMS, 2021, 9 (04) : 234 - 251
  • [5] Waypoint Navigation of Quadrotor using Deep Reinforcement Learning
    Himanshu, K. Harikumar
    Pushpangathan, Jinraj, V
    [J]. IFAC PAPERSONLINE, 2022, 55 (22): : 281 - 286
  • [6] Bootstrap State Representation Using Style Transfer for Better Generalization in Deep Reinforcement Learning
    Rahman, Md Masudur
    Xue, Yexiang
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT IV, 2023, 13716 : 100 - 115
  • [7] A Reinforcement Learning Approach for Autonomous Control and Landing of a Quadrotor
    Vankadari, Madhu Babu
    Das, Kaushik
    Shinde, Chinmay
    Kumar, Swagat
    [J]. 2018 INTERNATIONAL CONFERENCE ON UNMANNED AIRCRAFT SYSTEMS (ICUAS), 2018, : 676 - 683
  • [8] Video surveillance using deep transfer learning and deep domain adaptation: Towards better generalization
    Himeur, Yassine
    Al-Maadeed, Somaya
    Kheddar, Hamza
    Al-Maadeed, Noor
    Abualsaud, Khalid
    Mohamed, Amr
    Khattab, Tamer
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 119
  • [9] Towards Generalization in Target-Driven Visual Navigation by Using Deep Reinforcement Learning
    Devo, Alessandro
    Mezzetti, Giacomo
    Costante, Gabriele
    Fravolini, Mario L.
    Valigi, Paolo
    [J]. IEEE TRANSACTIONS ON ROBOTICS, 2020, 36 (05) : 1546 - 1561
  • [10] Autonomous Navigation and Control of a Quadrotor Using Deep Reinforcement Learning
    Mokhtar, Mohamed
    El-Badawy, Ayman
    [J]. 2023 INTERNATIONAL CONFERENCE ON UNMANNED AIRCRAFT SYSTEMS, ICUAS, 2023, : 1045 - 1052