Multi-Task Reinforcement Learning for Quadrotors

被引:0
|
作者
Xing, Jiaxu [1 ,2 ,3 ]
Geles, Ismail [1 ,2 ,3 ]
Song, Yunlong [1 ,2 ,3 ]
Aljalbout, Elie [1 ,2 ,3 ]
Scaramuzza, Davide [1 ,2 ,3 ]
机构
[1] Univ Zurich, Dept Informat, Robot & Percept Grp, CH-8006 Zurich, Switzerland
[2] Univ Zurich, Dept Neuroinformat, CH-8006 Zurich, Switzerland
[3] Swiss Fed Inst Technol, CH-8006 Zurich, Switzerland
来源
IEEE ROBOTICS AND AUTOMATION LETTERS | 2025年 / 10卷 / 03期
基金
欧洲研究理事会;
关键词
Reinforcement learning; machine learning for robot control; aerial systems: perception and autonomy;
D O I
10.1109/LRA.2024.3520894
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Reinforcement learning (RL) has shown great effectiveness in quadrotor control, enabling specialized policies to develop even human-champion-level performance in single-task scenarios. However, these specialized policies often struggle with novel tasks, requiring a complete retraining of the policy from scratch. To address this limitation, this paper presents a novel multi-task reinforcement learning (MTRL) framework tailored for quadrotor control, leveraging the shared physical dynamics of the platform to enhance sample efficiency and task performance. By employing a multi-critic architecture and shared task encoders, our framework facilitates knowledge transfer across tasks, enabling a single policy to execute diverse maneuvers, including high-speed stabilization, velocity tracking, and autonomous racing. Our experimental results, validated both in simulation and real-world scenarios, demonstrate that our framework outperforms baseline approaches in terms of sample efficiency and overall task performance. Video is available at https://youtu.be/HfK9UT1OVnY.
引用
收藏
页码:2112 / 2119
页数:8
相关论文
共 50 条
  • [31] Multi-Task Reinforcement Learning with Context-based Representations
    Sodhani, Shagun
    Zhang, Amy
    Pineau, Joelle
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [32] Contrastive Modules with Temporal Attention for Multi-Task Reinforcement Learning
    Lan, Siming
    Zhang, Rui
    Yi, Qi
    Guo, Jiaming
    Peng, Shaohui
    Gao, Yunkai
    Wu, Fan
    Chen, Ruizhi
    Du, Zidong
    Hu, Xing
    Zhang, Xishan
    Li, Ling
    Chen, Yunji
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [33] Multi-task reinforcement learning in partially observable stochastic environments
    Li, Hui
    Liao, Xuejun
    Carin, Lawrence
    Journal of Machine Learning Research, 2009, 10 : 1131 - 1186
  • [34] Conservative Data Sharing for Multi-Task Offline Reinforcement Learning
    Yu, Tianhe
    Kumar, Aviral
    Chebotar, Yevgen
    Hausman, Karol
    Levine, Sergey
    Finn, Chelsea
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [35] Discovering Synergies for Robot Manipulation with Multi-Task Reinforcement Learning
    He, Zhanpeng
    Ciocarlie, Matei
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022, : 2714 - 2721
  • [36] PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction
    Bai, Fengshuo
    Zhang, Hongming
    Tao, Tianyang
    Wu, Zhiheng
    Wang, Yanna
    Xu, Bo
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 6, 2023, : 6728 - 6736
  • [37] Efficient Design Space Exploration with Multi-Task Reinforcement Learning
    Hoffmann, Patrick
    Gorelik, Kirill
    Ivanov, Valentin
    2024 IEEE INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT MECHATRONICS, AIM 2024, 2024, : 1378 - 1385
  • [38] Prioritized Sampling with Intrinsic Motivation in Multi-Task Reinforcement Learning
    D'Eramo, Carlo
    Chalvatzaki, Georgia
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [39] A Multi-Task Reinforcement Learning Approach for Navigating Unsignalized Intersections
    Kai, Shixiong
    Wang, Bin
    Chen, Dong
    Hao, Jianye
    Zhang, Hongbo
    Liu, Wulong
    2020 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2020, : 1682 - 1687
  • [40] Multi-task Deep Reinforcement Learning: a Combination of Rainbow and DisTraL
    Andalibi, Milad
    Setoodeh, Peyman
    Mansourieh, Ali
    Asemani, Mohammad Hassan
    2020 6TH IRANIAN CONFERENCE ON SIGNAL PROCESSING AND INTELLIGENT SYSTEMS (ICSPIS), 2020,