Data-driven offline reinforcement learning approach for quadrotor's motion and path planning

被引:0
|
作者
ZHAO, Haoran [1 ]
FU, Hang [1 ]
YANG, Fan [1 ]
QU, Che [1 ]
ZHOU, Yaoming [1 ,2 ,3 ]
机构
[1] School of Aeronautic Science and Engineering, Beihang University, Beijing,100191, China
[2] Beijing Advanced Discipline Center for Unmanned Aircraft System, Beihang University, Beijing,100191, China
[3] Tianmushan Laboratory, Hangzhou,311115, China
来源
Chinese Journal of Aeronautics | 1600年 / 37卷 / 11期
基金
中国国家自然科学基金;
关键词
Aerial vehicle - Data driven - Data-driven learning - Markov Decision Processes - Motion and path planning - Motion-planning - Offline - Reinforcement learning approach - Reinforcement learnings - Unmanned aerial vehicle;
D O I
暂无
中图分类号
学科分类号
摘要
Non-learning based motion and path planning of an Unmanned Aerial Vehicle (UAV) is faced with low computation efficiency, mapping memory occupation and local optimization problems. This article investigates the challenge of quadrotor control using offline reinforcement learning. By establishing a data-driven learning paradigm that operates without real-environment interaction, the proposed workflow offers a safer approach than traditional reinforcement learning, making it particularly suited for UAV control in industrial scenarios. The introduced algorithm evaluates dataset uncertainty and employs a pessimistic estimation to foster offline deep reinforcement learning. Experiments highlight the algorithm's superiority over traditional online reinforcement learning methods, especially when learning from offline datasets. Furthermore, the article emphasizes the importance of a more general behavior policy. In evaluations, the trained policy demonstrated versatility by adeptly navigating diverse obstacles, underscoring its real-world applicability. © 2024
引用
收藏
页码:386 / 397
相关论文
共 50 条
  • [41] Data-Driven Control of Hydraulic Manipulators by Reinforcement Learning
    Yao, Zhikai
    Xu, Fengyu
    Jiang, Guo-Ping
    Yao, Jianyong
    IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2024, 29 (04) : 2673 - 2684
  • [42] A data-driven approach for motion planning of industrial robots controlled by high-level motion commands
    Hou, Shuxiao
    Bdiwi, Mohamad
    Rashid, Aquib
    Krusche, Sebastian
    Ihlenfeldt, Steffen
    FRONTIERS IN ROBOTICS AND AI, 2023, 9
  • [43] A data-driven path planning model for crowd capacity analysis
    Tan, Sing Kuang
    Hu, Nan
    Cai, Wentong
    JOURNAL OF COMPUTATIONAL SCIENCE, 2019, 34 : 66 - 79
  • [44] Data-Driven Control of COVID-19 in Buildings: A Reinforcement-Learning Approach
    Hosseinloo, Ashkan Haji
    Nabi, Saleh
    Hosoi, Anette
    Dahleh, Munther A.
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 21 (04) : 5691 - 5699
  • [45] Data-driven dynamic resource scheduling for network slicing: A Deep reinforcement learning approach
    Wang, Haozhe
    Wu, Yulei
    Min, Geyong
    Xu, Jie
    Tang, Pengcheng
    INFORMATION SCIENCES, 2019, 498 : 106 - 116
  • [46] Improving Local Motion Planning with a Reinforcement Learning Approach
    Garrote, Luis
    Temporao, Diogo
    Tempordo, Samuel
    Pereira, Ricardo
    Barros, Tiago
    Nunes, Urbano J.
    2020 IEEE INTERNATIONAL CONFERENCE ON AUTONOMOUS ROBOT SYSTEMS AND COMPETITIONS (ICARSC 2020), 2020, : 206 - 213
  • [47] A Data-Driven Model-Reference Adaptive Control Approach Based on Reinforcement Learning
    Abouheaf, Mohammed
    Gueaieb, Wail
    Spinello, Davide
    Al-Sharhan, Salah
    2021 IEEE INTERNATIONAL SYMPOSIUM ON ROBOTIC AND SENSORS ENVIRONMENTS (ROSE 2021), 2021,
  • [48] Contrastive Learning: An Alternative Surrogate for Offline Data-Driven Evolutionary Computation
    Huang, Hao-Gan
    Gong, Yue-Jiao
    IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2023, 27 (02) : 370 - 384
  • [49] Mobile Parallel Manipulators, Modelling and Data-Driven Motion Planning
    Khoukhi, Amar
    Hamdan, Mutaz
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2013, 10
  • [50] Data-Driven Anytime Algorithms for Motion Planning with Safety Guarantees
    Jha, Devesh K.
    Zhu, Minghui
    Wang, Yebin
    Ray, Asok
    2016 AMERICAN CONTROL CONFERENCE (ACC), 2016, : 5716 - 5721