Data-driven offline reinforcement learning approach for quadrotor's motion and path planning

被引:0
|
作者
ZHAO, Haoran [1 ]
FU, Hang [1 ]
YANG, Fan [1 ]
QU, Che [1 ]
ZHOU, Yaoming [1 ,2 ,3 ]
机构
[1] School of Aeronautic Science and Engineering, Beihang University, Beijing,100191, China
[2] Beijing Advanced Discipline Center for Unmanned Aircraft System, Beihang University, Beijing,100191, China
[3] Tianmushan Laboratory, Hangzhou,311115, China
来源
Chinese Journal of Aeronautics | 1600年 / 37卷 / 11期
基金
中国国家自然科学基金;
关键词
Aerial vehicle - Data driven - Data-driven learning - Markov Decision Processes - Motion and path planning - Motion-planning - Offline - Reinforcement learning approach - Reinforcement learnings - Unmanned aerial vehicle;
D O I
暂无
中图分类号
学科分类号
摘要
Non-learning based motion and path planning of an Unmanned Aerial Vehicle (UAV) is faced with low computation efficiency, mapping memory occupation and local optimization problems. This article investigates the challenge of quadrotor control using offline reinforcement learning. By establishing a data-driven learning paradigm that operates without real-environment interaction, the proposed workflow offers a safer approach than traditional reinforcement learning, making it particularly suited for UAV control in industrial scenarios. The introduced algorithm evaluates dataset uncertainty and employs a pessimistic estimation to foster offline deep reinforcement learning. Experiments highlight the algorithm's superiority over traditional online reinforcement learning methods, especially when learning from offline datasets. Furthermore, the article emphasizes the importance of a more general behavior policy. In evaluations, the trained policy demonstrated versatility by adeptly navigating diverse obstacles, underscoring its real-world applicability. © 2024
引用
收藏
页码:386 / 397
相关论文
共 50 条
  • [21] Data-Driven Personalized Learning Path Planning Based on Cognitive Diagnostic Assessments in MOOCs
    Jiang, Bo
    Li, Xinya
    Yang, Shuhao
    Kong, Yaqi
    Cheng, Wei
    Hao, Chuanyan
    Lin, Qiaomin
    APPLIED SCIENCES-BASEL, 2022, 12 (08):
  • [22] Inverse Learning for Data-Driven Calibration of Model-Based Statistical Path Planning
    Menner, Marcel
    Berntorp, Karl
    Zeilinger, Melanie N.
    Cairano, Stefano DI
    IEEE Transactions on Intelligent Vehicles, 2021, 6 (01): : 131 - 145
  • [23] Inverse Learning for Data-Driven Calibration of Model-Based Statistical Path Planning
    Menner, Marcel
    Berntorp, Karl
    Zeilinger, Melanie N.
    Di Cairano, Stefano
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2021, 6 (01): : 131 - 145
  • [24] Offline Reinforcement Learning for Quadrotor Control: Overcoming the Ground Effect
    Sacchetto, Luca
    Korte, Mathias
    Gronauer, Sven
    Diepold, Klaus
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 7539 - 7544
  • [25] Data-driven Predictive Control for Safe Motion Planning
    Dai, Li
    Huang, Teng
    Gao, Yulong
    Li, Sihang
    Deng, Yunshan
    Xia, Yuanqing
    UNMANNED SYSTEMS, 2025,
  • [26] Data-Driven Topological Motion Planning with Persistent Cohomology
    Pokorny, Florian T.
    Kragic, Danica
    ROBOTICS: SCIENCE AND SYSTEMS XI, 2015,
  • [27] A Data-Driven Approach to Motion Planning and Optimal Control of Medical Nanorobots with Dynamic Window Approach
    Pandav, Prahlad
    Ren, Juan
    IFAC PAPERSONLINE, 2023, 56 (03): : 583 - 588
  • [28] Data-driven planning via imitation learning
    Choudhury, Sanjiban
    Bhardwaj, Mohak
    Arora, Sankalp
    Kapoor, Ashish
    Ranade, Gireeja
    Scherer, Sebastian
    Dey, Debadeepta
    INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2018, 37 (13-14): : 1632 - 1672
  • [29] Data-driven learning and planning for environmental sampling
    Ma, Kai-Chieh
    Liu, Lantao
    Heidarsson, Hordur K.
    Sukhatme, Gaurav S.
    JOURNAL OF FIELD ROBOTICS, 2018, 35 (05) : 643 - 661
  • [30] Data-driven unmanned surface vessel path following control method based on reinforcement learning
    Deng, Weinan
    Li, Hao
    Wen, YuanQiao
    PROCEEDINGS OF THE 2019 31ST CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2019), 2019, : 3035 - 3040