Data-driven offline reinforcement learning approach for quadrotor's motion and path planning

被引：0

作者：

ZHAO, Haoran ^{[1
]}

FU, Hang ^{[1
]}

YANG, Fan ^{[1
]}

QU, Che ^{[1
]}

ZHOU, Yaoming ^{[1
,2
,3
]}

机构：

[1] School of Aeronautic Science and Engineering, Beihang University, Beijing,100191, China

[2] Beijing Advanced Discipline Center for Unmanned Aircraft System, Beihang University, Beijing,100191, China

[3] Tianmushan Laboratory, Hangzhou,311115, China

来源：

Chinese Journal of Aeronautics | 1600年 / 37卷 / 11期

基金：

中国国家自然科学基金;

关键词：

Aerial vehicle - Data driven - Data-driven learning - Markov Decision Processes - Motion and path planning - Motion-planning - Offline - Reinforcement learning approach - Reinforcement learnings - Unmanned aerial vehicle;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Non-learning based motion and path planning of an Unmanned Aerial Vehicle (UAV) is faced with low computation efficiency, mapping memory occupation and local optimization problems. This article investigates the challenge of quadrotor control using offline reinforcement learning. By establishing a data-driven learning paradigm that operates without real-environment interaction, the proposed workflow offers a safer approach than traditional reinforcement learning, making it particularly suited for UAV control in industrial scenarios. The introduced algorithm evaluates dataset uncertainty and employs a pessimistic estimation to foster offline deep reinforcement learning. Experiments highlight the algorithm's superiority over traditional online reinforcement learning methods, especially when learning from offline datasets. Furthermore, the article emphasizes the importance of a more general behavior policy. In evaluations, the trained policy demonstrated versatility by adeptly navigating diverse obstacles, underscoring its real-world applicability. © 2024

引用

页码：386 / 397

共 50 条

[41] Data-Driven Control of Hydraulic Manipulators by Reinforcement Learning
Yao, Zhikai
Xu, Fengyu
Jiang, Guo-Ping
Yao, Jianyong
IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2024, 29 (04) : 2673 - 2684
[42] A data-driven approach for motion planning of industrial robots controlled by high-level motion commands
Hou, Shuxiao
Bdiwi, Mohamad
Rashid, Aquib
Krusche, Sebastian
Ihlenfeldt, Steffen
FRONTIERS IN ROBOTICS AND AI, 2023, 9
[43] A data-driven path planning model for crowd capacity analysis
Tan, Sing Kuang
Hu, Nan
Cai, Wentong
JOURNAL OF COMPUTATIONAL SCIENCE, 2019, 34 : 66 - 79
[44] Data-Driven Control of COVID-19 in Buildings: A Reinforcement-Learning Approach
Hosseinloo, Ashkan Haji
Nabi, Saleh
Hosoi, Anette
Dahleh, Munther A.
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 21 (04) : 5691 - 5699
[45] Data-driven dynamic resource scheduling for network slicing: A Deep reinforcement learning approach
Wang, Haozhe
Wu, Yulei
Min, Geyong
Xu, Jie
Tang, Pengcheng
INFORMATION SCIENCES, 2019, 498 : 106 - 116
[46] Improving Local Motion Planning with a Reinforcement Learning Approach
Garrote, Luis
Temporao, Diogo
Tempordo, Samuel
Pereira, Ricardo
Barros, Tiago
Nunes, Urbano J.
2020 IEEE INTERNATIONAL CONFERENCE ON AUTONOMOUS ROBOT SYSTEMS AND COMPETITIONS (ICARSC 2020), 2020, : 206 - 213
[47] A Data-Driven Model-Reference Adaptive Control Approach Based on Reinforcement Learning
Abouheaf, Mohammed
Gueaieb, Wail
Spinello, Davide
Al-Sharhan, Salah
2021 IEEE INTERNATIONAL SYMPOSIUM ON ROBOTIC AND SENSORS ENVIRONMENTS (ROSE 2021), 2021,
[48] Contrastive Learning: An Alternative Surrogate for Offline Data-Driven Evolutionary Computation
Huang, Hao-Gan
Gong, Yue-Jiao
IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2023, 27 (02) : 370 - 384
[49] Mobile Parallel Manipulators, Modelling and Data-Driven Motion Planning
Khoukhi, Amar
Hamdan, Mutaz
INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2013, 10
[50] Data-Driven Anytime Algorithms for Motion Planning with Safety Guarantees
Jha, Devesh K.
Zhu, Minghui
Wang, Yebin
Ray, Asok
2016 AMERICAN CONTROL CONFERENCE (ACC), 2016, : 5716 - 5721

← 1 2 3 4 5 →