High-Speed Three-Dimensional Aerial Vehicle Evasion Based on a Multi-Stage Dueling Deep Q-Network
Cited by: 0
Authors:
Yang, Yefeng [1,2]
Huang, Tao [1,2]
Wang, Xinxin [1]
Wen, Chih-Yung [2]
Huang, Xianlin [1]
Affiliations:
[1] Harbin Inst Technol, Ctr Control Theory & Guidance Technol, Harbin 150001, Peoples R China
[2] Hong Kong Polytech Univ, Dept Aeronaut & Aviat Engn, Hong Kong, Peoples R China
Keywords:
aerial vehicle evasion;
deep reinforcement learning;
dueling deep Q-network;
multi-stage training;
DIFFERENTIAL GAME;
GUIDANCE LAW;
PURSUERS;
MANEUVER;
EQUATION;
EVADERS;
DOI:
10.3390/aerospace9110673
Chinese Library Classification (CLC) number:
V [Aviation, Aerospace];
Discipline classification code:
08;
0825;
Abstract:
This paper proposes a multi-stage dueling deep Q-network (MS-DDQN) algorithm to address the high-speed aerial vehicle evasion problem. High-speed aerial vehicle pursuit and evasion is an ongoing game that attracts significant research attention in the field of autonomous aerial vehicle decision making. However, traditional maneuvering methods are usually not applicable in high-speed scenarios. Independent of the aerial vehicle model, the MS-DDQN-based method searches for an approximately optimal maneuvering policy by iteratively interacting with the environment. Furthermore, a multi-stage learning mechanism is introduced to improve the quality of the training data. Simulation experiments compare the proposed method with several typical evasion maneuvering policies and demonstrate the effectiveness and robustness of the proposed MS-DDQN algorithm.