Deep Reinforcement Learning for UAV Intelligent Mission Planning

被引:12
|
作者
Yue, Longfei [1 ]
Yang, Rennong [1 ]
Zhang, Ying [1 ]
Yu, Lixin [1 ]
Wang, Zhuangzhuang [2 ]
机构
[1] Air Force Engn Univ, Air Traff Control & Nav Coll, Xian 710051, Peoples R China
[2] Air Force Engn Univ, Aviat Maintenance NCO Sch, Xinyang 464000, Peoples R China
基金
中国国家自然科学基金;
关键词
GO; SUPPRESSION; ALGORITHM; GAME;
D O I
10.1155/2022/3551508
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Rapid and precise air operation mission planning is a key technology in unmanned aerial vehicles (UAVs) autonomous combat in battles. In this paper, an end-to-end UAV intelligent mission planning method based on deep reinforcement learning (DRL) is proposed to solve the shortcomings of the traditional intelligent optimization algorithm, such as relying on simple, static, low-dimensional scenarios, and poor scalability. Specifically, the suppression of enemy air defense (SEAD) mission planning is described as a sequential decision-making problem and formalized as a Markov decision process (MDP). Then, the SEAD intelligent planning model based on the proximal policy optimization (PPO) algorithm is established and a general intelligent planning architecture is proposed. Furthermore, three policy training tricks, i.e., domain randomization, maximizing policy entropy, and underlying network parameter sharing, are introduced to improve the learning performance and generalizability of PPO. Experiments results show that the model in this work is efficient and stable, and can be adapted to the unknown continuous high-dimensional environment. It can be concluded that the UAV intelligent mission planning model based on DRL has powerful intelligent planning performance, and provides a new idea for researching UAV autonomy.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Deep Reinforcement Learning for Intelligent Communications
    Tan J.-J.
    Liang Y.-C.
    Dianzi Keji Daxue Xuebao/Journal of the University of Electronic Science and Technology of China, 2020, 49 (02): : 169 - 181
  • [32] A survey on security of UAV and deep reinforcement learning
    Sarikaya, Burcu Sonmez
    Bahtiyar, Serif
    AD HOC NETWORKS, 2024, 164
  • [33] INTELLIGENT ROUTE PLANNING METHOD FOR UAV BASED ON SWARM INTELLIGENCE AND DEEP LEARNING TECHNOLOGY
    Yang, Jian
    Huang, Xuejun
    COMPUTING AND INFORMATICS, 2024, 43 (04) : 874 - 899
  • [34] Deep Reinforcement Learning for Secrecy Energy-Efficient UAV Communication with Reconfigurable Intelligent Surface
    Tham, Mau-Luen
    Wong, Yi Jie
    Iqbal, Amjad
    Bin Ramli, Nordin
    Zhu, Yongxu
    Dagiuklas, Tasos
    2023 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC, 2023,
  • [35] Intelligent land vehicle model transfer trajectory planning method of deep reinforcement learning
    Yu L.-L.
    Shao X.-Y.
    Long Z.-W.
    Wei Y.-D.
    Zhou K.-J.
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2019, 36 (09): : 1409 - 1422
  • [36] Human Knowledge Augmented Deep Reinforcement Learning for Intelligent Automatic Radiotherapy Treatment Planning
    Shen, C.
    Chen, L.
    Gonzalez, Y.
    Nguyen, D.
    Jiang, S.
    Jia, X.
    MEDICAL PHYSICS, 2020, 47 (06) : E333 - E333
  • [37] Network Planning with Deep Reinforcement Learning
    Zhu, Hang
    Gupta, Varun
    Ahuja, Satyajeet Singh
    Tian, Yuandong
    Zhang, Ying
    Jin, Xin
    SIGCOMM '21: PROCEEDINGS OF THE 2021 ACM SIGCOMM 2021 CONFERENCE, 2021, : 258 - 271
  • [38] A Deep Reinforcement Learning Based UAV Trajectory Planning Method For Integrated Sensing And Communications Networks
    Lin, Heyun
    Zhang, Zhihai
    Wei, Longkun
    Zhou, Zihao
    Zheng, Tian
    2023 IEEE 98TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2023-FALL, 2023,
  • [39] UAV Coverage Path Planning under Varying Power Constraints using Deep Reinforcement Learning
    Theile, Mirco
    Bayerlein, Harald
    Nai, Richard
    Gesbert, David
    Caccamo, Marco
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 1444 - 1449
  • [40] A Deep Reinforcement Learning Approach for UAV Path Planning Incorporating Vehicle Dynamics with Acceleration Control
    Sabzekar, Sina
    Samadzad, Mahdi
    Mehditabrizi, Asal
    Tak, Ala Nekouvaght
    UNMANNED SYSTEMS, 2024, 12 (03) : 477 - 498