Crafting a robotic swarm pursuit–evasion capture strategy using deep reinforcement learning

被引:0
|
作者
Charles H. Wu
Donald A. Sofge
Daniel M. Lofaro
机构
[1] Cornell University,Distributed Autonomous Systems Group
[2] U.S. Naval Research Laboratory,Navy Center for Applied Research in Artificial Intelligence
[3] U.S. Naval Research Laboratory,undefined
来源
关键词
Reinforcement Learning; Swarm robotics; MADDPG; Hardware;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper we study the multi-agent pursuit–evasion problem, and present an extension of the Multi-Agent Deep Deterministic Policy Gradient (MADDPG) deep reinforcement learning algorithm. Previous pursuit–evasion advancements with MADDPG have focused on training capture strategies dependent on the restriction of evader movement with environmental features. We demonstrate a method to train pursuer agents to collaboratively surround and encircle an evader for reliable capture without a strategy rooted in environment entrapment (i.e. cornering). Our method utilizes a novel two-stage, variable-aggression, continuous reward function based on geometrical inscribed circles (incircles), along with a corresponding observation space, with agents operating in an entrapment-disadvantaged environment. Our results show reliable capture of an intelligent, superior evader by three trained pursuers in open space with our encircling strategy. A key novelty of our work is demonstrating the ability to transition behaviors learned using deep reinforcement learning from a simulated robotic system with imperfect world assumptions to a real-world robotic agents.
引用
收藏
页码:355 / 364
页数:9
相关论文
共 50 条
  • [1] Crafting a robotic swarm pursuit-evasion capture strategy using deep reinforcement learning
    Wu, Charles H.
    Sofge, Donald A.
    Lofaro, Daniel M.
    [J]. ARTIFICIAL LIFE AND ROBOTICS, 2022, 27 (02) : 355 - 364
  • [2] Orbital Interception Pursuit Strategy for Random Evasion Using Deep Reinforcement Learning
    Jiang, Rui
    Ye, Dong
    Xiao, Yan
    Sun, Zhaowei
    Zhang, Zeming
    [J]. SPACE: SCIENCE & TECHNOLOGY, 2023, 3
  • [3] Pursuit and Evasion Strategy of a Differential Game Based on Deep Reinforcement Learning
    Xu, Can
    Zhang, Yin
    Wang, Weigang
    Dong, Ligang
    [J]. FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, 2022, 10
  • [4] Pursuit-evasion with Decentralized Robotic Swarm in Continuous State Space and Action Space via Deep Reinforcement Learning
    Singh, Gurpreet
    Lofaro, Daniel M.
    Sofge, Donald
    [J]. ICAART: PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 1, 2020, : 226 - 233
  • [5] Near-optimal interception strategy for orbital pursuit-evasion using deep reinforcement learning
    Zhang, Jingrui
    Zhang, Kunpeng
    Zhang, Yao
    Shi, Heng
    Tang, Liang
    Li, Mou
    [J]. ACTA ASTRONAUTICA, 2022, 198 : 9 - 25
  • [6] Intelligent Maneuver Strategy for a Hypersonic Pursuit-Evasion Game Based on Deep Reinforcement Learning
    Guo, Yunhe
    Jiang, Zijian
    Huang, Hanqiao
    Fan, Hongjia
    Weng, Weiye
    [J]. AEROSPACE, 2023, 10 (09)
  • [7] Learning Evasion Strategy in Pursuit-Evasion by Deep Q-network
    Zhu, Jiagang
    Zou, Wei
    Zhu, Zheng
    [J]. 2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 67 - 72
  • [8] A Deep Reinforcement Learning Approach for the Pursuit Evasion Game in the Presence of Obstacles
    Qi, Qi
    Zhang, Xuebo
    Guo, Xian
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON REAL-TIME COMPUTING AND ROBOTICS (IEEE-RCAR 2020), 2020, : 68 - 73
  • [9] Generating collective foraging behavior for robotic swarm using deep reinforcement learning
    Boyin Jin
    Yupeng Liang
    Ziyao Han
    Kazuhiro Ohkura
    [J]. Artificial Life and Robotics, 2020, 25 : 588 - 595
  • [10] Generating collective foraging behavior for robotic swarm using deep reinforcement learning
    Jin, Boyin
    Liang, Yupeng
    Han, Ziyao
    Ohkura, Kazuhiro
    [J]. ARTIFICIAL LIFE AND ROBOTICS, 2020, 25 (04) : 588 - 595