Crafting a robotic swarm pursuit–evasion capture strategy using deep reinforcement learning

Cited by: 0

Authors:

Charles H. Wu

Donald A. Sofge

Daniel M. Lofaro

Affiliations:

[1] Cornell University, Distributed Autonomous Systems Group

[2] U.S. Naval Research Laboratory, Navy Center for Applied Research in Artificial Intelligence

[3] U.S. Naval Research Laboratory

Source:

Artificial Life and Robotics | 2022 / Vol. 27

Keywords:

Reinforcement learning; Swarm robotics; MADDPG; Hardware

DOI:

Not available

Abstract:

In this paper, we study the multi-agent pursuit–evasion problem and present an extension of the Multi-Agent Deep Deterministic Policy Gradient (MADDPG) deep reinforcement learning algorithm. Previous pursuit–evasion work with MADDPG has focused on training capture strategies that depend on environmental features to restrict evader movement. We demonstrate a method for training pursuer agents to collaboratively surround and encircle an evader for reliable capture, without a strategy rooted in environmental entrapment (i.e., cornering). Our method uses a novel two-stage, variable-aggression, continuous reward function based on geometric inscribed circles (incircles), along with a corresponding observation space, with agents operating in an entrapment-disadvantaged environment. Our results show reliable capture of an intelligent, superior evader by three trained pursuers in open space using our encircling strategy. A key novelty of our work is demonstrating that behaviors learned with deep reinforcement learning can be transferred from a simulated robotic system with imperfect world assumptions to real-world robotic agents.
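The abstract names an incircle-based reward but does not state its formula. The snippet below is only a geometric sketch of the underlying quantity, not the authors' reward function: it computes the incircle of the triangle formed by three pursuers and checks whether an evader lies inside it. The function names `incircle` and `evader_in_incircle`, the 2D tuple representation, and the sample coordinates are all assumptions made for illustration.

```python
import math

def incircle(a, b, c):
    """Return ((cx, cy), radius) of the incircle of the 2D triangle a, b, c."""
    # Side lengths, each named after the vertex it lies opposite to.
    la = math.dist(b, c)
    lb = math.dist(a, c)
    lc = math.dist(a, b)
    perimeter = la + lb + lc
    # Triangle area from the 2D cross product of two edge vectors.
    cross = (b[0] - a[0]) * (c[1] - a[1]) - (b[1] - a[1]) * (c[0] - a[0])
    area = abs(cross) / 2.0
    if perimeter == 0.0 or area == 0.0:
        raise ValueError("pursuers are collinear or coincident; no incircle")
    # The incenter is the side-length-weighted average of the vertices.
    cx = (la * a[0] + lb * b[0] + lc * c[0]) / perimeter
    cy = (la * a[1] + lb * b[1] + lc * c[1]) / perimeter
    # Inradius r = area / semiperimeter.
    radius = 2.0 * area / perimeter
    return (cx, cy), radius

def evader_in_incircle(evader, pursuers):
    """True if the evader lies inside (or on) the pursuers' incircle."""
    center, radius = incircle(*pursuers)
    return math.dist(evader, center) <= radius

# Hypothetical pursuer positions forming a 3-4-5 right triangle.
pursuers = [(0.0, 0.0), (4.0, 0.0), (0.0, 3.0)]
center, radius = incircle(*pursuers)
```

For this 3-4-5 triangle the incenter is (1, 1) and the inradius is 1, which is a convenient hand check. How such a quantity would enter a two-stage, variable-aggression reward is specified in the paper itself and is not reproduced here.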

Pages: 355-364

Page count: 9

50 entries in total

[1] Crafting a robotic swarm pursuit-evasion capture strategy using deep reinforcement learning
Wu, Charles H.
Sofge, Donald A.
Lofaro, Daniel M.
[J]. ARTIFICIAL LIFE AND ROBOTICS, 2022, 27 (02) : 355 - 364
[2] Orbital Interception Pursuit Strategy for Random Evasion Using Deep Reinforcement Learning
Jiang, Rui
Ye, Dong
Xiao, Yan
Sun, Zhaowei
Zhang, Zeming
[J]. SPACE: SCIENCE & TECHNOLOGY, 2023, 3
[3] Pursuit and Evasion Strategy of a Differential Game Based on Deep Reinforcement Learning
Xu, Can
Zhang, Yin
Wang, Weigang
Dong, Ligang
[J]. FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, 2022, 10
[4] Pursuit-evasion with Decentralized Robotic Swarm in Continuous State Space and Action Space via Deep Reinforcement Learning
Singh, Gurpreet
Lofaro, Daniel M.
Sofge, Donald
[J]. ICAART: PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 1, 2020, : 226 - 233
[5] Near-optimal interception strategy for orbital pursuit-evasion using deep reinforcement learning
Zhang, Jingrui
Zhang, Kunpeng
Zhang, Yao
Shi, Heng
Tang, Liang
Li, Mou
[J]. ACTA ASTRONAUTICA, 2022, 198 : 9 - 25
[6] Intelligent Maneuver Strategy for a Hypersonic Pursuit-Evasion Game Based on Deep Reinforcement Learning
Guo, Yunhe
Jiang, Zijian
Huang, Hanqiao
Fan, Hongjia
Weng, Weiye
[J]. AEROSPACE, 2023, 10 (09)
[7] Learning Evasion Strategy in Pursuit-Evasion by Deep Q-network
Zhu, Jiagang
Zou, Wei
Zhu, Zheng
[J]. 2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 67 - 72
[8] A Deep Reinforcement Learning Approach for the Pursuit Evasion Game in the Presence of Obstacles
Qi, Qi
Zhang, Xuebo
Guo, Xian
[J]. 2020 IEEE INTERNATIONAL CONFERENCE ON REAL-TIME COMPUTING AND ROBOTICS (IEEE-RCAR 2020), 2020, : 68 - 73
[9] Generating collective foraging behavior for robotic swarm using deep reinforcement learning
Boyin Jin
Yupeng Liang
Ziyao Han
Kazuhiro Ohkura
[J]. Artificial Life and Robotics, 2020, 25 : 588 - 595
[10] Generating collective foraging behavior for robotic swarm using deep reinforcement learning
Jin, Boyin
Liang, Yupeng
Han, Ziyao
Ohkura, Kazuhiro
[J]. ARTIFICIAL LIFE AND ROBOTICS, 2020, 25 (04) : 588 - 595