A deep reinforcement learning approach for multi-agent mobile robot patrolling

Cited by: 8

Authors:

Jana, Meghdeep [1]

Vachhani, Leena [1]

Sinha, Arpita [1]

Affiliations:

[1] Indian Inst Technol, Autonomous Robots & Multiagent Syst Lab, Syst & Control Engn, Mumbai, Maharashtra, India

Source:

INTERNATIONAL JOURNAL OF INTELLIGENT ROBOTICS AND APPLICATIONS | 2022 / Vol. 6 / Issue 04

Keywords:

Multi-agent; patrolling; Markov decision process; Reinforcement learning; Deep learning;

DOI:

10.1007/s41315-022-00235-1

Chinese Library Classification (CLC) number:

TP24 [Robotics];

Discipline classification code:

080202 ; 1405 ;

Abstract:

Patrolling strategies primarily deal with minimising the time taken to visit specific locations and cover an area. The use of intelligent agents in patrolling has become beneficial for automation and for analysing patrolling patterns. However, practical scenarios demand that these strategies be adaptive to various conditions and robust against adversaries. Traditional Q-learning based patrolling keeps track of all possible states and actions in a Q-table, making it susceptible to the curse of dimensionality. For multi-agent patrolling to be adaptive in various scenarios represented using graphs, we propose a formulation of the Markov Decision Process (MDP) with state representations that can be utilised by Deep Reinforcement Learning (DRL) approaches such as Deep Q-Networks (DQN). The implemented DQN can estimate the MDP using a finite-length state vector and is trained with a novel reward function. The proposed state-space representation is independent of the number of nodes in the graph, thereby addressing scalability to graph dimensions. We also propose a reward function that penalises the agents for lack of global coordination while providing immediate local feedback on their actions. As independent policy learners subject to the MDP and reward function, the DRL agents formed a collaborative patrolling strategy. The policies learned by the agents generalise and adapt to multiple behaviours without explicit training or design to do so. We provide empirical analysis that shows the strategy's adaptive capabilities under changes in agents' positions, non-uniform node visit frequency requirements, changes in the graph structure representing the environment, and induced randomness in the trajectories. DRL patrolling proves to be a promising patrolling strategy for intelligent agents by potentially being scalable, adaptive, and robust against adversaries.
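The contrast the abstract draws between a Q-table and a finite-length state vector can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and is not taken from the paper: the function names `tabular_state_count` and `local_state_vector`, the discretisation of idleness into levels, and the choice of "own node plus padded neighbour idleness" as the state encoding. The paper's actual state representation may differ.

```python
from typing import Dict, List


def tabular_state_count(num_nodes: int, num_agents: int, idleness_levels: int) -> int:
    """Number of joint states a Q-table would need if it enumerated every
    agent position and every node's discretised idleness level.
    (Illustrative assumption, not the paper's formulation.)"""
    return (num_nodes ** num_agents) * (idleness_levels ** num_nodes)


def local_state_vector(
    graph: Dict[int, List[int]],
    idleness: Dict[int, float],
    node: int,
    max_degree: int,
) -> List[float]:
    """Hypothetical fixed-length state: idleness of the current node followed
    by the idleness of at most max_degree neighbours, zero-padded.
    The length is 1 + max_degree regardless of how many nodes the graph has."""
    neighbours = sorted(graph[node])[:max_degree]
    values = [float(idleness[node])] + [float(idleness[n]) for n in neighbours]
    padding = [0.0] * (1 + max_degree - len(values))
    return values + padding


if __name__ == "__main__":
    # Table size grows exponentially with the number of nodes.
    for n in (5, 10, 20):
        print(n, tabular_state_count(n, num_agents=2, idleness_levels=5))

    # The vector length stays fixed as the graph changes.
    small_graph = {0: [1, 2], 1: [0], 2: [0]}
    small_idleness = {0: 0.0, 1: 3.0, 2: 5.0}
    print(local_state_vector(small_graph, small_idleness, node=0, max_degree=4))
```

Under these assumptions the table size is exponential in the node count, while the vector fed to a network keeps the same length on any graph whose degree does not exceed `max_degree`, which is the scalability property the abstract claims for its own representation.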


Pages: 724 - 745

Number of pages: 22

50 records in total

[1] A deep reinforcement learning approach for multi-agent mobile robot patrolling
Meghdeep Jana
Leena Vachhani
Arpita Sinha
[J]. International Journal of Intelligent Robotics and Applications, 2022, 6 : 724 - 745
[2] Decoupling Patrolling Tasks for Water Quality Monitoring: A Multi-Agent Deep Reinforcement Learning Approach
Diop, Dame Seck
Luis, Samuel Yanes
Esteve, Manuel Perales
Marin, Sergio L. Toral
Reina, Daniel Gutierrez
[J]. IEEE ACCESS, 2024, 12 : 75559 - 75576
[3] A multi-agent reinforcement learning approach to robot soccer
Yong Duan
Bao Xia Cui
Xin He Xu
[J]. Artificial Intelligence Review, 2012, 38 : 193 - 211
[4] A multi-agent reinforcement learning approach to robot soccer
Duan, Yong
Cui, Bao Xia
Xu, Xin He
[J]. ARTIFICIAL INTELLIGENCE REVIEW, 2012, 38 (03) : 193 - 211
[5] Robust Multi-agent Patrolling Strategies Using Reinforcement Learning
Lauri, Fabrice
Koukam, Abderrafiaa
[J]. SWARM INTELLIGENCE BASED OPTIMIZATION (ICSIBO 2014), 2014, 8472 : 157 - 165
[6] Multi-Agent Deep Reinforcement Learning for Multi-Robot Applications: A Survey
Orr, James
Dutta, Ayan
[J]. SENSORS, 2023, 23 (07)
[7] Multi-Agent/Robot Deep Reinforcement Learning with Macro-Actions
Xiao, Yuchen
Hoffman, Joshua
Xia, Tian
Amato, Christopher
[J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 13965 - 13966
[8] Multi-Agent Deep Reinforcement Learning for Coordinated Multipoint in Mobile Networks
Schneider, Stefan
Karl, Holger
Khalili, Ramin
Hecker, Artur
[J]. IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2024, 21 (01) : 908 - 924
[9] A HYBRID APPROACH BASED ON MULTI-AGENT GEOSIMULATION AND REINFORCEMENT LEARNING TO SOLVE A UAV PATROLLING PROBLEM
Perron, Jimmy
Hogan, Jimmy
Moulin, Bernard
Berger, Jean
Belanger, Micheline
[J]. 2008 WINTER SIMULATION CONFERENCE, VOLS 1-5, 2008, : 1259 - +
[10] Distributed multi-agent deep reinforcement learning for cooperative multi-robot pursuit
Yu, Chao
Dong, Yinzhao
Li, Yangning
Chen, Yatong
[J]. JOURNAL OF ENGINEERING-JOE, 2020, 2020 (13) : 499 - 504
