A deep reinforcement learning approach for multi-agent mobile robot patrolling

被引:8
|
作者
Jana, Meghdeep [1 ]
Vachhani, Leena [1 ]
Sinha, Arpita [1 ]
机构
[1] Indian Inst Technol, Autonomous Robots & Multiagent Syst Lab, Syst & Control Engn, Mumbai, Maharashtra, India
关键词
Multi-agent; patrolling; Markov decision process; Reinforcement learning; Deep learning;
D O I
10.1007/s41315-022-00235-1
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Patrolling strategies primarily deal with minimising the time taken to visit specific locations and cover an area. The use of intelligent agents in patrolling has become beneficial in automation and analysing patterns in patrolling. However, practical scenarios demand these strategies to be adaptive in various conditions and robust against adversaries. Traditional Q-learning based patrolling keeps track of all possible states and actions in a Q-table, making them susceptible to the curse of dimensionality. For multi-agent patrolling to be adaptive in various scenarios represented using graphs, we propose a formulation of the Markov Decision Process (MDP) with state-representations that can be utilised for Deep Reinforcement Learning (DRL) approaches such as Deep Q-Networks (DQN). The implemented DQN can estimate the MDP using a finite length state vector trained with a novel reward function. Proposed state-space representation is independent of the number of nodes in the graph, thereby addressing scalability to graph dimensions. We also propose a reward function to penalise the agents for lack of global coordination while providing immediate local feedback on their actions. As independent policy learners subject to the MDP and reward function, the DRL agents formed a collaborative patrolling strategy. The policies learned by the agents generalise and adapt to multiple behaviours without explicit training or design to do so. We provide empirical analysis that shows the strategy's adaptive capabilities with changes in agents' position, non-uniform node visit frequency requirements, changes in a graph structure representing the environment, and induced randomness in the trajectories. DRL patrolling proves to be a promising patrolling strategy for intelligent agents by potentially being scalable, adaptive, and robust against adversaries.
引用
收藏
页码:724 / 745
页数:22
相关论文
共 50 条
  • [1] A deep reinforcement learning approach for multi-agent mobile robot patrolling
    Meghdeep Jana
    Leena Vachhani
    Arpita Sinha
    [J]. International Journal of Intelligent Robotics and Applications, 2022, 6 : 724 - 745
  • [2] Decoupling Patrolling Tasks for Water Quality Monitoring: A Multi-Agent Deep Reinforcement Learning Approach
    Diop, Dame Seck
    Luis, Samuel Yanes
    Esteve, Manuel Perales
    Marin, Sergio L. Toral
    Reina, Daniel Gutierrez
    [J]. IEEE ACCESS, 2024, 12 : 75559 - 75576
  • [3] A multi-agent reinforcement learning approach to robot soccer
    Yong Duan
    Bao Xia Cui
    Xin He Xu
    [J]. Artificial Intelligence Review, 2012, 38 : 193 - 211
  • [4] A multi-agent reinforcement learning approach to robot soccer
    Duan, Yong
    Cui, Bao Xia
    Xu, Xin He
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2012, 38 (03) : 193 - 211
  • [5] Robust Multi-agent Patrolling Strategies Using Reinforcement Learning
    Lauri, Fabrice
    Koukam, Abderrafiaa
    [J]. SWARM INTELLIGENCE BASED OPTIMIZATION (ICSIBO 2014), 2014, 8472 : 157 - 165
  • [6] Multi-Agent Deep Reinforcement Learning for Multi-Robot Applications: A Survey
    Orr, James
    Dutta, Ayan
    [J]. SENSORS, 2023, 23 (07)
  • [7] Multi-Agent/Robot Deep Reinforcement Learning with Macro-Actions
    Xiao, Yuchen
    Hoffman, Joshua
    Xia, Tian
    Amato, Christopher
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 13965 - 13966
  • [8] Multi-Agent Deep Reinforcement Learning for Coordinated Multipoint in Mobile Networks
    Schneider, Stefan
    Karl, Holger
    Khalili, Ramin
    Hecker, Artur
    [J]. IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2024, 21 (01): : 908 - 924
  • [9] A HYBRID APPROACH BASED ON MULTI-AGENT GEOSIMULATION AND REINFORCEMENT LEARNING TO SOLVE A UAV PATROLLING PROBLEM
    Perron, Jimmy
    Hogan, Jimmy
    Moulin, Bernard
    Berger, Jean
    Belanger, Micheline
    [J]. 2008 WINTER SIMULATION CONFERENCE, VOLS 1-5, 2008, : 1259 - +
  • [10] Distributed multi-agent deep reinforcement learning for cooperative multi-robot pursuit
    Yu, Chao
    Dong, Yinzhao
    Li, Yangning
    Chen, Yatong
    [J]. JOURNAL OF ENGINEERING-JOE, 2020, 2020 (13): : 499 - 504