Balancing Efficiency and Unpredictability in Multi-robot Patrolling: A MARL-Based Approach

被引:1
|
作者
Guo, Lingxiao [1 ]
Pan, Haoxuan [2 ]
Duan, Xiaoming [2 ]
He, Jianping [2 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Civil Engn, Shanghai 200240, Peoples R China
[2] Shanghai Jiao Tong Univ, Dept Automat, Shanghai 200240, Peoples R China
关键词
D O I
10.1109/ICRA48891.2023.10160923
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Patrolling with multiple robots is a challenging task. While the robots collaboratively and repeatedly cover the regions of interest in the environment, their routes should satisfy two often conflicting properties: i) (efficiency) the time intervals between two consecutive visits to the regions are small; ii) (unpredictability) the patrolling trajectories are random and unpredictable. We manage to strike a balance between the two goals by i) recasting the original patrolling problem as a Graph Deep Learning problem; ii) directly solving this problem on the graph in the framework of cooperative multi-agent reinforcement learning. Treating the decisions of a team of agents as a sequence input, our model outputs the agents' actions in order by an autoregressive mechanism. Extensive simulation studies show that our approach has comparable performance with existing algorithms in terms of efficiency and outperforms them in terms of unpredictability. To our knowledge, this is the first work that successfully solves the patrolling problem with reinforcement learning on a graph.
引用
收藏
页码:3504 / 3509
页数:6
相关论文
共 50 条
  • [1] A new approach to multi-robot harbour patrolling: theory and experiments
    Marino, Alessandro
    Antonelli, Gianluca
    Aguiar, A. Pedro
    Pascoal, Antonio
    [J]. 2012 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2012, : 1760 - 1765
  • [2] A Survey on Multi-robot Patrolling Algorithms
    Portugal, David
    Rocha, Rui
    [J]. TECHNOLOGICAL INNOVATION FOR SUSTAINABILITY, 2011, 349 : 139 - 146
  • [3] Trust Modeling in Multi-Robot Patrolling
    Pippin, Charles
    Christensen, Henrik
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2014, : 59 - 66
  • [4] A MARL-Based Approach for Easing MAS Organization Engineering
    Soule, Julien
    Jamont, Jean-Paul
    Occello, Michel
    Traonouez, Louis-Marie
    Theron, Paul
    [J]. ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, PT IV, AIAI 2024, 2024, 714 : 321 - 334
  • [5] A Survey of Multi-robot Regular and Adversarial Patrolling
    Huang, Li
    Zhou, MengChu
    Hao, Kuangrong
    Hou, Edwin
    [J]. IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2019, 6 (04) : 894 - 903
  • [6] A Survey of Multi-robot Regular and Adversarial Patrolling
    Li Huang
    MengChu Zhou
    Kuangrong Hao
    Edwin Hou
    [J]. IEEE/CAA Journal of Automatica Sinica, 2019, 6 (04) : 894 - 903
  • [7] Stochastic Multi-Robot Patrolling with Limited Visibility
    Alam, Tauhidul
    Rahman, Md. Mahbubur
    Carrillo, Pedro
    Bobadilla, Leonardo
    Rapp, Brian
    [J]. JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2020, 97 (02) : 411 - 429
  • [8] Stochastic Multi-Robot Patrolling with Limited Visibility
    Tauhidul Alam
    Md. Mahbubur Rahman
    Pedro Carrillo
    Leonardo Bobadilla
    Brian Rapp
    [J]. Journal of Intelligent & Robotic Systems, 2020, 97 : 411 - 429
  • [9] Randomized Multi-Robot Patrolling with Unidirectional Visibility
    Echefu, Louis
    Alam, Tauhidul
    Newaz, Abdullah Al Redwan
    [J]. 2024 21ST INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS, UR 2024, 2024, : 324 - 329
  • [10] PatrolGRAPH: a Distributed Algorithm for Multi-Robot Patrolling
    Baglietto, M.
    Cannata, G.
    Capezio, F.
    Grosso, A.
    Sgorbissa, A.
    Zaccaria, R.
    [J]. IAS-10: INTELLIGENT AUTONOMOUS SYSTEMS 10, 2008, : 415 - 424