Collision-aware Multi-robot Motion Coordination Deep-RL with Dynamic Priority Strategy

被引:3
|
作者
Zhao, Liang [1 ]
Han, Sheng [1 ,2 ]
Lin, Youfang [1 ,2 ]
机构
[1] Beijing Jiaotong Univ, Sch Comp & Informat Technol, Beijing Key Lab Traff Data Anal & Min, Beijing, Peoples R China
[2] CAAC, Key Lab Intelligent Passenger Serv Civil Aviat, Beijing, Peoples R China
关键词
motion coordination; multi-robot system (MRS); deep reinforcement learning (DRL); LEVEL;
D O I
10.1109/ICTAI52525.2021.00017
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The motion coordination of multi-robots is the basis of the multi-robot system (MRS). The motion coordination problem of multi-robot satisfies the Markov property, so deep reinforcement learning (DRL) can be used to solve this problem. There are two limitations in the existing researches which apply DRL to solve this problem: the success rate of training is low and the effect of coordination is poor. We believe that in the training process, the agent should learn the collision constraint relationship from the collision. Therefore, we propose a Partially Tolerant Collision (PTC) collision handling strategy. And we believe that the completion time of motion coordination is related to the remaining distance of robots. Therefore, we propose the Dynamic Priority Strategy (DPS), which sets the priority for the robot based on the remaining distance of robots. This strategy is integrated into the reward setting of DRL. We use Path Checkerboard Diagram (PCD) as the basis for training and simulation. By experimenting with algorithms such as DQN, DDQN, and MLDDQN, the results of our proposed model are better than previous studies.
引用
下载
收藏
页码:65 / 72
页数:8
相关论文
共 50 条
  • [21] A Multi-threads Architecture for the Motion Coordination of a Heterogeneous Multi-Robot System
    Marchese, Fabio M.
    MICAI 2007: SIXTH MEXICAN INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2008, : 418 - 428
  • [22] Non-communicative multi-robot coordination in dynamic environments
    Kok, JR
    Spaan, MTJ
    Vlassis, N
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2005, 50 (2-3) : 99 - 114
  • [23] IMMUNE SYSTEM-INSPIRED DYNAMIC MULTI-ROBOT COORDINATION
    Khan, Muhammad Tahir
    de Silva, Clarence
    PROCEEDINGS OF THE ASME INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, VOL 3, 2010, : 37 - 43
  • [24] Switching formation strategy with the directed dynamic topology for collision avoidance of a multi-robot system in uncertain environments
    Vu Phi Tran
    Garratt, Matthew A.
    Petersen, Ian R.
    IET CONTROL THEORY AND APPLICATIONS, 2020, 14 (18): : 2948 - 2959
  • [25] Learning Interaction-Aware Trajectory Predictions for Decentralized Multi-Robot Motion Planning in Dynamic Environments
    Zhu, Hai
    Claramunt, Francisco Martinez
    Brito, Bruno
    Alonso-Mora, Javier
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02): : 2256 - 2263
  • [26] Multi-Robot Environmental Coverage With a Two-Stage Coordination Strategy via Deep Reinforcement Learning
    Zhu, Lina
    Cheng, Jiyu
    Zhang, Hao
    Zhang, Wei
    Liu, Yuehu
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (06) : 5022 - 5033
  • [27] Abstraction-based Motion Coordination Control for Multi-Robot Systems
    Pan, Zhuo-Rui
    Ren, Wei
    Sun, Xi-Ming
    2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 6966 - 6971
  • [28] A switching formation strategy for obstacle avoidance of a multi-robot system based on robot priority model
    Dai, Yanyan
    Kim, YoonGu
    Wee, SungGil
    Lee, DongHa
    Lee, SukGyu
    ISA TRANSACTIONS, 2015, 56 : 123 - 134
  • [29] DECENTRALIZED MULTI-ROBOT MOTION PLANNING APPLICABLE TO DYNAMIC ENVIRONMENT
    Wu, Bin
    Suh, C. Steve
    PROCEEDINGS OF THE ASME INTERNATIONAL MECHANICAL ENGINEERING CONGRESS AND EXPOSITION, 2019, VOL 4, 2020,
  • [30] Lazy Auctions for Multi-robot Collision Avoidance and Motion Control under Uncertainty
    Calliess, Jan-P.
    Lyons, Daniel
    Hanebeck, Uwe D.
    ADVANCED AGENT TECHNOLOGY, 2012, 7068 : 295 - +