Decentralized, Safe, Multiagent Motion Planning for Drones Under Uncertainty via Filtered Reinforcement Learning

被引:0
|
作者
Vinod, Abraham P. [1 ]
Safaoui, Sleiman [2 ]
Summers, Tyler H. [2 ]
Yoshikawa, Nobuyuki [3 ]
Di Cairano, Stefano [1 ]
机构
[1] Mitsubishi Elect Res Labs, Cambridge, MA 02139 USA
[2] Univ Texas Dallas, Control Optimizat & Networks Lab CONLab, Richardson, TX 75080 USA
[3] Mitsubishi Electr Corp, Chiyoda Ku, Tokyo 1008310, Japan
关键词
Safety; Planning; Vectors; Uncertainty; Trajectory; Stochastic processes; Dynamics; Collision avoidance; constrained control under uncertainty; decentralized model predictive control (MPC); multiagent systems; reinforcement learning (RL); safe learning-based control; MODEL PREDICTIVE CONTROL;
D O I
10.1109/TCST.2024.3433229
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose a decentralized, multiagent motion planner that guarantees the probabilistic safety of a team subject to stochastic uncertainty in the agent model and environment. Our scalable approach generates safe motion plans in real-time using off-the-shelf, single-agent reinforcement learning (RL) rendered safe using distributionally robust, convex optimization and buffered Voronoi cells. We guarantee the recursive feasibility of the mean trajectories and mitigate the conservativeness using a temporal discounting of safety. We show in simulation that our approach generates safe and high-performant trajectories as compared to existing approaches, and further validate these observations in physical experiments using drones.
引用
收藏
页码:2492 / 2499
页数:8
相关论文
共 50 条
  • [31] Detecting Communities in Networks: a Decentralized Approach Based on Multiagent Reinforcement Learning
    Paim, Eduardo C.
    Bazzan, Ana L. C.
    Chira, Camelia
    2020 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2020, : 2225 - 2232
  • [32] Mean-Field Multiagent Reinforcement Learning: A Decentralized Network Approach
    Gu, Haotian
    Guo, Xin
    Wei, Xiaoli
    Xu, Renyuan
    MATHEMATICS OF OPERATIONS RESEARCH, 2025, 50 (01) : 506 - 536
  • [33] Distributed Learning for Planning Under Uncertainty Problems with Heterogeneous TeamsScaling Up the Multiagent Planning with Distributed Learning and Approximate Representations
    N. Kemal Ure
    Girish Chowdhary
    Yu Fan Chen
    Jonathan P. How
    John Vian
    Journal of Intelligent & Robotic Systems, 2014, 74 : 529 - 544
  • [34] Decentralized Multiagent Reinforcement Learning for Efficient Robotic Control by Coordination Graphs
    Yu, Chao
    Wang, Dongxu
    Ren, Jiankang
    Ge, Hongwei
    Sun, Liang
    PRICAI 2018: TRENDS IN ARTIFICIAL INTELLIGENCE, PT I, 2018, 11012 : 191 - 203
  • [35] REFORMA: Robust REinFORceMent Learning via Adaptive Adversary for Drones Flying under Disturbances
    Hsu, Hao-Lun
    Meng, Haocheng
    Luo, Shaocheng
    Dong, Juncheng
    Tarokh, Vahid
    Pajic, Miroslav
    2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2024, 2024, : 5169 - 5175
  • [36] Safe Reinforcement Learning via Shielding
    Alshiekh, Mohammed
    Bloem, Roderick
    Ehlers, Ruediger
    Koenighofer, Bettina
    Niekum, Scott
    Topcu, Ufuk
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 2669 - 2678
  • [37] Safe Learning for Uncertainty-Aware Planning via Interval MDP Abstraction
    Jiang, Jesse
    Zhao, Ye
    Coogan, Samuel
    IEEE CONTROL SYSTEMS LETTERS, 2022, 6 : 2641 - 2646
  • [38] Decentralized reinforcement learning control and emergence of motion patterns
    Svinin, M
    Yamada, K
    Okhura, K
    Ueda, K
    SENSOR FUSION AND DECENTRALIZED CONTROL IN ROBOTIC SYSTEMS, 1998, 3523 : 223 - 234
  • [39] Planning for potential: efficient safe reinforcement learning
    den Hengst, Floris
    Francois-Lavet, Vincent
    Hoogendoorn, Mark
    van Harmelen, Frank
    MACHINE LEARNING, 2022, 111 (06) : 2255 - 2274
  • [40] Planning for potential: efficient safe reinforcement learning
    Floris den Hengst
    Vincent François-Lavet
    Mark Hoogendoorn
    Frank van Harmelen
    Machine Learning, 2022, 111 : 2255 - 2274