Decentralized, Safe, Multiagent Motion Planning for Drones Under Uncertainty via Filtered Reinforcement Learning

Citations: 0
Authors
Vinod, Abraham P. [1]
Safaoui, Sleiman [2]
Summers, Tyler H. [2]
Yoshikawa, Nobuyuki [3]
Di Cairano, Stefano [1]
Affiliations
[1] Mitsubishi Elect Res Labs, Cambridge, MA 02139 USA
[2] Univ Texas Dallas, Control Optimizat & Networks Lab CONLab, Richardson, TX 75080 USA
[3] Mitsubishi Electr Corp, Chiyoda Ku, Tokyo 1008310, Japan
Keywords
Safety; Planning; Vectors; Uncertainty; Trajectory; Stochastic processes; Dynamics; Collision avoidance; constrained control under uncertainty; decentralized model predictive control (MPC); multiagent systems; reinforcement learning (RL); safe learning-based control; model predictive control
DOI
10.1109/TCST.2024.3433229
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
We propose a decentralized, multiagent motion planner that guarantees the probabilistic safety of a team subject to stochastic uncertainty in the agent model and environment. Our scalable approach generates safe motion plans in real time using off-the-shelf, single-agent reinforcement learning (RL), rendered safe using distributionally robust convex optimization and buffered Voronoi cells. We guarantee the recursive feasibility of the mean trajectories and mitigate conservativeness by temporally discounting safety. We show in simulation that our approach generates safe, high-performing trajectories compared with existing approaches, and we further validate these observations in physical experiments using drones.
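For readers unfamiliar with the buffered Voronoi cells mentioned in the abstract, the following is a minimal sketch of the standard deterministic construction, written as half-space constraints. It is not the paper's distributionally robust formulation; the function names `buffered_voronoi_halfspaces` and `in_cell`, the example positions, and the safety radius are all hypothetical and chosen only for illustration.

```python
import numpy as np


def buffered_voronoi_halfspaces(positions, i, safety_radius):
    """Return (A, b) so that agent i's buffered Voronoi cell is {p : A @ p <= b}.

    Illustrative deterministic construction only: each neighbor j contributes
    the perpendicular bisector between agents i and j, shifted toward agent i
    by `safety_radius`.
    """
    p_i = positions[i]
    normals, offsets = [], []
    for j, p_j in enumerate(positions):
        if j == i:
            continue
        diff = p_j - p_i
        normal = diff / np.linalg.norm(diff)  # unit vector pointing from i to j
        midpoint = 0.5 * (p_i + p_j)
        normals.append(normal)
        # Shift the bisector toward agent i by the safety radius.
        offsets.append(normal @ midpoint - safety_radius)
    return np.array(normals), np.array(offsets)


def in_cell(A, b, point, tol=1e-12):
    """Check whether `point` satisfies every half-space constraint."""
    return bool(np.all(A @ point <= b + tol))


# Hypothetical three-agent layout in the plane.
positions = np.array([[0.0, 0.0], [4.0, 0.0], [0.0, 4.0]])
A, b = buffered_voronoi_halfspaces(positions, i=0, safety_radius=0.5)
```

If each agent keeps its position inside its own cell, any two agents remain at least twice the safety radius apart, which is why such cells can be used for decentralized collision avoidance. How the paper handles stochastic uncertainty on top of this is not described in this record.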
Pages: 2492-2499
Number of pages: 8
Related papers
50 in total
  • [41] Distributed learning for planning under uncertainty problems with heterogeneous teams: Scaling up the multiagent planning with distributed learning and approximate representations
    Ure, N. Kemal
    Chowdhary, Girish
    Chen, Yu Fan
    How, Jonathan P.
    Vian, John
    Journal of Intelligent and Robotic Systems: Theory and Applications, 2014, 74 (1-2) : 529 - 544
  • [42] Reinforcement Learning of Informed Initial Policies for Decentralized Planning
    Kraemer, Landon
    Banerjee, Bikramjit
    ACM TRANSACTIONS ON AUTONOMOUS AND ADAPTIVE SYSTEMS, 2015, 9 (04)
  • [43] Decentralized Safe Reactive Planning under TWTL Specifications
    Peterson, Ryan
    Buyukkocak, Ali Tevfik
    Aksaray, Derya
    Yazicioglu, Yasin
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 6599 - 6604
  • [44] Autonomous Exploration Under Uncertainty via Deep Reinforcement Learning on Graphs
    Chen, Fanfei
    Martin, John D.
    Huang, Yewei
    Wang, Jinkun
    Englot, Brendan
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 6140 - 6147
  • [45] Multiairport Departure Scheduling via Multiagent Reinforcement Learning
    Cai, Kaiquan
    Li, Ziqi
    Guo, Tong
    Du, Wenbo
    IEEE INTELLIGENT TRANSPORTATION SYSTEMS MAGAZINE, 2024, 16 (02) : 102 - 116
  • [46] Subgoal identification for reinforcement learning and planning in multiagent problem solving
    Chiu, Chung-Cheng
    Soo, Von-Wun
    MULTIAGENT SYSTEM TECHNOLOGIES, PROCEEDINGS, 2007, 4687 : 37 - +
  • [47] Hierarchical production control and distribution planning under retail uncertainty with reinforcement learning
    Deng, Yang
    Chow, Andy H. F.
    Yan, Yimo
    Su, Zicheng
    Zhou, Zhili
    Kuo, Yong-Hong
    INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2025,
  • [48] Hierarchical motion planning under uncertainty
    Chakravorty, S.
    Saha, R.
    PROCEEDINGS OF THE 46TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-14, 2007, : 5077 - 5082
  • [49] A Study on Cooperative Action Selection Considering Unfairness in Decentralized Multiagent Reinforcement Learning
    Matsui, Toshihiro
    Matsuo, Hiroshi
    ICAART: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 1, 2017, : 88 - 95
  • [50] Safe Adaptive Policy Transfer Reinforcement Learning for Distributed Multiagent Control
    Du, Bin
    Xie, Wei
    Li, Yang
    Yang, Qisong
    Zhang, Weidong
    Negenborn, Rudy R.
    Pang, Yusong
    Chen, Hongtian
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (01) : 1939 - 1946