Constrained Coverage of Unknown Environment Using Safe Reinforcement Learning

被引:1
|
作者
Zhang, Yunlin [1 ]
You, Junjie [1 ]
Shi, Lei [2 ]
Shao, Jinliang [1 ,3 ]
Zheng, Wei Xing [4 ]
机构
[1] Univ Elect Sci & Technol China, Sch Automat Engn, Chengdu 611731, Peoples R China
[2] Henan Univ, Sch Artificial Intelligence, Zhengzhou 450046, Peoples R China
[3] Lab Electromagnet Space Cognit & Intelligent Cont, Beijing 100089, Peoples R China
[4] Western Sydney Univ, Sch Comp Data & Math Sci, Sydney, NSW 2751, Australia
基金
中国国家自然科学基金;
关键词
D O I
10.1109/CDC49753.2023.10383702
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Achieving a connected, collision-free and timeefficient coverage in unknown environments is challenging for multi-agent systems. Particularly, agents with second-order dynamics are supposed to efficiently search and reach the optimal deployment positions over targets whose distribution is unknown, while preserving the distributed connectivity and avoiding collision. In this paper, a safe reinforcement learning based shield method is proposed for unknown environment exploration while correcting actions of agents for safety guarantee and avoiding invalid samples into policy updating. The shield is achieved distributively by a control barrier function and its validity is proved in theory. Moreover, policies of the optimal coverage are centrally learned via reward engineering and executed distributively. Numerical results show that the proposed approach not only achieves zero safety violations during training, but also speeds up the convergence of learning.
引用
收藏
页码:3415 / 3420
页数:6
相关论文
共 50 条
  • [1] Trajectory Optimization for Unknown Constrained Systems using Reinforcement Learning
    Ota, Kei
    Jha, Devesh K.
    Oiki, Tomoaki
    Miura, Mamoru
    Nammoto, Takashi
    Nikovski, Daniel
    Mariyama, Toshisada
    2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 3487 - 3494
  • [2] Safe Robot Navigation Using Constrained Hierarchical Reinforcement Learning
    Roza, Felippe Schmoeller
    Rasheed, Hassan
    Roscher, Karsten
    Ning, Xiangyu
    Guennemann, Stephan
    2022 21ST IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, ICMLA, 2022, : 737 - 742
  • [3] Learning to grasp in unknown environment by reinforcement learning and shaping
    Rezzoug, N.
    Gorce, P.
    Abellard, A.
    Ben Khelifa, M.
    Abellard, P.
    2006 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-6, PROCEEDINGS, 2006, : 4487 - +
  • [4] Constrained Variational Policy Optimization for Safe Reinforcement Learning
    Liu, Zuxin
    Cen, Zhepeng
    Isenbaev, Vladislav
    Liu, Wei
    Wu, Zhiwei Steven
    Li, Bo
    Zhao, Ding
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [5] Constrained Visual Representation Learning With Bisimulation Metrics for Safe Reinforcement Learning
    Wang, Rongrong
    Cheng, Yuhu
    Wang, Xuesong
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2025, 34 : 379 - 393
  • [6] An Iterative Online Approach to Safe Learning in Unknown Constrained Environments
    Minh Vu
    Zeng, Shen
    2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 7330 - 7335
  • [7] Robot Position/Force Control in Unknown Environment Using Hybrid Reinforcement Learning
    Perrusquia, Adolfo
    Yu Wen
    CYBERNETICS AND SYSTEMS, 2020, 51 (04) : 542 - 560
  • [8] Joint Synthesis of Safety Certificate and Safe Control Policy using Constrained Reinforcement Learning
    Ma, Haitong
    Liu, Changliu
    Li, Shengbo Eben
    Zheng, Sifa
    Chen, Jianyu
    LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 168, 2022, 168
  • [9] A reinforcement learning approach for robot control in an unknown environment
    Xiao, NF
    Nahavandi, S
    IEEE ICIT' 02: 2002 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY, VOLS I AND II, PROCEEDINGS, 2002, : 1096 - 1099
  • [10] Constrained Cross-Entropy Method for Safe Reinforcement Learning
    Wen, Min
    Topcu, Ufuk
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2021, 66 (07) : 3123 - 3137