Safe Learning for Uncertainty-Aware Planning via Interval MDP Abstraction

被引:9
|
作者
Jiang, Jesse [1 ]
Zhao, Ye [2 ]
Coogan, Samuel [1 ,3 ]
机构
[1] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA 30332 USA
[2] Georgia Inst Technol, Sch Mech Engn, Atlanta, GA 30332 USA
[3] Georgia Inst Technol, Sch Civil & Environm Engn, Atlanta, GA 30332 USA
来源
基金
美国国家科学基金会;
关键词
Uncertainty; Stochastic systems; Gaussian processes; Planning; Markov processes; Automata; Process control; hybrid systems; Gaussian process learning;
D O I
10.1109/LCSYS.2022.3173993
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We study the problem of refining satisfiability bounds for partially-known stochastic systems against planning specifications defined using syntactically co-safe Linear Temporal Logic (scLTL). We propose an abstraction-based approach that iteratively generates high-confidence Interval Markov Decision Process (IMDP) abstractions of the system from high-confidence bounds on the unknown component of the dynamics obtained via Gaussian process regression. In particular, we develop a synthesis strategy to sample the unknown dynamics by finding paths which avoid specification-violating states using a product IMDP. We further provide a heuristic to choose among various candidate paths to maximize the information gain. Finally, we propose an iterative algorithm to synthesize a satisfying control policy for the product IMDP system. We demonstrate our work with a case study on mobile robot navigation.
引用
收藏
页码:2641 / 2646
页数:6
相关论文
共 50 条
  • [1] Abstraction-Based Planning for Uncertainty-Aware Legged Navigation
    Jiang, Jesse
    Coogan, Samuel
    Zhao, Ye
    IEEE OPEN JOURNAL OF CONTROL SYSTEMS, 2023, 2 : 221 - 234
  • [2] Uncertainty-Aware Policy Sampling and Mixing for Safe Interactive Imitation Learning
    Diaz, Manfred
    Fevens, Thomas
    Paull, Liam
    2021 18TH CONFERENCE ON ROBOTS AND VISION (CRV 2021), 2021, : 72 - 78
  • [3] Uncertainty-Aware Pedestrian Crossing Prediction via Reinforcement Learning
    Dai, Siyang
    Liu, Jun
    Cheung, Ngai-Man
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (10) : 9540 - 9549
  • [4] Safe Model-Based Reinforcement Learning With an Uncertainty-Aware Reachability Certificate
    Yu, Dongjie
    Zou, Wenjun
    Yang, Yujie
    Ma, Haitong
    Li, Shengbo Eben
    Yin, Yuming
    Chen, Jianyu
    Duan, Jingliang
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 21 (03) : 4129 - 4142
  • [5] Uncertainty-Aware Reinforcement Learning for Safe Control of Autonomous Vehicles in Signalized Intersections
    Emamifar, Mehrnoosh
    Ghoreishi, Seyede Fatemeh
    2023 IEEE CONFERENCE ON ARTIFICIAL INTELLIGENCE, CAI, 2023, : 81 - 82
  • [6] Uncertainty-Aware Contact-Safe Model-Based Reinforcement Learning
    Kuo, Cheng-Yu
    Schaarschmidt, Andreas
    Cui, Yunduan
    Asfour, Tamim
    Matsubara, Takamitsu
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02) : 3918 - 3925
  • [7] Innovative Applications of Unsupervised Learning in Uncertainty-Aware Pharmaceutical Supply Chain Planning
    Kochakkashani, Farid
    Kayvanfar, Vahid
    Baldacci, Roberto
    IEEE ACCESS, 2024, 12 : 107984 - 107999
  • [8] Robustness via Uncertainty-aware Cycle Consistency
    Upadhyay, Uddeshya
    Chen, Yanbei
    Akata, Zeynep
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [9] Uncertainty-aware automated machine learning toolbox
    Dorst, Tanja
    Schneider, Tizian
    Eichstaedt, Sascha
    Schuetze, Andreas
    TM-TECHNISCHES MESSEN, 2023, 90 (03) : 141 - 153
  • [10] Uncertainty-Aware Reinforcement Learning for Portfolio Optimization
    Enkhsaikhan, Bayaraa
    Jo, Ohyun
    IEEE ACCESS, 2024, 12 : 166553 - 166563