Safe Learning for Uncertainty-Aware Planning via Interval MDP Abstraction

Cited by: 9

Authors:

Jiang, Jesse [1]

Zhao, Ye [2]

Coogan, Samuel [1,3]

Affiliations:

[1] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA 30332 USA

[2] Georgia Inst Technol, Sch Mech Engn, Atlanta, GA 30332 USA

[3] Georgia Inst Technol, Sch Civil & Environm Engn, Atlanta, GA 30332 USA

Source:

IEEE CONTROL SYSTEMS LETTERS | 2022 / Vol. 6

Funding:

U.S. National Science Foundation;

Keywords:

Uncertainty; Stochastic systems; Gaussian processes; Planning; Markov processes; Automata; Process control; hybrid systems; Gaussian process learning;

DOI:

10.1109/LCSYS.2022.3173993

Chinese Library Classification:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

We study the problem of refining satisfiability bounds for partially-known stochastic systems against planning specifications defined using syntactically co-safe Linear Temporal Logic (scLTL). We propose an abstraction-based approach that iteratively generates high-confidence Interval Markov Decision Process (IMDP) abstractions of the system from high-confidence bounds on the unknown component of the dynamics obtained via Gaussian process regression. In particular, we develop a synthesis strategy to sample the unknown dynamics by finding paths which avoid specification-violating states using a product IMDP. We further provide a heuristic to choose among various candidate paths to maximize the information gain. Finally, we propose an iterative algorithm to synthesize a satisfying control policy for the product IMDP system. We demonstrate our work with a case study on mobile robot navigation.
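
As an illustration of the kind of computation an IMDP abstraction supports, the following is a minimal sketch of pessimistic (lower-bound) value iteration for the probability of reaching a set of goal states. It is the standard interval value iteration technique, not necessarily the authors' algorithm; the function names, the three-state example, and the interval values are all hypothetical.

```python
def worst_case_expectation(lower, upper, values):
    """Minimize sum_j p[j] * values[j] over p with lower <= p <= upper, sum(p) = 1.

    Assumes the intervals are consistent: sum(lower) <= 1 <= sum(upper).
    """
    p = list(lower)
    remaining = 1.0 - sum(lower)
    # Push the leftover probability mass onto the lowest-value successors first.
    for j in sorted(range(len(values)), key=lambda k: values[k]):
        add = min(upper[j] - lower[j], remaining)
        p[j] += add
        remaining -= add
    return sum(pj * vj for pj, vj in zip(p, values))


def pessimistic_reachability(lower, upper, goal, iterations=100):
    """Lower bound on the maximal probability of reaching `goal`.

    lower[s][a][j] and upper[s][a][j] bound P(next = j | state = s, action = a).
    """
    n = len(lower)
    v = [1.0 if s in goal else 0.0 for s in range(n)]
    for _ in range(iterations):
        v = [
            1.0 if s in goal else max(
                worst_case_expectation(lower[s][a], upper[s][a], v)
                for a in range(len(lower[s]))
            )
            for s in range(n)
        ]
    return v


# Hypothetical 3-state example: 0 = start, 1 = goal, 2 = absorbing unsafe state.
lower = [
    [[0.0, 0.6, 0.2]],  # state 0, single action
    [[0.0, 1.0, 0.0]],  # state 1 (goal), self-loop
    [[0.0, 0.0, 1.0]],  # state 2 (unsafe), self-loop
]
upper = [
    [[0.0, 0.8, 0.4]],
    [[0.0, 1.0, 0.0]],
    [[0.0, 0.0, 1.0]],
]
bounds = pessimistic_reachability(lower, upper, goal={1})
```

In this toy example the adversary assigns as much mass as the intervals allow to the unsafe state, so the guaranteed reachability probability from state 0 equals the lower transition bound to the goal. Tighter intervals, such as those obtained from more data on the unknown dynamics, would raise this guaranteed value.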

Pages: 2641-2646

Page count: 6

50 items in total

[1] Abstraction-Based Planning for Uncertainty-Aware Legged Navigation
Jiang, Jesse
Coogan, Samuel
Zhao, Ye
IEEE OPEN JOURNAL OF CONTROL SYSTEMS, 2023, 2 : 221 - 234
[2] Uncertainty-Aware Policy Sampling and Mixing for Safe Interactive Imitation Learning
Diaz, Manfred
Fevens, Thomas
Paull, Liam
2021 18TH CONFERENCE ON ROBOTS AND VISION (CRV 2021), 2021, : 72 - 78
[3] Uncertainty-Aware Pedestrian Crossing Prediction via Reinforcement Learning
Dai, Siyang
Liu, Jun
Cheung, Ngai-Man
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (10) : 9540 - 9549
[4] Safe Model-Based Reinforcement Learning With an Uncertainty-Aware Reachability Certificate
Yu, Dongjie
Zou, Wenjun
Yang, Yujie
Ma, Haitong
Li, Shengbo Eben
Yin, Yuming
Chen, Jianyu
Duan, Jingliang
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 21 (03) : 4129 - 4142
[5] Uncertainty-Aware Reinforcement Learning for Safe Control of Autonomous Vehicles in Signalized Intersections
Emamifar, Mehrnoosh
Ghoreishi, Seyede Fatemeh
2023 IEEE CONFERENCE ON ARTIFICIAL INTELLIGENCE, CAI, 2023, : 81 - 82
[6] Uncertainty-Aware Contact-Safe Model-Based Reinforcement Learning
Kuo, Cheng-Yu
Schaarschmidt, Andreas
Cui, Yunduan
Asfour, Tamim
Matsubara, Takamitsu
IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02) : 3918 - 3925
[7] Innovative Applications of Unsupervised Learning in Uncertainty-Aware Pharmaceutical Supply Chain Planning
Kochakkashani, Farid
Kayvanfar, Vahid
Baldacci, Roberto
IEEE ACCESS, 2024, 12 : 107984 - 107999
[8] Robustness via Uncertainty-aware Cycle Consistency
Upadhyay, Uddeshya
Chen, Yanbei
Akata, Zeynep
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[9] Uncertainty-aware automated machine learning toolbox
Dorst, Tanja
Schneider, Tizian
Eichstaedt, Sascha
Schuetze, Andreas
TM-TECHNISCHES MESSEN, 2023, 90 (03) : 141 - 153
[10] Uncertainty-Aware Reinforcement Learning for Portfolio Optimization
Enkhsaikhan, Bayaraa
Jo, Ohyun
IEEE ACCESS, 2024, 12 : 166553 - 166563