Early Failure Detection of Deep End-to-End Control Policy by Reinforcement Learning

Cited by: 0

Authors:

Lee, Keuntaek [1]

Saigol, Kamil [2]

Theodorou, Evangelos A. [1]

Affiliations:

[1] Georgia Institute of Technology, Atlanta, GA 30332, USA

[2] Lyft Inc., San Francisco, CA, USA

Source:

2019 International Conference on Robotics and Automation (ICRA) | 2019

Keywords:

DOI:

10.1109/icra.2019.8794189

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

We propose the use of Bayesian networks, which provide both a mean value and an uncertainty estimate as output, to enhance the safety of learned control policies under circumstances in which a test-time input differs significantly from the training set. Our algorithm combines reinforcement learning and end-to-end imitation learning to simultaneously learn a control policy as well as a threshold over the predictive uncertainty of the learned model, with no hand-tuning required. When the uncertainty threshold is exceeded, corrective action, such as a return of control to the model predictive controller or human expert, is taken before the task fails. We validate our method on fully-observable and vision-based partially-observable systems in cart-pole and autonomous driving simulations, using deep convolutional Bayesian neural networks. We demonstrate that our method is robust to uncertainty resulting from varying system dynamics as well as from partial state observability.
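
The handover rule described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the sampled linear policies stand in for draws from a Bayesian network, the function names (`predictive_stats`, `gated_action`, `make_sampled_policies`) are hypothetical, and the threshold is a fixed illustrative constant, whereas the abstract states that the threshold is learned by reinforcement learning.

```python
import random
import statistics


def predictive_stats(sampled_policies, observation):
    """Mean and standard deviation of the actions proposed by the samples."""
    actions = [policy(observation) for policy in sampled_policies]
    return statistics.fmean(actions), statistics.pstdev(actions)


def gated_action(sampled_policies, expert, observation, threshold):
    """Use the learned mean action unless predictive uncertainty is too high."""
    mean, std = predictive_stats(sampled_policies, observation)
    if std > threshold:
        # Uncertainty threshold exceeded: return control to the expert.
        return expert(observation), "expert"
    return mean, "learned"


def make_sampled_policies(n, seed=0):
    """Hypothetical stand-in for Bayesian network samples: noisy linear gains."""
    rng = random.Random(seed)
    weights = [1.0 + rng.gauss(0.0, 0.05) for _ in range(n)]
    # Bind each weight as a default argument so every lambda keeps its own gain.
    return [lambda x, w=w: w * x for w in weights]


def expert(x):
    """Hypothetical expert controller (e.g. a model predictive controller)."""
    return 1.0 * x


policies = make_sampled_policies(50)
threshold = 0.2  # assumption: fixed here; learned by RL in the described method

near = gated_action(policies, expert, 0.5, threshold)   # small disagreement
far = gated_action(policies, expert, 20.0, threshold)   # large disagreement
print(near[1], far[1])
```

In this toy setup the disagreement between samples grows with the magnitude of the input, so the large input plays the role of a test-time input that differs significantly from the training set and triggers the handover.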


Pages: 8543-8549

Page count: 7
