Early Failure Detection of Deep End-to-End Control Policy by Reinforcement Learning

被引:0
|
作者
Lee, Keuntaek [1 ]
Saigol, Kamil [2 ]
Theodorou, Evangelos A. [1 ]
机构
[1] Georgia Inst Technol, Atlanta, GA 30332 USA
[2] Lyft Inc, San Francisco, CA USA
关键词
D O I
10.1109/icra.2019.8794189
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose the use of Bayesian networks, which provide both a mean value and an uncertainty estimate as output, to enhance the safety of learned control policies under circumstances in which a test-time input differs significantly from the training set. Our algorithm combines reinforcement learning and end-to-end imitation learning to simultaneously learn a control policy as well as a threshold over the predictive uncertainty of the learned model, with no hand-tuning required. Corrective action, such as a return of control to the model predictive controller or human expert, is taken before the failure of tasks, when the uncertainty threshold is exceeded. We validate our method on fully-observable and vision-based partially-observable systems using cart-pole and autonomous driving simulations using deep convolutional Bayesian neural networks. We demonstrate that our method is robust to uncertainty resulting from varying system dynamics as well as from partial state observability.
引用
收藏
页码:8543 / 8549
页数:7
相关论文
共 50 条
  • [1] End-to-End Deep Reinforcement Learning for Exoskeleton Control
    Rose, Lowell
    Bazzocchi, Michael C. F.
    Nejat, Goldie
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2020, : 4294 - 4301
  • [2] End-to-end Control of Kart Agent with Deep Reinforcement Learning
    Zhang Ruiming
    Liu Chengju
    Chen Qijun
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), 2018, : 1688 - 1693
  • [3] Traffic Signal Control Using End-to-End Off-Policy Deep Reinforcement Learning
    Chu, Kai-Fung
    Lam, Albert Y. S.
    Li, Victor O. K.
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (07) : 7184 - 7195
  • [4] End-to-end sensorimotor control problems of AUVs with deep reinforcement learning
    Wu, Hui
    Song, Shiji
    Hsu, Yachu
    You, Keyou
    Wu, Cheng
    [J]. 2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 5869 - 5874
  • [5] Deep reinforcement learning framework for end-to-end semiconductor process control
    Hirtz, Thomas
    Tian, He
    Shahzad, Shazrah
    Wu, Fan
    Yang, Yi
    Ren, Tian-Ling
    [J]. Neural Computing and Applications, 2024, 36 (20) : 12443 - 12460
  • [6] NeuroVectorizer: End-to-End Vectorization with Deep Reinforcement Learning
    Haj-Ali, Ameer
    Ahmed, Nesreen K.
    Willke, Ted
    Shao, Yakun Sophia
    Asanovic, Krste
    Stoica, Ion
    [J]. CGO'20: PROCEEDINGS OF THE18TH ACM/IEEE INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION, 2020, : 242 - 255
  • [7] End-to-End Race Driving with Deep Reinforcement Learning
    Jaritz, Maximilian
    de Charette, Raoul
    Toromanoff, Marin
    Perot, Etienne
    Nashashibi, Fawzi
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2018, : 2070 - 2075
  • [8] AUV Position Tracking Control Using End-to-End Deep Reinforcement Learning
    Carlucho, Ignacio
    De Paula, Mariano
    Wang, Sen
    Menna, Bruno V.
    Petillot, Yvan R.
    Acosta, Gerardo G.
    [J]. OCEANS 2018 MTS/IEEE CHARLESTON, 2018,
  • [9] End-to-end offline reinforcement learning for glycemia control
    Beolet, Tristan
    Adenis, Alice
    Huneker, Erik
    Louis, Maxime
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, 2024, 154
  • [10] Learning a Deep Neural Net Policy for End-to-End Control of Autonomous Vehicles
    Rausch, Viktor
    Hansen, Andreas
    Solowjow, Eugen
    Liu, Chang
    Kreuzer, Edwin
    Hedrick, J. Karl
    [J]. 2017 AMERICAN CONTROL CONFERENCE (ACC), 2017, : 4914 - 4919