Learning Stochastic Parametric Differentiable Predictive Control Policies

被引:4
|
作者
Drgona, Jan [1 ]
Mukherjee, Sayak [1 ]
Tuor, Aaron [1 ]
Halappanavar, Mahantesh [1 ]
Vrabie, Draguna [1 ]
机构
[1] Pacific Northwest Natl Lab, Richland, WA 99352 USA
来源
IFAC PAPERSONLINE | 2022年 / 55卷 / 25期
关键词
Stochastic explicit model predictive control; offline model-based policy optimization; deep neural networks; differentiable programming; parametric programming; STATE; SYSTEMS; MPC;
D O I
10.1016/j.ifacol.2022.09.334
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The problem of synthesizing stochastic explicit model predictive control policies is known to be quickly intractable even for systems of modest complexity when using classical control-theoretic methods. To address this challenge, we present a scalable alternative called stochastic parametric differentiable predictive control (SP-DPC) for unsupervised learning of neural control policies governing stochastic linear systems subject to nonlinear chance constraints. SP-DPC is formulated as a deterministic approximation to the stochastic parametric constrained optimal control problem. This formulation allows us to directly compute the policy gradients via automatic differentiation of the problem's value function, evaluated over sampled parameters and uncertainties. In particular, the computed expectation of the SP-DPC problem's value function is backpropagated through the closed-loop system rollouts parametrized by a known nominal system dynamics model and neural control policy which allows for direct model-based policy optimization. We demonstrate the computational efficiency and scalability of the proposed policy optimization algorithm in three numerical examples, including systems with a large number of states or subject to nonlinear constraints. Copyright (C) 2022 The Authors.
引用
收藏
页码:121 / 126
页数:6
相关论文
共 50 条
  • [1] Learning Constrained Parametric Differentiable Predictive Control Policies With Guarantees
    Drgona, Jan
    Tuor, Aaron
    Vrabie, Draguna
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 54 (06): : 3596 - 3607
  • [2] Deep Learning Explicit Differentiable Predictive Control Laws for Buildings
    Drgona, Jan
    Tuor, Aaron
    Skomski, Elliott
    Vasisht, Soumya
    Vrabie, Draguna
    [J]. IFAC PAPERSONLINE, 2021, 54 (06): : 14 - 19
  • [3] Differentiable predictive control: Deep learning alternative to explicit model predictive control for unknown nonlinear systems
    Drgona, Jan
    Kis, Karol
    Tuor, Aaron
    Vrabie, Draguna
    Klauco, Martin
    [J]. JOURNAL OF PROCESS CONTROL, 2022, 116 : 80 - 92
  • [4] Learning Continuous Control Policies by Stochastic Value Gradients
    Heess, Nicolas
    Wayne, Greg
    Silver, David
    Lillicrap, Timothy
    Tassa, Yuval
    Erez, Tom
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28
  • [5] Neural Lyapunov Differentiable Predictive Control
    Mukherjee, Sayak
    Drgona, Jan
    Tuor, Aaron
    Halappanavar, Mahantesh
    Vrabie, Draguna
    [J]. 2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022, : 2097 - 2104
  • [6] Stochastic Iterative Learning Model Predictive Control based on Stochastic Approximation
    Park, ByungJun
    Oh, Se-Kyu
    Lee, Jong Min
    [J]. IFAC PAPERSONLINE, 2019, 52 (01): : 604 - 609
  • [7] Learning High-Level Policies for Model Predictive Control
    Song, Yunlong
    Scaramuzza, Davide
    [J]. 2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 7629 - 7636
  • [8] ITERATIVE LEARNING CONTROL ON NONLINEAR STOCHASTIC NETWORKED SYSTEMS WITH NON-DIFFERENTIABLE DYNAMICS
    Alsadat, Najafi Sedigheh
    Ali, Delavarkhalafi
    Mehdi, Karbassi Seyed
    [J]. BULLETIN OF THE SOUTH URAL STATE UNIVERSITY SERIES-MATHEMATICAL MODELLING PROGRAMMING & COMPUTER SOFTWARE, 2021, 14 (04): : 63 - 73
  • [9] Control of stochastic quantum dynamics by differentiable programming
    Schafer, Frank
    Sekatski, Pavel
    Koppenhofer, Martin
    Bruder, Christoph
    Kloc, Michal
    [J]. MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2021, 2 (03):
  • [10] Learning Based Stochastic Data-Driven Predictive Control
    Hiremath, Sandesh Athni
    Mishra, Vikas Kumar
    Bajcinca, Naim
    [J]. 2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022, : 1684 - 1691