Strong Stationarity for Optimal Control Problems with Non-smooth Integral Equation Constraints: Application to a Continuous DNN

被引:0
|
作者
Harbir Antil
Livia Betz
Daniel Wachsmuth
机构
[1] George Mason University,The Center for Mathematics and Artificial Intelligence, Department of Mathematical Sciences
[2] Universität Würzburg,Institut für Mathematik
来源
关键词
Non-smooth optimization; Optimal control of fractional ODEs; Integral equations; Strong stationarity; Caputo derivative; Deep neural networks; 34A08; 45D05; 49J21;
D O I
暂无
中图分类号
学科分类号
摘要
Motivated by the residual type neural networks (ResNet), this paper studies optimal control problems constrained by a non-smooth integral equation associated to a fractional differential equation. Such non-smooth equations, for instance, arise in the continuous representation of fractional deep neural networks (DNNs). Here the underlying non-differentiable function is the ReLU or max function. The control enters in a nonlinear and multiplicative manner and we additionally impose control constraints. Because of the presence of the non-differentiable mapping, the application of standard adjoint calculus is excluded. We derive strong stationary conditions by relying on the limited differentiability properties of the non-smooth map. While traditional approaches smoothen the non-differentiable function, no such smoothness is retained in our final strong stationarity system. Thus, this work also closes a gap which currently exists in continuous neural networks with ReLU type activation function.
引用
收藏
相关论文
共 50 条