Stability of Deep Neural Networks via Discrete Rough Paths

Cited by: 1
Authors
Bayer, Christian [1]
Friz, Peter K. [2]
Tapia, Nikolas [3, 4]
Affiliations
[1] Weierstrass Inst, D-10117 Berlin, Germany
[2] Tech Univ Berlin, D-10623 Berlin, Germany
[3] Weierstrass Inst, Berlin, Germany
[4] TU Berlin, Berlin, Germany
Source
Keywords
residual neural networks; rough paths; p-variation; stability
DOI
10.1137/22M1472358
Chinese Library Classification (CLC) number
O29 [Applied Mathematics]
Discipline classification code
070104
Abstract
Using rough path techniques, we provide a priori estimates for the output of deep residual neural networks in terms of both the input data and the (trained) network weights. As trained network weights are typically very rough when seen as functions of the layer, we propose to derive stability bounds in terms of the total p-variation of trained weights for any p ∈ [1, 3]. Unlike the C^1-theory underlying the neural ODE literature, our estimates remain bounded even in the limiting case of weights behaving like Brownian motions, as suggested in [A.-S. Cohen, R. Cont, A. Rossier, and R. Xu, Proceedings of the 38th International Conference on Machine Learning, JMLR, Cambridge, MA, 2021, pp. 2039-2048]. Mathematically, we interpret residual neural networks as solutions to (rough) difference equations and analyze them using recent results on discrete-time signatures and rough path theory.
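To make the notion of total p-variation of layer-indexed weights concrete, the following is a minimal sketch, not the authors' code. It assumes a scalar weight sequence w_0, ..., w_N indexed by layer and uses the standard definition: the supremum, over all sub-partitions of the layer indices, of the sum of |w_t - w_s|^p over consecutive partition points, raised to the power 1/p. The function name `p_variation` and the dynamic-programming approach are illustrative choices.

```python
def p_variation(weights, p):
    """Total p-variation of a finite scalar sequence (illustrative sketch).

    Computes max over index partitions 0 = t_0 < t_1 < ... < t_k = N of
    sum_i |w[t_{i+1}] - w[t_i]|**p, then takes the 1/p-th root.
    Uses an O(N^2) dynamic program; assumes p >= 1.
    """
    if p < 1:
        raise ValueError("p must be at least 1")
    n = len(weights)
    if n < 2:
        return 0.0
    # best[j] = largest p-th-power sum over partitions of indices 0..j
    # whose last partition point is j.
    best = [0.0] * n
    for j in range(1, n):
        best[j] = max(
            best[i] + abs(weights[j] - weights[i]) ** p for i in range(j)
        )
    return best[-1] ** (1.0 / p)


if __name__ == "__main__":
    oscillating = [0.0, 1.0, 0.0, 1.0]
    monotone = [0.0, 1.0, 2.0, 3.0]
    # For p = 1 the result is the sum of absolute increments.
    print(p_variation(oscillating, 1))
    # For p > 1 a monotone sequence is best covered by a single jump,
    # while an oscillating one benefits from using every point.
    print(p_variation(monotone, 2), p_variation(oscillating, 2))
```

For p = 1 this reduces to the usual total variation, while larger p assigns a finite, smaller value to sequences that oscillate strongly from layer to layer, which is the regime of rough trained weights the abstract refers to.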
Pages: 50-76
Number of pages: 27