Stability of Deep Neural Networks via Discrete Rough Paths

被引:1
|
作者
Bayer, Christian [1 ]
Friz, Peter K. [2 ]
Tapia, Nikolas [3 ,4 ]
机构
[1] Weierstrass Inst, D-10117 Berlin, Germany
[2] Tech Univ Berlin, D-10623 Berlin, Germany
[3] Weierstrass Inst, Berlin, Germany
[4] TU Berlin, Berlin, Germany
来源
关键词
residual neural networks; rough paths; p-variation; stability;
D O I
10.1137/22M1472358
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Using rough path techniques, we provide a priori estimates for the output of deep residual neural networks in terms of both the input data and the (trained) network weights. As trained network weights are typically very rough when seen as functions of the layer, we propose to derive stability bounds in terms of the total p-variation of trained weights for any p \in [1, 3]. Unlike the C1-theory underlying the neural ODE literature, our estimates remain bounded even in the limiting case of weights behaving like Brownian motions, as suggested in [A.-S. Cohen, R. Cont, A. Rossier, and R. Xu, Proceedings of the 38th International Conference on Machine Learning, JMLR, Cambridge, MA, 2021, pp. 2039-2048]. Mathematically, we interpret residual neural network as solutions to (rough) difference equations, and analyze them based on recent results of discrete-time signatures and rough path theory.
引用
收藏
页码:50 / 76
页数:27
相关论文
共 50 条
  • [1] Numerical Attractors via Discrete Rough Paths
    Cong, Nguyen Dinh
    Duc, Luu Hoang
    Hong, Phan Thanh
    JOURNAL OF DYNAMICS AND DIFFERENTIAL EQUATIONS, 2023, 37 (1) : 727 - 748
  • [2] Improving the Robustness of Deep Neural Networks via Stability Training
    Zheng, Stephan
    Song, Yang
    Leung, Thomas
    Goodfellow, Ian
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 4480 - 4488
  • [3] Deep Networks as Paths on the Manifold of Neural Representations
    Lange, Richard D.
    Kwok, Devin
    Matelsky, Jordan
    Wang, Xinyue
    Rolnick, David
    Kording, Konrad P.
    TOPOLOGICAL, ALGEBRAIC AND GEOMETRIC LEARNING WORKSHOPS 2023, VOL 221, 2023, 221
  • [4] On the approximation of rough functions with deep neural networks
    De Ryck T.
    Mishra S.
    Ray D.
    SeMA Journal, 2022, 79 (3) : 399 - 440
  • [5] Differential Equations Driven by Rough Paths: An Approach via Discrete Approximation
    Davie, A. M.
    APPLIED MATHEMATICS RESEARCH EXPRESS, 2008, (01)
  • [6] PATHWISE STABILITY OF LIKELIHOOD ESTIMATORS FOR DIFFUSIONS VIA ROUGH PATHS
    Diehl, Joscha
    Friz, Peter
    Mai, Hilmar
    ANNALS OF APPLIED PROBABILITY, 2016, 26 (04): : 2169 - 2192
  • [7] Stability of discrete Hopfield neural networks
    Ma, R. N.
    Zhang, J. S.
    Bai, G. Q.
    DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES B-APPLICATIONS & ALGORITHMS, 2006, 13E : 2266 - 2270
  • [8] Deep learning architecture using rough sets and rough neural networks
    Hassan, Yasser F.
    KYBERNETES, 2017, 46 (04) : 693 - 705
  • [9] Discrete rough paths and limit theorems
    Liu, Yanghui
    Tindel, Samy
    ANNALES DE L INSTITUT HENRI POINCARE-PROBABILITES ET STATISTIQUES, 2020, 56 (03): : 1730 - 1774
  • [10] SENSITIVITIES VIA ROUGH PATHS
    Marie, Nicolas
    ESAIM-PROBABILITY AND STATISTICS, 2015, 19 : 515 - 543