Derivation and analysis of parallel-in-time neural ordinary differential equations

Author
E. Lorin
Affiliations
[1] Université de Montréal, Centre de Recherches Mathématiques
[2] Carleton University, School of Mathematics and Statistics
Keywords
Residual Neural Network; Neural Ordinary Differential Equations; Parareal method; Parallelism-in-time; 65Y05; 65L20; 68T01; 68T20;
Abstract
The introduction in 2015 of Residual Neural Networks (RNN) and ResNet led to outstanding improvements in the performance of learning algorithms for evolution problems involving a “large” number of layers. Continuous-depth RNN-like models, called Neural Ordinary Differential Equations (NODE), were then introduced in 2019. The latter have a constant memory cost and avoid the a priori specification of the number of hidden layers. In this paper, we derive and analyze a parallel (in-parameter and in-time) version of the NODE, which potentially allows for a more efficient implementation than a standard/naive parallelization of NODEs with respect to the parameters only. We expect this approach to be relevant whenever a very large number of processors is available, or when dealing with high-dimensional ODE systems. Moreover, implicit ODE solvers require solving nonlinear systems, for instance with Newton’s algorithm, which in turn requires solving linear systems with up to cubic complexity. Because the proposed approach reduces the overall number of time steps by iteratively increasing the accuracy order of the ODE system solvers, it also reduces the number of linear systems to solve, hence benefiting from a scaling effect.
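The keywords name the Parareal method as the parallel-in-time ingredient. As a rough illustration only, the following is a minimal sketch of the classical Parareal iteration for a scalar ODE, not the NODE parallelization derived in the paper. The function names (`euler`, `parareal`, `decay`), the choice of explicit Euler for both propagators, and the test problem are all assumptions made for this example. The fine solves inside each iteration are independent across time windows, which is where parallelism in time would enter; here they run sequentially for simplicity.

```python
def euler(f, t0, t1, y0, steps):
    """Explicit Euler from t0 to t1 using `steps` uniform steps."""
    h = (t1 - t0) / steps
    t, y = t0, y0
    for _ in range(steps):
        y = y + h * f(t, y)
        t += h
    return y


def parareal(f, t0, t1, y0, n_windows, n_iters, fine_steps=100):
    """Classical Parareal with a cheap coarse and an accurate fine propagator."""
    ts = [t0 + (t1 - t0) * n / n_windows for n in range(n_windows + 1)]

    def coarse(a, b, y):
        # Coarse propagator G: a single Euler step per window (assumption).
        return euler(f, a, b, y, 1)

    def fine(a, b, y):
        # Fine propagator F: many Euler steps per window (assumption).
        return euler(f, a, b, y, fine_steps)

    # Initial guess: one sequential coarse sweep over all windows.
    u = [y0]
    for n in range(n_windows):
        u.append(coarse(ts[n], ts[n + 1], u[n]))

    for _ in range(n_iters):
        # These per-window solves do not depend on each other, so they
        # could be distributed over processors.
        f_old = [fine(ts[n], ts[n + 1], u[n]) for n in range(n_windows)]
        g_old = [coarse(ts[n], ts[n + 1], u[n]) for n in range(n_windows)]
        # Sequential correction: U_{n+1} = G(U_n^new) + F(U_n^old) - G(U_n^old).
        new = [y0]
        for n in range(n_windows):
            new.append(coarse(ts[n], ts[n + 1], new[n]) + f_old[n] - g_old[n])
        u = new
    return ts, u


def decay(t, y):
    """Hypothetical test problem y' = -y."""
    return -y


if __name__ == "__main__":
    ts, u = parareal(decay, 0.0, 2.0, 1.0, n_windows=10, n_iters=3)
    print(u[-1])
```

Each iteration improves the accuracy of the window-boundary values, and after as many iterations as there are windows the result coincides with the sequential fine solution up to round-off, so a speed-up is only possible when far fewer iterations suffice.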
Pages: 1035 - 1059
Number of pages: 24
Related papers
50 in total
  • [1] Derivation and analysis of parallel-in-time neural ordinary differential equations
    Lorin, E.
    ANNALS OF MATHEMATICS AND ARTIFICIAL INTELLIGENCE, 2020, 88 (10) : 1035 - 1059
  • [2] Nonlinear parallel-in-time Schur complement solvers for ordinary differential equations
    Badia, Santiago
    Olm, Marc
    JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2018, 344 : 794 - 806
  • [3] Latent Time Neural Ordinary Differential Equations
    Anumasa, Srinivas
    Srijith, P. K.
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 6010 - 6018
  • [4] NEURAL ORDINARY DIFFERENTIAL EQUATIONS FOR TIME SERIES RECONSTRUCTION
    Androsov, D. V.
    RADIO ELECTRONICS COMPUTER SCIENCE CONTROL, 2023, (04) : 69 - 75
  • [5] Neural ordinary differential equations for ecological and evolutionary time-series analysis
    Bonnaffe, Willem
    Sheldon, Ben C.
    Coulson, Tim
    METHODS IN ECOLOGY AND EVOLUTION, 2021, 12 (07) : 1301 - 1315
  • [6] Neural Ordinary Differential Equations
    Chen, Ricky T. Q.
    Rubanova, Yulia
    Bettencourt, Jesse
    Duvenaud, David
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [7] A Parallel-in-Time Implementation of the Numerov Method For Wave Equations
    Sun, Yafei
    Wu, Shu-Lin
    Xu, Yingxiang
    JOURNAL OF SCIENTIFIC COMPUTING, 2022, 90 (01)
  • [8] A Parallel-in-Time Implementation of the Numerov Method For Wave Equations
    Yafei Sun
    Shu-Lin Wu
    Yingxiang Xu
    Journal of Scientific Computing, 2022, 90
  • [9] Parareal Neural Networks Emulating a Parallel-in-Time Algorithm
    Lee, Youngkyu
    Park, Jongho
    Lee, Chang-Ock
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (05) : 6353 - 6364
  • [10] λ-SYMMETRIES ON THE DERIVATION OF FIRST INTEGRALS OF ORDINARY DIFFERENTIAL EQUATIONS
    Muriel, C.
    Romero, J. L.
    WASCOM 2009: 15TH CONFERENCE ON WAVES AND STABILITY IN CONTINUOUS MEDIA, 2010, : 303 - 308