Derivation and analysis of parallel-in-time neural ordinary differential equations

被引:0
|
作者
E. Lorin
机构
[1] Université de Montréal,Centre de Recherches Mathématiques,
[2] Carleton University,School of Mathematics and Statistics
关键词
Residual Neural Network; Neural Ordinary Differential Equations; Parareal method; Parallelism-in-time; 65Y05; 65L20; 68T01; 68T20;
D O I
暂无
中图分类号
学科分类号
摘要
The introduction in 2015 of Residual Neural Networks (RNN) and ResNET allowed for outstanding improvements of the performance of learning algorithms for evolution problems containing a “large” number of layers. Continuous-depth RNN-like models called Neural Ordinary Differential Equations (NODE) were then introduced in 2019. The latter have a constant memory cost, and avoid the a priori specification of the number of hidden layers. In this paper, we derive and analyze a parallel (-in-parameter and time) version of the NODE, which potentially allows for a more efficient implementation than a standard/naive parallelization of NODEs with respect to the parameters only. We expect this approach to be relevant whenever we have access to a very large number of processors, or when we are dealing with high dimensional ODE systems. Moreover, when using implicit ODE solvers, solutions to linear systems with up to cubic complexity are then required for solving nonlinear systems using for instance Newton’s algorithm; as the proposed approach allows to reduce the overall number of time-steps thanks to an iterative increase of the accuracy order of the ODE system solvers, it then reduces the number of linear systems to solve, hence benefiting from a scaling effect.
引用
收藏
页码:1035 / 1059
页数:24
相关论文
共 50 条
  • [21] The continuous memory: A neural network with ordinary differential equations for continuous-time series analysis
    Li, Bo
    Chen, Haoyu
    An, Zhiyong
    Yu, Yuan
    Jia, Ying
    Chen, Long
    Sun, Mingyan
    APPLIED SOFT COMPUTING, 2024, 167
  • [22] Time-aware neural ordinary differential equations for incomplete time series modeling
    Zhuoqing Chang
    Shubo Liu
    Run Qiu
    Song Song
    Zhaohui Cai
    Guoqing Tu
    The Journal of Supercomputing, 2023, 79 : 18699 - 18727
  • [23] Time-aware neural ordinary differential equations for incomplete time series modeling
    Chang, Zhuoqing
    Liu, Shubo
    Qiu, Run
    Song, Song
    Cai, Zhaohui
    Tu, Guoqing
    JOURNAL OF SUPERCOMPUTING, 2023, 79 (16): : 18699 - 18727
  • [24] A UNIFIED ANALYSIS FRAMEWORK FOR ITERATIVE PARALLEL-IN-TIME ALGORITHMS
    Gander, Martin J.
    Lunet, Thibaut
    Ruprecht, Daniel
    Speck, Robert
    SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2023, 45 (05): : A2275 - A2303
  • [25] Heavy Ball Neural Ordinary Differential Equations
    Xia, Hedi
    Suliafu, Vai
    Ji, Hangjie
    Nguyen, Tan M.
    Bertozzi, Andrea L.
    Osher, Stanley J.
    Wang, Bao
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [26] Interpretable polynomial neural ordinary differential equations
    Fronk, Colby
    Petzold, Linda
    CHAOS, 2023, 33 (04)
  • [27] On Numerical Integration in Neural Ordinary Differential Equations
    Zhu, Aiqing
    Jin, Pengzhan
    Zhu, Beibei
    Tang, Yifa
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [28] Survey on Graph Neural Ordinary Differential Equations
    Jiao, Pengfei
    Chen, Shuxin
    Guo, Xuan
    He, Dongxiao
    Liu, Dong
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2024, 61 (08): : 2045 - 2066
  • [29] Interpretable Fourier Neural Ordinary Differential Equations
    Bian, Hanlin
    Zhu, Wei
    Chen, Zhang
    Li, Jingsui
    Pei, Chao
    2024 3RD CONFERENCE ON FULLY ACTUATED SYSTEM THEORY AND APPLICATIONS, FASTA 2024, 2024, : 885 - 890
  • [30] Transcriptomic forecasting with neural ordinary differential equations
    Erbe, Rossin
    Stein-O'Brien, Genevieve
    Fertig, Elana J.
    PATTERNS, 2023, 4 (08):