Connections Between Numerical Algorithms for PDEs and Neural Networks

被引:7
|
作者
Alt, Tobias [1 ]
Schrader, Karl [1 ]
Augustin, Matthias [1 ]
Peter, Pascal [1 ]
Weickert, Joachim [1 ]
机构
[1] Saarland Univ, Fac Math & Comp Sci, Math Image Anal Grp, Campus E1-7, D-66041 Saarbrucken, Germany
基金
欧洲研究理事会;
关键词
Numerical algorithms; Partial differential equations; Neural networks; Nonlinear diffusion; Stability; DIFFUSION; FRAMEWORK; REGULARIZATION; COMPRESSION; DRIVEN;
D O I
10.1007/s10851-022-01106-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We investigate numerous structural connections between numerical algorithms for partial differential equations (PDEs) and neural architectures. Our goal is to transfer the rich set of mathematical foundations from the world of PDEs to neural networks. Besides structural insights, we provide concrete examples and experimental evaluations of the resulting architectures. Using the example of generalised nonlinear diffusion in 1D, we consider explicit schemes, acceleration strategies thereof, implicit schemes, and multigrid approaches. We connect these concepts to residual networks, recurrent neural networks, and U-net architectures. Our findings inspire a symmetric residual network design with provable stability guarantees and justify the effectiveness of skip connections in neural networks from a numerical perspective. Moreover, we present U-net architectures that implement multigrid techniques for learning efficient solutions of partial differential equation models, and motivate uncommon design choices such as trainable nonmonotone activation functions. Experimental evaluations show that the proposed architectures save half of the trainable parameters and can thus outperform standard ones with the same model complexity. Our considerations serve as a basis for explaining the success of popular neural architectures and provide a blueprint for developing new mathematically well-founded neural building blocks.
引用
收藏
页码:185 / 208
页数:24
相关论文
共 50 条
  • [21] Neural networks in bacteria: Making connections
    Armitage, JP
    Holland, IB
    Jenal, U
    Kenny, B
    JOURNAL OF BACTERIOLOGY, 2005, 187 (01) : 26 - 36
  • [22] Ehresmann connections and feedforward neural networks
    Pearson, DW
    MATHEMATICAL AND COMPUTER MODELLING, 1999, 29 (09) : 17 - 25
  • [23] FORMAL CONNECTIONS BETWEEN LIGHTNESS ALGORITHMS
    HURLBERT, A
    JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 1986, 3 (10): : 1684 - 1693
  • [24] FERROELECTRIC CONNECTIONS FOR IC NEURAL NETWORKS
    CLARK, LT
    GRONDIN, RO
    DEY, SK
    FIRST IEE INTERNATIONAL CONFERENCE ON ARTIFICIAL NEURAL NETWORKS, 1989, : 47 - 51
  • [25] Physics-informed neural networks for approximating dynamic (hyperbolic) PDEs of second order in time: Error analysis and algorithms
    Qian, Yanxia
    Zhang, Yongchao
    Huang, Yunqing
    Dong, Suchuan
    JOURNAL OF COMPUTATIONAL PHYSICS, 2023, 495
  • [26] Learning algorithms for neural networks
    Yudin, D.B.
    Avtomatika i Telemekhanika, 1996, (11): : 148 - 154
  • [27] Neural networks: Algorithms and applications
    Liu, Derong
    Zhang, Huaguang
    Hu, Sanqing
    NEUROCOMPUTING, 2008, 71 (4-6) : 471 - 473
  • [28] Genetic algorithms and neural networks
    Jenkins, WM
    NEURAL NETWORKS IN THE ANALYSIS AND DESIGN OF STRUCTURES, 2000, (404): : 53 - 92
  • [29] Genetic algorithms in neural networks
    Dumitrescu, D
    Stan, I
    ARTIFICIAL INTELLIGENCE: METHODOLOGY, SYSTEMS, APPLICATIONS, 1996, 35 : 134 - 140
  • [30] Modeling nonlinear waves and PDEs via cellular neural networks
    Slavova A.
    Annali dell’Università di Ferrara, 1999, 45 (1): : 311 - 326