Deep Neural Networks Motivated by Partial Differential Equations

被引:205
|
作者
Ruthotto, Lars [1 ,3 ]
Haber, Eldad [2 ,3 ]
机构
[1] Emory Univ, Dept Math, 400 Dowman Dr, Atlanta, GA 30322 USA
[2] Univ British Columbia, Dept Earth & Ocean Sci, Vancouver, BC, Canada
[3] Xtract Technol Inc, Vancouver, BC, Canada
基金
美国国家科学基金会; 英国工程与自然科学研究理事会;
关键词
Machine learning; Deep neural networks; Partial differential equations; PDE-constrained optimization; Image classification;
D O I
10.1007/s10851-019-00903-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Partial differential equations (PDEs) are indispensable for modeling many physical phenomena and also commonly used for solving image processing tasks. In the latter area, PDE-based approaches interpret image data as discretizations of multivariate functions and the output of image processing algorithms as solutions to certain PDEs. Posing image processing problems in the infinite-dimensional setting provides powerful tools for their analysis and solution. For the last few decades, the reinterpretation of classical image processing problems through the PDE lens has been creating multiple celebrated approaches that benefit a vast area of tasks including image segmentation, denoising, registration, and reconstruction. In this paper, we establish a new PDE interpretation of a class of deep convolutional neural networks (CNN) that are commonly used to learn from speech, image, and video data. Our interpretation includes convolution residual neural networks (ResNet), which are among the most promising approaches for tasks such as image classification having improved the state-of-the-art performance in prestigious benchmark challenges. Despite their recent successes, deep ResNets still face some critical challenges associated with their design, immense computational costs and memory requirements, and lack of understanding of their reasoning. Guided by well-established PDE theory, we derive three new ResNet architectures that fall into two new classes: parabolic and hyperbolic CNNs. We demonstrate how PDE theory can provide new insights and algorithms for deep learning and demonstrate the competitiveness of three new CNN architectures using numerical experiments.
引用
收藏
页码:352 / 364
页数:13
相关论文
共 50 条
  • [1] Deep Neural Networks Motivated by Partial Differential Equations
    Lars Ruthotto
    Eldad Haber
    [J]. Journal of Mathematical Imaging and Vision, 2020, 62 : 352 - 364
  • [2] Partial Differential Equations for Training Deep Neural Networks
    Chaudhari, Pratik
    Oberman, Adam
    Osher, Stanley
    Soatto, Stefano
    Carlier, Guillaume
    [J]. 2017 FIFTY-FIRST ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2017, : 1627 - 1631
  • [3] Deep relaxation: partial differential equations for optimizing deep neural networks
    Chaudhari, Pratik
    Oberman, Adam
    Osher, Stanley
    Soatto, Stefano
    Carlier, Guillaume
    [J]. RESEARCH IN THE MATHEMATICAL SCIENCES, 2018, 5 : 1 - 30
  • [4] Deep relaxation: partial differential equations for optimizing deep neural networks
    Pratik Chaudhari
    Adam Oberman
    Stanley Osher
    Stefano Soatto
    Guillaume Carlier
    [J]. Research in the Mathematical Sciences, 2018, 5
  • [5] PDE-GCN: Novel Architectures for Graph Neural Networks Motivated by Partial Differential Equations
    Eliasof, Moshe
    Haber, Eldad
    Treister, Eran
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [6] Improved Deep Neural Networks with Domain Decomposition in Solving Partial Differential Equations
    Wei Wu
    Xinlong Feng
    Hui Xu
    [J]. Journal of Scientific Computing, 2022, 93
  • [7] Improved Deep Neural Networks with Domain Decomposition in Solving Partial Differential Equations
    Wu, Wei
    Feng, Xinlong
    Xu, Hui
    [J]. JOURNAL OF SCIENTIFIC COMPUTING, 2022, 93 (01)
  • [8] Transferable Neural Networks for Partial Differential Equations
    Zezhong Zhang
    Feng Bao
    Lili Ju
    Guannan Zhang
    [J]. Journal of Scientific Computing, 2024, 99
  • [9] Transferable Neural Networks for Partial Differential Equations
    Zhang, Zezhong
    Bao, Feng
    Ju, Lili
    Zhang, Guannan
    [J]. JOURNAL OF SCIENTIFIC COMPUTING, 2024, 99 (01)
  • [10] Simulating Partial Differential Equations with Neural Networks
    Chertock, Anna
    Leonard, Christopher
    [J]. HYPERBOLIC PROBLEMS: THEORY, NUMERICS, APPLICATIONS, VOL II, HYP2022, 2024, 35 : 39 - 49