Greedy training algorithms for neural networks and applications to PDEs

Cited by: 13
Authors
Siegel, Jonathan W. [1]
Hong, Qingguo [1]
Jin, Xianlin [2]
Hao, Wenrui [1]
Xu, Jinchao [1]
Affiliations
[1] Penn State Univ, Dept Math, University Pk, PA 16802 USA
[2] Peking Univ, Sch Math Sci, Beijing, Peoples R China
Keywords
Neural networks; Partial differential equations; Greedy algorithms; Generalization accuracy; Universal approximation; Convergence rates; Error bounds
DOI
10.1016/j.jcp.2023.112084
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Recently, neural networks have been widely applied for solving partial differential equations (PDEs). Although such methods have been proven remarkably successful on practical engineering problems, they have not been shown, theoretically or empirically, to converge to the underlying PDE solution with arbitrarily high accuracy. The primary difficulty lies in solving the highly non-convex optimization problems resulting from the neural network discretization, which are difficult to treat both theoretically and practically. It is our goal in this work to take a step toward remedying this. For this purpose, we develop a novel greedy training algorithm for shallow neural networks. Our method is applicable to both the variational formulation of the PDE and also to the residual minimization formulation pioneered by physics informed neural networks (PINNs). We analyze the method and obtain a priori error bounds when solving PDEs from the function class defined by shallow networks, which rigorously establishes the convergence of the method as the network size increases. Finally, we test the algorithm on several benchmark examples, including high dimensional PDEs, to confirm the theoretical convergence rate. Although the method is expensive relative to traditional approaches such as finite element methods, we view this work as a proof of concept for neural network-based methods, which shows that numerical methods based upon neural networks can be shown to rigorously converge. © 2023 Elsevier Inc. All rights reserved.
Pages: 27
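Illustrative sketch (not taken from the paper): the abstract describes greedy training of shallow networks but does not specify the algorithm in detail. The Python code below shows one generic variant from the greedy approximation literature, an orthogonal greedy algorithm, applied to a plain least-squares fit of a 1D function rather than to a PDE. The ReLU dictionary, the bias grid, the target function, and the names `build_dictionary` and `orthogonal_greedy_fit` are all assumptions made for illustration.

```python
import numpy as np

def relu(z):
    return np.maximum(z, 0.0)

def build_dictionary(x, n_biases=201):
    """Hypothetical dictionary: neurons relu(w*x + b), w in {-1, +1},
    b on a uniform grid in [-1, 1], sampled at the points x."""
    biases = np.linspace(-1.0, 1.0, n_biases)
    cols = [relu(w * x + b) for w in (-1.0, 1.0) for b in biases]
    D = np.stack(cols, axis=1)
    norms = np.linalg.norm(D, axis=0)
    keep = norms > 1e-12            # drop neurons that vanish on the grid
    return D[:, keep] / norms[keep]  # unit-norm columns

def orthogonal_greedy_fit(D, f, n_neurons):
    """Add one neuron per step, then re-fit all outer coefficients."""
    selected, errors = [], []
    residual = f.copy()
    coef = np.zeros(0)
    for _ in range(n_neurons):
        # Greedy step: pick the neuron most correlated with the residual.
        scores = np.abs(D.T @ residual)
        if selected:
            scores[selected] = -np.inf  # never pick the same neuron twice
        selected.append(int(np.argmax(scores)))
        # Orthogonal step: least-squares projection onto the selected neurons.
        A = D[:, selected]
        coef, *_ = np.linalg.lstsq(A, f, rcond=None)
        residual = f - A @ coef
        errors.append(float(np.linalg.norm(residual) / np.sqrt(len(f))))
    return selected, coef, errors

# Assumed toy target: fit sin(2*pi*x) on [0, 1] with 10 greedily chosen neurons.
x = np.linspace(0.0, 1.0, 400)
f = np.sin(2.0 * np.pi * x)
D = build_dictionary(x)
selected, coef, errors = orthogonal_greedy_fit(D, f, n_neurons=10)
print(errors)  # root-mean-square error after each added neuron
```

Because each step projects onto a larger span of neurons, the recorded error cannot increase as neurons are added. This mirrors, in a much simpler setting, the idea of convergence as the network size grows. The cost of each step is dominated by scanning the whole dictionary, which hints at why such methods are expensive compared with traditional discretizations.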