Greedy training algorithms for neural networks and applications to PDEs

被引：13

作者：

Siegel, Jonathan W. ^{[1
]}

Hong, Qingguo ^{[1
]}

Jin, Xianlin ^{[2
]}

Hao, Wenrui ^{[1
]}

Xu, Jinchao ^{[1
]}

机构：

[1] Penn State Univ, Dept Math, University Pk, PA 16802 USA

[2] Peking Univ, Sch Math Sci, Beijing, Peoples R China

来源：

JOURNAL OF COMPUTATIONAL PHYSICS | 2023年 / 484卷

关键词：

Neural networks; Partial differential equations; Greedy algorithms; Generalization accuracy; UNIVERSAL APPROXIMATION; CONVERGENCE-RATES; ERROR-BOUNDS;

D O I：

10.1016/j.jcp.2023.112084

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Recently, neural networks have been widely applied for solving partial differential equations (PDEs). Although such methods have been proven remarkably successful on practical engineering problems, they have not been shown, theoretically or empirically, to converge to the underlying PDE solution with arbitrarily high accuracy. The primary difficulty lies in solving the highly non-convex optimization problems resulting from the neural network discretization, which are difficult to treat both theoretically and practically. It is our goal in this work to take a step toward remedying this. For this purpose, we develop a novel greedy training algorithm for shallow neural networks. Our method is applicable to both the variational formulation of the PDE and also to the residual minimization formulation pioneered by physics informed neural networks (PINNs). We analyze the method and obtain a priori error bounds when solving PDEs from the function class defined by shallow networks, which rigorously establishes the convergence of the method as the network size increases. Finally, we test the algorithm on several benchmark examples, including high dimensional PDEs, to confirm the theoretical convergence rate. Although the method is expensive relative to traditional approaches such as finite element methods, we view this work as a proof of concept for neural network-based methods, which shows that numerical methods based upon neural networks can be shown to rigorously converge.(c) 2023 Elsevier Inc. All rights reserved.

引用

页数：27

共 50 条

[41] Spiking Neural Networks - Algorithms, Hardware Implementations and Applications
Kulkarni, Shruti R.
Babu, Anakha V.
Rajendran, Bipin
2017 IEEE 60TH INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS (MWSCAS), 2017, : 426 - 431
[42] A greedy algorithm for quantizing neural networks
Lybrand, Eric
Saab, Rayan
Journal of Machine Learning Research, 2021, 22 : 1 - 38
[43] A Greedy Algorithm for Quantizing Neural Networks
Lybrand, Eric
Saab, Rayan
JOURNAL OF MACHINE LEARNING RESEARCH, 2021, 22
[44] Pore Networks Simulation with Parallel Greedy Algorithms
Roman-Alonso, G.
Boukerche, A.
Matadamas-Hernandez, J.
Castro-Garcia, M. A.
2012 IEEE/ACM 16TH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED SIMULATION AND REAL TIME APPLICATIONS (DS-RT), 2012, : 93 - 100
[45] Training neural networks with harmony search algorithms for classification problems
Kulluk, Sinem
Ozbakir, Lale
Baykasoglu, Adil
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2012, 25 (01) : 11 - 19
[46] Adaptive stepsize algorithms for on-line training of neural networks
Magoulas, GD
Plagianakos, VP
Vrahatis, MN
NONLINEAR ANALYSIS-THEORY METHODS & APPLICATIONS, 2001, 47 (05) : 3425 - 3430
[47] Global optimization algorithms for training product unit neural networks
Ismail, A
Engelbrecht, AP
IJCNN 2000: PROCEEDINGS OF THE IEEE-INNS-ENNS INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOL I, 2000, : 132 - 137
[48] On Descent Spectral CG algorithms for Training Recurrent Neural Networks
Livieris, I. E.
Sotiropoulos, D. G.
Pintelas, P.
13TH PANHELLENIC CONFERENCE ON INFORMATICS, PROCEEDINGS, 2009, : 65 - +
[49] EFFICIENT GENETIC ALGORITHMS FOR TRAINING LAYERED FEEDFORWARD NEURAL NETWORKS
YOON, BJ
HOLMES, DJ
LANGHOLZ, G
KANDEL, A
INFORMATION SCIENCES, 1994, 76 (1-2) : 67 - 85
[50] Levenberg-Marquardt Training Algorithms for Random Neural Networks
Basterrech, Sebastian
Mohammed, Samir
Rubino, Gerardo
Soliman, Mostafa
COMPUTER JOURNAL, 2011, 54 (01): : 125 - 135

← 1 2 3 4 5 →