A Convergence Analysis of Gradient Descent on Graph Neural Networks

Cited by: 0
Authors
Awasthi, Pranjal [1]
Das, Abhimanyu [1]
Gollapudi, Sreenivas [1]
Affiliations
[1] Google Research, Mountain View, CA 94043, USA
Keywords
DOI
Not available
CLC Number
TP18 [Artificial Intelligence Theory];
Subject Classification Codes
081104; 0812; 0835; 1405
Abstract
Graph Neural Networks (GNNs) are a powerful class of architectures for solving learning problems on graphs. While many variants of GNNs have been proposed in the literature and have achieved strong empirical performance, their theoretical properties are less well understood. In this work we study the convergence properties of the gradient descent algorithm when used to train GNNs. In particular, we consider the realizable setting where the data is generated from a network with unknown weights, and our goal is to study conditions under which gradient descent on a GNN architecture can recover near-optimal solutions. While such analysis has been performed in recent years for other architectures such as fully connected feed-forward networks, the message-passing nature of the updates in a GNN poses a new challenge in understanding the gradient descent updates. We take a step towards overcoming this by proving that, for the case of deep linear GNNs, gradient descent provably recovers solutions up to error ε in O(log(1/ε)) iterations, under natural assumptions on the data distribution. Furthermore, for the case of one-round GNNs with ReLU activations, we show that gradient descent provably recovers solutions up to error ε in O((1/ε²) log(1/ε)) iterations.
Pages: 13
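To make the linear-case convergence claim in the abstract concrete, the following is a minimal, self-contained sketch rather than the paper's construction: it assumes a one-round linear GNN with row-normalized mean aggregation over a random undirected graph, a realizable teacher-student setup, and plain full-batch gradient descent on the squared loss. The graph size, feature dimensions, sparsity, and step size below are all illustrative choices, not values from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative sizes: n nodes, d-dimensional features, k-dimensional targets.
n, d, k = 200, 8, 3

# Random undirected graph with self-loops; P is the row-normalized (mean) aggregation matrix.
A = (rng.random((n, n)) < 0.05).astype(float)
A = np.maximum(A, A.T)
np.fill_diagonal(A, 1.0)
P = A / A.sum(axis=1, keepdims=True)

# Realizable setting: labels are produced by a one-round linear "teacher" GNN
# with unknown weights W_star.
X = rng.normal(size=(n, d))
W_star = rng.normal(size=(d, k))
Y = P @ X @ W_star

# Student: same architecture, weights initialized at zero.
Z = P @ X                                 # aggregated features, fixed during training
W = np.zeros((d, k))
lr = 1.0 / np.linalg.norm(Z, 2) ** 2      # step size at most 1 / sigma_max(Z)^2

for t in range(5000):
    resid = Z @ W - Y                     # residual of the loss 0.5 * ||Z W - Y||_F^2
    W -= lr * (Z.T @ resid)               # full-batch gradient descent step
    rel_err = np.linalg.norm(resid) / np.linalg.norm(Y)
    if rel_err < 1e-6:
        break

print(f"stopped after {t + 1} iterations, relative training error {rel_err:.2e}")
print("weight recovery error:", np.linalg.norm(W - W_star) / np.linalg.norm(W_star))
```

Because the aggregated features Z = P X stay fixed during training, the objective is a linear least-squares problem; with a step size of at most 1/σ_max(Z)² and Z of full column rank, the error decays geometrically, which mirrors the O(log(1/ε)) iteration count stated for the linear case.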