Optimization of Graph Neural Networks with Natural Gradient Descent

Cited: 15
Authors
Izadi, Mohammad Rasool [1,2]
Fang, Yihao [2]
Stevenson, Robert [1]
Lin, Lizhen [2]
Affiliations
[1] University of Notre Dame, Electrical Engineering, Notre Dame, IN 46556, USA
[2] University of Notre Dame, Applied & Computational Mathematics & Statistics, Notre Dame, IN 46556, USA
Keywords
Graph neural network; Fisher information; natural gradient descent; network data
DOI
10.1109/BigData50022.2020.9378063
Chinese Library Classification (CLC)
TP18 [Artificial intelligence theory]
Discipline codes
081104; 0812; 0835; 1405
Abstract
In this work, we propose to employ information-geometric tools to optimize graph neural network architectures such as graph convolutional networks. More specifically, we develop optimization algorithms for graph-based semi-supervised learning by employing natural gradient information in the optimization process. This allows us to efficiently exploit the geometry of the underlying statistical model or parameter space for optimization and inference. To the best of our knowledge, this is the first work to utilize the natural gradient for the optimization of graph neural networks, and the approach can be extended to other semi-supervised problems. Efficient computational algorithms are developed, and extensive numerical studies are conducted to demonstrate the superior performance of our algorithms over existing optimizers such as ADAM and SGD.
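For readers who want the mechanics, the natural gradient update preconditions the ordinary gradient with the inverse Fisher information matrix: theta <- theta - eta * F(theta)^{-1} * grad L(theta). Below is a minimal, self-contained sketch of this update for a toy one-layer graph convolution with a softmax output, using a damped empirical Fisher matrix as a practical surrogate for the true Fisher. This is an illustration of the general technique, not the authors' implementation; all names (A_hat, damping, the synthetic graph) are assumptions made for the example.

    import numpy as np

    # Illustrative sketch of natural gradient descent (NGD) for node
    # classification with a one-layer graph convolution. Not the paper's
    # code; synthetic data and the damped empirical Fisher are assumptions.
    rng = np.random.default_rng(0)

    n, d, c = 8, 4, 3                              # nodes, features, classes
    X = rng.normal(size=(n, d))                    # node features
    A = (rng.random((n, n)) < 0.3).astype(float)   # random adjacency
    np.fill_diagonal(A, 0.0)
    A = np.maximum(A, A.T)                         # undirected graph
    A_hat = A + np.eye(n)                          # add self-loops
    d_inv_sqrt = 1.0 / np.sqrt(A_hat.sum(axis=1))
    A_hat = A_hat * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]  # sym. normalization
    y = rng.integers(0, c, size=n)                 # node labels
    W = rng.normal(scale=0.1, size=(d, c))         # model parameters

    def softmax(z):
        z = z - z.max(axis=1, keepdims=True)
        e = np.exp(z)
        return e / e.sum(axis=1, keepdims=True)

    lr, damping = 0.1, 0.1
    for step in range(200):
        H = A_hat @ X                              # graph convolution
        P = softmax(H @ W)                         # class probabilities
        E = P.copy()
        E[np.arange(n), y] -= 1.0                  # dLoss/dlogits per node
        # Per-node gradients w.r.t. vec(W): rows of G.
        G = np.stack([np.outer(H[i], E[i]).ravel() for i in range(n)])
        g = G.mean(axis=0)                         # averaged gradient
        # Damped empirical Fisher; damping keeps the solve well conditioned.
        F = G.T @ G / n + damping * np.eye(d * c)
        nat_grad = np.linalg.solve(F, g)           # F^{-1} g
        W -= lr * nat_grad.reshape(d, c)           # natural gradient step

    loss = -np.log(softmax((A_hat @ X) @ W)[np.arange(n), y]).mean()
    print(f"final cross-entropy: {loss:.3f}")

The dense solve of F v = g is only feasible at this toy scale: F grows with the square of the parameter count, so structured approximations (e.g., block-diagonal or Kronecker-factored) are the standard way to make NGD tractable, which is the kind of efficiency concern the abstract's "efficient computational algorithms" refers to.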
Pages: 171-179
Page count: 9
Related papers
50 records in total
  • [1] A Convergence Analysis of Gradient Descent on Graph Neural Networks
    Awasthi, Pranjal
    Das, Abhimanyu
    Gollapudi, Sreenivas
    Advances in Neural Information Processing Systems 34 (NeurIPS 2021), 2021, 34
  • [2] Learning Graph Neural Networks with Approximate Gradient Descent
    Li, Qunwei
    Zou, Shaofeng
    Zhong, Wenliang
    Thirty-Fifth AAAI Conference on Artificial Intelligence (AAAI-21), 2021, 35: 8438-8446
  • [3] Analysis of natural gradient descent for multilayer neural networks
    Rattray, M.
    Saad, D.
    Physical Review E, 1999, 59 (04): 4523-4532
  • [4] Fast Convergence of Natural Gradient Descent for Overparameterized Neural Networks
    Zhang, Guodong
    Martens, James
    Grosse, Roger
    Advances in Neural Information Processing Systems 32 (NeurIPS 2019), 2019, 32
  • [5] Learning dynamics of gradient descent optimization in deep neural networks
    Wu, Wei
    Jing, Xiaoyuan
    Du, Wencai
    Chen, Guoliang
    Science China Information Sciences, 2021, 64 (05): 17-31
  • [6] Evolutionary Stochastic Gradient Descent for Optimization of Deep Neural Networks
    Cui, Xiaodong
    Zhang, Wei
    Tuske, Zoltan
    Picheny, Michael
    Advances in Neural Information Processing Systems 31 (NeurIPS 2018), 2018, 31
  • [7] Strengthening Gradient Descent by Sequential Motion Optimization for Deep Neural Networks
    Le-Duc, Thang
    Nguyen, Quoc-Hung
    Lee, Jaehong
    Nguyen-Xuan, H.
    IEEE Transactions on Evolutionary Computation, 2023, 27 (03): 565-579
  • [8] Evaluation of Gradient Descent Optimization: Using Android Applications in Neural Networks
    Alshahrani, Hani
    Alzahrani, Abdulrahman
    Alshehri, Ali
    Alharthi, Raed
    Fu, Huirong
    Proceedings 2017 International Conference on Computational Science and Computational Intelligence (CSCI), 2017: 1471-1476