共 50 条
- [21] Learning One-hidden-layer ReLU Networks via Gradient Descent 22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89
- [22] PLATEAU PHENOMENON IN GRADIENT DESCENT TRAINING OF RELU NETWORKS: EXPLANATION, QUANTIFICATION, AND AVOIDANCE SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2021, 43 (05): : A3438 - A3468
- [24] Impact of Mathematical Norms on Convergence of Gradient Descent Algorithms for Deep Neural Networks Learning AI 2022: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, 13728 : 131 - 144
- [26] Unboundedness of Linear Regions of Deep ReLU Neural Networks DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2022 WORKSHOPS, 2022, 1633 : 3 - 10
- [27] Linear Convergence of Gradient Descent for Finite Width Over-parametrized Linear Networks with General Initialization INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 206, 2023, 206
- [29] The global convergence of a descent PRP conjugate gradient method COMPUTATIONAL & APPLIED MATHEMATICS, 2012, 31 (01): : 59 - 83
- [30] Convergence of Stochastic Gradient Descent in Deep Neural Network Acta Mathematicae Applicatae Sinica, English Series, 2021, 37 : 126 - 136