50 items in total
- [11] Implicit Bias of Gradient Descent for Two-layer ReLU and Leaky ReLU Networks on Nearly-orthogonal Data. Advances in Neural Information Processing Systems 36 (NeurIPS 2023), 2023
- [12] Implicit Bias of (Stochastic) Gradient Descent for Rank-1 Linear Neural Network. Advances in Neural Information Processing Systems 36 (NeurIPS 2023), 2023
- [14] Stochastic Gradient Descent on Separable Data: Exact Convergence with a Fixed Learning Rate. 22nd International Conference on Artificial Intelligence and Statistics, Vol. 89, 2019
- [15] A Coordinate Gradient Descent Method for Nonsmooth Separable Minimization. Mathematical Programming, 2009, 117: 387-423
- [17] Bias of Homotopic Gradient Descent for the Hinge Loss. Applied Mathematics & Optimization, 2021, 84 (1): 621-647
- [20] The Implicit Regularization of Momentum Gradient Descent in Overparametrized Models. Thirty-Seventh AAAI Conference on Artificial Intelligence, Vol. 37 No. 8, 2023: 10149-10156