GLOBALLY CONVERGENT MULTILEVEL TRAINING OF DEEP RESIDUAL NETWORKS

被引:6
|
作者
Kopanicakova, Alena [1 ]
Krause, Rolf [1 ]
机构
[1] Univ Svizzera italiana, Euler Inst, Lugano, Switzerland
来源
SIAM JOURNAL ON SCIENTIFIC COMPUTING | 2023年 / 45卷 / 03期
基金
瑞士国家科学基金会;
关键词
trustregion methods; multilevel minimization; deep residual networks; training algorithm; TRUST-REGION METHODS; OPTIMIZATION; ALGORITHMS;
D O I
10.1137/21M1434076
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
We propose a globally convergent multilevel training method for deep residual networks (ResNets). The devised method can be seen as a novel variant of the recursive multilevel trustregion (RMTR) method, which operates in hybrid (stochastic-deterministic) settings by adaptively adjusting minibatch sizes during the training. The multilevel hierarchy and the transfer operators are constructed by exploiting a dynamical system's viewpoint, which interprets forward propagation through the ResNet as a forward Euler discretization of an initial value problem. In contrast to traditional training approaches, our novel RMTR method also incorporates curvature information on all levels of the multilevel hierarchy by means of the limited-memory SR1 method. The overall performance and the convergence properties of the our multilevel training method are numerically investigated using examples from the field of classification and regression.
引用
收藏
页码:S254 / S280
页数:27
相关论文
共 50 条
  • [41] Globally Convergent Modification of the Quickprop Method
    Michael N. Vrahatis
    George D. Magoulas
    Vassilis P. Plagianakos
    Neural Processing Letters, 2000, 12 : 159 - 170
  • [42] A globally convergent ball Stirling method
    Sen, R
    Guhathakurta, P
    APPLIED NUMERICAL MATHEMATICS, 2004, 51 (2-3) : 329 - 340
  • [43] A globally convergent incremental Newton method
    M. Gürbüzbalaban
    A. Ozdaglar
    P. Parrilo
    Mathematical Programming, 2015, 151 : 283 - 313
  • [44] A GLOBALLY CONVERGENT STABILIZED SQP METHOD
    Gill, Philip E.
    Robinson, Daniel P.
    SIAM JOURNAL ON OPTIMIZATION, 2013, 23 (04) : 1983 - 2010
  • [45] Globally convergent algorithms for unconstrained optimization
    Shi, YX
    COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 2000, 16 (03) : 295 - 308
  • [46] Making Network Solvers Globally Convergent
    Clees, Tanja
    Nikitin, Igor
    Nikitinao, Lialia
    SIMULATION AND MODELING METHODOLOGIES, TECHNOLOGIES AND APPLICATIONS, SIMULTECH 2016, 2018, 676 : 140 - 153
  • [47] A Globally Convergent Algorithm of Variational Inequality
    Yin, Chengjiang
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2012, 3 (04) : 73 - 75
  • [48] A globally convergent adaptive IIR filter
    David, A
    Aboulnasr, T
    ISCAS 2000: IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS - PROCEEDINGS, VOL III: EMERGING TECHNOLOGIES FOR THE 21ST CENTURY, 2000, : 531 - 534
  • [49] A globally convergent incremental Newton method
    Guerbuzbalaban, M.
    Ozdaglar, A.
    Parrilo, P.
    MATHEMATICAL PROGRAMMING, 2015, 151 (01) : 283 - 313
  • [50] GLOBALLY CONVERGENT HOMOTOPY METHODS - A TUTORIAL
    WATSON, LT
    APPLIED MATHEMATICS AND COMPUTATION, 1989, 31 : 369 - 396