On the Iteration Complexity of Hypergradient Computation

被引:0
|
作者
Grazzi, Riccardo [1 ,2 ]
Franceschi, Luca [1 ,2 ]
Pontil, Massimiliano [1 ,2 ]
Salzo, Saverio [1 ]
机构
[1] Ist Italiano Tecnol, Computat Stat & Machine Learning, Genoa, Italy
[2] UCL, Dept Comp Sci, London, England
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We study a general class of bilevel problems, consisting in the minimization of an upper-level objective which depends on the solution to a parametric fixed-point equation. Important instances arising in machine learning include hyperparameter optimization, meta-learning, and certain graph and recurrent neural networks. Typically the gradient of the upper-level objective (hypergradient) is hard or even impossible to compute exactly, which has raised the interest in approximation methods. We investigate some popular approaches to compute the hypergradient, based on reverse mode iterative differentiation and approximate implicit differentiation. Under the hypothesis that the fixed point equation is defined by a contraction mapping, we present a unified analysis which allows for the first time to quantitatively compare these methods, providing explicit bounds for their iteration complexity. This analysis suggests a hierarchy in terms of computational efficiency among the above methods, with approximate implicit differentiation based on conjugate gradient performing best. We present an extensive experimental comparison among the methods which confirm the theoretical findings.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] On the Iteration Complexity of Hypergradient Computation
    Grazzi, Riccardo
    Franceschi, Luca
    Pontil, Massimiliano
    Salzo, Saverio
    25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019), 2019,
  • [2] ITERATION AND EVALUATION COMPLEXITY FOR THE MINIMIZATION OF FUNCTIONS WHOSE COMPUTATION IS INTRINSICALLY INEXACT
    Birgin, E. G.
    Krejic, N.
    Martinez, J. M.
    MATHEMATICS OF COMPUTATION, 2020, 89 (321) : 253 - 278
  • [3] Complexity in iteration of polynomials
    Yu, Zhiheng
    Liu, Liu
    AEQUATIONES MATHEMATICAE, 2019, 93 (05) : 985 - 1007
  • [4] Complexity in iteration of polynomials
    Zhiheng Yu
    Liu Liu
    Aequationes mathematicae, 2019, 93 : 985 - 1007
  • [5] On the complexity of policy iteration
    Mansour, Y
    Singh, S
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 1999, : 401 - 408
  • [6] Efficient computation of the iteration of functions
    Tsai, Tsung-Hsi
    THEORETICAL COMPUTER SCIENCE, 2009, 410 (8-10) : 988 - 993
  • [7] Federated Hypergradient Descent
    Kan, Andrew K.
    arXiv, 2022,
  • [8] Iteration, tatonnement, computation and economic dynamics
    Velupillai, K. Vela
    CAMBRIDGE JOURNAL OF ECONOMICS, 2015, 39 (06) : 1551 - 1567
  • [9] Iteration complexity of generalized complementarity problems
    Salahuddin
    Alesemi, Meshari
    JOURNAL OF INEQUALITIES AND APPLICATIONS, 2019, 2019 (1)
  • [10] Iteration complexity of generalized complementarity problems
    Meshari Salahuddin
    Journal of Inequalities and Applications, 2019