On the Iteration Complexity of Hypergradient Computation

被引:0
|
作者
Grazzi, Riccardo [1 ,2 ]
Franceschi, Luca [1 ,2 ]
Pontil, Massimiliano [1 ,2 ]
Salzo, Saverio [1 ]
机构
[1] Ist Italiano Tecnol, Computat Stat & Machine Learning, Genoa, Italy
[2] UCL, Dept Comp Sci, London, England
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We study a general class of bilevel problems, consisting in the minimization of an upper-level objective which depends on the solution to a parametric fixed-point equation. Important instances arising in machine learning include hyperparameter optimization, meta-learning, and certain graph and recurrent neural networks. Typically the gradient of the upper-level objective (hypergradient) is hard or even impossible to compute exactly, which has raised the interest in approximation methods. We investigate some popular approaches to compute the hypergradient, based on reverse mode iterative differentiation and approximate implicit differentiation. Under the hypothesis that the fixed point equation is defined by a contraction mapping, we present a unified analysis which allows for the first time to quantitatively compare these methods, providing explicit bounds for their iteration complexity. This analysis suggests a hierarchy in terms of computational efficiency among the above methods, with approximate implicit differentiation based on conjugate gradient performing best. We present an extensive experimental comparison among the methods which confirm the theoretical findings.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] On the Complexity of Symbolic Computation
    van der Hoeven, Joris
    PROCEEDINGS OF THE 2022 INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND ALGEBRAIC COMPUTATION, ISSAC 2022, 2022, : 3 - 12
  • [22] The Complexity of Stratification Computation
    E. Rannou
    Discrete & Computational Geometry, 1998, 19 : 47 - 78
  • [23] ON THE COMPLEXITY OF INCREMENTAL COMPUTATION
    CHARI, S
    RANJAN, D
    ROHATGI, P
    LECTURE NOTES IN COMPUTER SCIENCE, 1992, 629 : 172 - 180
  • [24] Complexity in Dynamics and Computation
    Masanori Ohya
    Acta Applicandae Mathematica, 2000, 63 : 293 - 306
  • [25] Energy Complexity of Computation
    Say, Ahmet Celal Cem
    REVERSIBLE COMPUTATION, RC 2023, 2023, 13960 : 3 - 11
  • [26] Complexity in dynamics and computation
    Ohya, M
    ACTA APPLICANDAE MATHEMATICAE, 2000, 63 (1-3) : 293 - 306
  • [27] Iteration and Parallel Computation on Computational Fluid Dynamics
    Huang Xin-cheng
    Xiao Ai-ling
    2014 7TH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTATION TECHNOLOGY AND AUTOMATION (ICICTA), 2014, : 318 - 321
  • [28] Precise fixpoint computation through strategy iteration
    Gawlitza, Thomas
    Seidl, Helmut
    PROGRAMMING LANGUAGES AND SYSTEMS, PROCEEDINGS, 2007, 4421 : 300 - +
  • [29] An iteration method for computation of flexible fender piles
    Hu, LW
    Liu, XZ
    CHINA OCEAN ENGINEERING, 1998, 12 (04) : 435 - 442
  • [30] Incremental Iteration Method for Fast PageRank Computation
    Kim, Kyung Soo
    Choi, Yong Suk
    ACM IMCOM 2015, Proceedings, 2015,