A SCALE INVARIANT MEASURE OF FLATNESS FOR DEEP NETWORK MINIMA

被引:3
|
作者
Rangamani, Akshay [1 ]
Nguyen, Nam H. [2 ]
Kumar, Abhishek [3 ]
Dzung Phan [2 ]
Chin, Sang [4 ]
Tran, Trac D. [5 ]
机构
[1] MIT, Ctr Brains Minds & Machines, Cambridge, MA 02139 USA
[2] IBM Res, Armonk, NY USA
[3] Google Brain, Mountain View, CA USA
[4] Boston Univ, CS Dept, Boston, MA 02215 USA
[5] Johns Hopkins Univ, ECE Dept, Baltimore, MD 21218 USA
来源
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021) | 2021年
关键词
Deep Learning; Generalization; Flat Minima; Riemannian Quotient Manifolds;
D O I
10.1109/ICASSP39728.2021.9413771
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
It has been empirically observed that the flatness of minima obtained from training deep networks seems to correlate with better generalization. However, for deep networks with positively homogeneous activations, most measures of flatness are not invariant to rescaling of the network parameters. This means that the measure of flatness can be made as small or as large as possible through rescaling, rendering the quantitative measures meaningless. In this paper we show that for deep networks with positively homogenous activations, these rescalings constitute equivalence relations, and that these equivalence relations induce a quotient manifold structure in the parameter space. Using an appropriate Riemannian metric, we propose a Hessian-based measure for flatness that is invariant to rescaling and perform simulations to empirically verify our claim. Finally we perform experiments to verify that our flatness measure correlates with generalization by using minibatch stochastic gradient descent with different batch sizes to find deep network minima with different generalization properties.
引用
收藏
页码:1680 / 1684
页数:5
相关论文
共 50 条
  • [1] Minima of classically scale-invariant potentials
    Kristjan Kannike
    Kaius Loos
    Luca Marzola
    Journal of High Energy Physics, 2021
  • [2] Minima of classically scale-invariant potentials
    Kannike, Kristjan
    Loos, Kaius
    Marzola, Luca
    JOURNAL OF HIGH ENERGY PHYSICS, 2021, 2021 (06)
  • [3] A scale invariant measure of clutter
    Bravo, Mary J.
    Farid, Hany
    JOURNAL OF VISION, 2008, 8 (01):
  • [4] Flatness of minima in random inflationary landscapes
    He, Yang-Hui
    Jejjala, Vishnu
    Pontiggia, Luca
    Xiao, Yan
    Zhou, Da
    INTERNATIONAL JOURNAL OF MODERN PHYSICS A, 2019, 34 (17):
  • [5] A scale invariant distance measure for texture retrieval
    O' Callaghan, RJ
    Bull, DR
    2002 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL I, PROCEEDINGS, 2002, : 425 - 428
  • [6] ON THE SINGULAR N-POINT MOTION OF A BROWNIAN FLOW - ASYMPTOTIC FLATNESS AND INVARIANT MEASURE
    BASAK, G
    KANNAN, D
    STOCHASTIC ANALYSIS AND APPLICATIONS, 1993, 11 (04) : 369 - 397
  • [7] Stretched shadings and a Banach measure that is not scale-invariant
    Mabry, Richard D.
    FUNDAMENTA MATHEMATICAE, 2010, 209 (02) : 95 - 113
  • [8] Activity Recognition Using Deep Recurrent Neural Network on Translation and Scale-Invariant Features
    Uddin, Md. Zia
    Khaksar, Weria
    Torresen, Jim
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 475 - 479
  • [9] GETTING THE MEASURE OF THE FLATNESS PROBLEM
    EVRARD, G
    COLES, P
    CLASSICAL AND QUANTUM GRAVITY, 1995, 12 (10) : L93 - L97
  • [10] Caloric measure and Reifenberg flatness
    Nystrom, Kaj
    ANNALES ACADEMIAE SCIENTIARUM FENNICAE-MATHEMATICA, 2006, 31 (02) : 405 - 436