Improving generalization of deep neural networks by leveraging margin distribution

被引:9
|
作者
Lyu, Shen-Huan [1 ]
Wang, Lu [1 ]
Zhou, Zhi-Hua [1 ]
机构
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing 210023, Peoples R China
关键词
Deep neural network; Margin theory; Generalization;
D O I
10.1016/j.neunet.2022.03.019
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent research has used margin theory to analyze the generalization performance for deep neural networks (DNNs). The existed results are almost based on the spectrally-normalized minimum margin. However, optimizing the minimum margin ignores a mass of information about the entire margin distribution, which is crucial to generalization performance. In this paper, we prove a generalization upper bound dominated by the statistics of the entire margin distribution. Compared with the minimum margin bounds, our bound highlights an important measure for controlling the complexity, which is the ratio of the margin standard deviation to the expected margin. We utilize a convex margin distribution loss function on the deep neural networks to validate our theoretical results by optimizing the margin ratio. Experiments and visualizations confirm the effectiveness of our approach and the correlation between generalization gap and margin ratio. (c) 2022 Elsevier Ltd. All rights reserved.
引用
收藏
页码:48 / 60
页数:13
相关论文
共 50 条
  • [1] Leveraging the Generalization Ability of Deep Convolutional Neural Networks for Improving Classifiers for Color Fundus Photographs
    Son, Jaemin
    Kim, Jaeyoung
    Kong, Seo Taek
    Jung, Kyu-Hwan
    APPLIED SCIENCES-BASEL, 2021, 11 (02): : 1 - 10
  • [2] Improving the Generalization of Deep Neural Networks in Seismic Resolution Enhancement
    Zhang, Haoran
    Alkhalifah, Tariq
    Liu, Yang
    Birnie, Claire
    Di, Xi
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [3] Improving the Generalization of Deep Neural Networks in Seismic Resolution Enhancement
    Zhang, Haoran
    Alkhalifah, Tariq
    Liu, Yang
    Birnie, Claire
    Di, Xi
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [4] Generalization Error of Deep Neural Networks: Role of Classification Margin and Data Structure
    Sokolic, Jure
    Giryes, Raja
    Sapiro, Guillermo
    Rodrigues, Miguel R. D.
    2017 INTERNATIONAL CONFERENCE ON SAMPLING THEORY AND APPLICATIONS (SAMPTA), 2017, : 147 - 151
  • [5] Improving Generalization Ability of Deep Neural Networks for Visual Recognition Tasks
    Okatani, Takayuki
    Liu, Xing
    Suganuma, Masanori
    COMPUTATIONAL COLOR IMAGING, CCIW 2019, 2019, 11418 : 3 - 13
  • [6] Improving adversarial robustness of deep neural networks via adaptive margin evolution
    Ma, Linhai
    Liang, Liang
    NEUROCOMPUTING, 2023, 551
  • [7] Improving generalization capabilities of dynamic neural networks
    Galicki, M
    Leistritz, L
    Zwick, EB
    Witte, H
    NEURAL COMPUTATION, 2004, 16 (06) : 1253 - 1282
  • [8] Interaction of Generalization and Out-of-Distribution Detection Capabilities in Deep Neural Networks
    Aboitiz, Francisco Javier Klaiber
    Legenstein, Robert
    Oezdenizci, Ozan
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PART X, 2023, 14263 : 248 - 259
  • [9] Staining Invariant Features for Improving Generalization of Deep Convolutional Neural Networks in Computational Pathology
    Otalora, Sebastian
    Atzori, Manfredo
    Andrearczyk, Vincent
    Khan, Amjad
    Mueller, Henning
    FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, 2019, 7 (AUG):
  • [10] Improving the Generalization Ability of Deep Neural Networks for Cross-Domain Visual Recognition
    Zheng, Jianwei
    Lu, Chao
    Hao, Cong
    Chen, Deming
    Guo, Donghui
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2021, 13 (03) : 607 - 620