Improving generalization of deep neural networks by leveraging margin distribution

被引:9
|
作者
Lyu, Shen-Huan [1 ]
Wang, Lu [1 ]
Zhou, Zhi-Hua [1 ]
机构
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing 210023, Peoples R China
关键词
Deep neural network; Margin theory; Generalization;
D O I
10.1016/j.neunet.2022.03.019
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent research has used margin theory to analyze the generalization performance for deep neural networks (DNNs). The existed results are almost based on the spectrally-normalized minimum margin. However, optimizing the minimum margin ignores a mass of information about the entire margin distribution, which is crucial to generalization performance. In this paper, we prove a generalization upper bound dominated by the statistics of the entire margin distribution. Compared with the minimum margin bounds, our bound highlights an important measure for controlling the complexity, which is the ratio of the margin standard deviation to the expected margin. We utilize a convex margin distribution loss function on the deep neural networks to validate our theoretical results by optimizing the margin ratio. Experiments and visualizations confirm the effectiveness of our approach and the correlation between generalization gap and margin ratio. (c) 2022 Elsevier Ltd. All rights reserved.
引用
收藏
页码:48 / 60
页数:13
相关论文
共 50 条
  • [31] IMPROVING THE GENERALIZATION OF NEURAL NETWORKS BY CHANGING THE STRUCTURE OF ARTIFICIAL NEURON
    Daliri, Mohammad Reza
    Fattan, Mehdi
    MALAYSIAN JOURNAL OF COMPUTER SCIENCE, 2011, 24 (04) : 195 - 204
  • [32] A Novel Ensemble Approach for Improving Generalization Ability of Neural Networks
    Lu, Lei
    Zeng, Xiaoqin
    Wu, Shengli
    Zhong, Shuiming
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2008, 2008, 5326 : 164 - +
  • [33] Improving Generalization of Deep Networks for Inverse Reconstruction of Image Sequences
    Ghimire, Sandesh
    Kumar, Prashnna
    Dhamala, Gyawali Jwala
    Sapp, John L.
    Horacek, Milan
    Wang, Linwei
    INFORMATION PROCESSING IN MEDICAL IMAGING, IPMI 2019, 2019, 11492 : 153 - 166
  • [34] Improving the Generalization Properties of Radial Basis Function Neural Networks
    Bishop, Chris
    NEURAL COMPUTATION, 1991, 3 (04) : 579 - 588
  • [35] Mutual Information Generation for Improving Generalization and Interpretation in Neural Networks
    Kamimura, Ryotaro
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [36] A distributed genetic algorithm improving the generalization behavior of neural networks
    Branke, J
    Kohlmorgen, U
    Schmeck, H
    MACHINE LEARNING: ECML-95, 1995, 912 : 107 - 121
  • [37] Fuzzification of input vectors for improving the generalization ability of neural networks
    Ishibuchi, H
    Nii, M
    1998 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AT THE IEEE WORLD CONGRESS ON COMPUTATIONAL INTELLIGENCE - PROCEEDINGS, VOL 1-2, 1998, : 1153 - 1158
  • [38] Improving the Generalization Properties of Neural Networks: an Application to Vehicle Detection
    Ludwig Junior, Oswaldo
    Nunes, Urbano
    PROCEEDINGS OF THE 11TH INTERNATIONAL IEEE CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, 2008, : 310 - 315
  • [39] Improving generalization performance of artificial neural networks with genetic algorithms
    Wu, JS
    Liu, MZ
    2005 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, VOLS 1 AND 2, 2005, : 288 - 291
  • [40] Improving neural networks generalization with new constructive and pruning methods
    Costa, MA
    Braga, AP
    de Menezes, BR
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2002, 13 (2-4) : 75 - 83