Understanding Weight Normalized Deep Neural Networks with Rectified Linear Units

被引:0
|
作者
Xu, Yixi [1 ]
Wang, Xiao [1 ]
机构
[1] Purdue Univ, Dept Stat, W Lafayette, IN 47907 USA
关键词
CLASSIFICATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a general framework for norm-based capacity control for L-p,(q) weight normalized deep neural networks. We establish the upper bound on the Rademacher complexities of this family. With an L-p,(q) normalization where q <= p* and 1/p +1/p* = 1, we discuss properties of a width-independent capacity control, which only depends on the depth by a square root term. We further analyze the approximation properties of L-p,(q) weight normalized deep neural networks. In particular, for an L-i,L-infinity weight normalized network, the approximation error can be controlled by the L-1 norm of the output layer, and the corresponding generalization error only depends on the architecture by the square root of the depth.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Improving Deep Neural Networks Using Softplus Units
    Zheng, Hao
    Yang, Zhanlei
    Liu, Wenju
    Liang, Jizhong
    Li, Yanpeng
    [J]. 2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
  • [42] Hierarchical Weight Averaging for Deep Neural Networks
    Gu, Xiaozhe
    Zhang, Zixun
    Jiang, Yuncheng
    Luo, Tao
    Zhang, Ruimao
    Cui, Shuguang
    Li, Zhen
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (09) : 12276 - 12287
  • [43] Solving Parametric Partial Differential Equations with Deep Rectified Quadratic Unit Neural Networks
    Zhen Lei
    Lei Shi
    Chenyu Zeng
    [J]. Journal of Scientific Computing, 2022, 93
  • [44] Solving Parametric Partial Differential Equations with Deep Rectified Quadratic Unit Neural Networks
    Lei, Zhen
    Shi, Lei
    Zeng, Chenyu
    [J]. JOURNAL OF SCIENTIFIC COMPUTING, 2022, 93 (03)
  • [45] Properties of the Geometry of Solutions and Capacity of Multilayer Neural Networks with Rectified Linear Unit Activations
    Baldassi, Carlo
    Malatesta, Enrico M.
    Zecchina, Riccardo
    [J]. PHYSICAL REVIEW LETTERS, 2019, 123 (17)
  • [46] Adaptive Normalized Risk-Averting Training for Deep Neural Networks
    Wang, Zhiguang
    Oates, Tim
    Lo, James
    [J]. THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 2201 - 2207
  • [47] Piecewise linear neural networks and deep learning
    Qinghua Tao
    Li Li
    Xiaolin Huang
    Xiangming Xi
    Shuning Wang
    Johan A. K. Suykens
    [J]. Nature Reviews Methods Primers, 2
  • [48] Piecewise linear neural networks and deep learning
    Tao, Qinghua
    Li, Li
    Huang, Xiaolin
    Xi, Xiangming
    Wang, Shuning
    Suykens, Johan A. K.
    [J]. NATURE REVIEWS METHODS PRIMERS, 2022, 2 (01):
  • [49] Piecewise linear neural networks and deep learning
    [J]. Nature Reviews Methods Primers, 2 (1):
  • [50] On the Number of Linear Regions of Deep Neural Networks
    Montufar, Guido
    Pascanu, Razvan
    Cho, Kyunghyun
    Bengio, Yoshua
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27