On the regularization of convolutional kernel tensors in neural networks

被引:2
|
作者
Guo, Pei-Chang [1 ]
Ye, Qiang [2 ]
机构
[1] China Univ Geosci, Sch Sci, Beijing 100083, Peoples R China
[2] Univ Kentucky, Dept Math, Lexington, KY USA
来源
LINEAR & MULTILINEAR ALGEBRA | 2022年 / 70卷 / 12期
关键词
Penalty function; transformation matrix; convolutional layers; generalizability; unstable gradient;
D O I
10.1080/03081087.2020.1795058
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Convolutional neural network is an important model in deep learning, where a convolution operation can be represented by a tensor. To avoid exploding/vanishing gradient problems and to improve the generalizability of a neural network, it is desirable to have a convolution operation that nearly preserves the norm, or to have the singular values of the transformation matrix corresponding to the tensor bounded around 1. We propose a penalty function that can constrain the singular values of the transformation matrix to be around 1. We derive an algorithm to carry out the gradient descent minimization of this penalty function in terms of convolution kernel tensors. Numerical examples are presented to demonstrate the effectiveness of the method.
引用
收藏
页码:2318 / 2330
页数:13
相关论文
共 50 条
  • [1] New regularization methods for convolutional kernel tensors
    Guo, Pei-Chang
    [J]. AIMS MATHEMATICS, 2023, 8 (11): : 26188 - 26198
  • [2] Convolutional Neural Networks With Dynamic Regularization
    Wang, Yi
    Bian, Zhen-Peng
    Hou, Junhui
    Chau, Lap-Pui
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (05) : 2299 - 2304
  • [3] Kernel Pooling for Convolutional Neural Networks
    Cui, Yin
    Zhou, Feng
    Wang, Jiang
    Liu, Xiao
    Lin, Yuanqing
    Belongie, Serge
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 3049 - 3058
  • [4] Kernel Graph Convolutional Neural Networks
    Nikolentzos, Giannis
    Meladianos, Polykarpos
    Tixier, Antoine Jean-Pierre
    Skianis, Konstantinos
    Vazirgiannis, Michalis
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT I, 2018, 11139 : 22 - 32
  • [5] Multiscale Conditional Regularization for Convolutional Neural Networks
    Lu, Yao
    Lu, Guangming
    Li, Jinxing
    Xu, Yuanrong
    Zhang, Zheng
    Zhang, David
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (01) : 444 - 458
  • [6] LMix: regularization strategy for convolutional neural networks
    Yan, Linyu
    Zheng, Kunpeng
    Xia, Jinyao
    Li, Ke
    Ling, Hefei
    [J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (04) : 1245 - 1253
  • [7] REGULARIZATION OF CONVOLUTIONAL NEURAL NETWORKS USING SHUFFLENODE
    Chen, Yihao
    Wang, Hanli
    Long, Yu
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, : 355 - 360
  • [8] LMix: regularization strategy for convolutional neural networks
    Linyu Yan
    Kunpeng Zheng
    Jinyao Xia
    Ke Li
    Hefei Ling
    [J]. Signal, Image and Video Processing, 2023, 17 : 1245 - 1253
  • [9] The Kernel Dynamics of Convolutional Neural Networks in Manifolds
    WU Wei
    JING Xiaoyuan
    DU Wencai
    [J]. Chinese Journal of Electronics, 2020, 29 (06) : 1185 - 1192
  • [10] Vector-kernel convolutional neural networks
    Ou, Jun
    Li, Yujian
    [J]. NEUROCOMPUTING, 2019, 330 : 253 - 258