Inductive Bias of Multi-Channel Linear Convolutional Networks with BoundedWeight Norm

被引:0
|
作者
Jagadeesan, Meena [1 ]
Razenshteyn, Ilya [2 ]
Gunasekar, Suriya [3 ]
机构
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
[2] CipherMode Labs, Los Angeles, CA USA
[3] Microsoft Res, Mountain View, CA USA
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We provide a function space characterization of the inductive bias resulting from minimizing the l(2) norm of the weights in multi-channel convolutional neural networks with linear activations and empirically test our resulting hypothesis on ReLU networks trained using gradient descent. We define an induced regularizer in the function space as the minimum l(2) norm of weights of a network required to realize a function. For two layer linear convolutional networks with C output channels and kernel size K, we show the following: (a) If the inputs to the network are single channeled, the induced regularizer for any K is independent of the number of output channels C. Furthermore, we derive the regularizer is a norm given by a semidefinite program (SDP). (b) In contrast, for multi-channel inputs, multiple output channels can be necessary to merely realize all matrix-valued linear functions and thus the inductive bias does depend on C. However, for sufficiently large C, the induced regularizer is again given by an SDP that is independent of C. In particular, the induced regularizer for K = 1 and K = D (input dimension) are given in closed form as the nuclear norm and the l(2,1) group-sparse norm, respectively, of the Fourier coefficients of the linear predictor. We investigate the broader applicability of our theoretical results to implicit regularization from gradient descent on linear and ReLU networks through experiments on MNIST and CIFAR-10 datasets.
引用
收藏
页数:50
相关论文
共 50 条
  • [21] Multi-channel speech enhancement using early and late fusion convolutional neural networks
    Priyanka, S. Siva
    Kumar, T. Kishore
    SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (04) : 973 - 979
  • [22] Segmentation of Brain Tumor Tissues in Multi-channel MRI Using Convolutional Neural Networks
    Naveena, C.
    Poornachandra, S.
    Aradhya, V. N. Manjunath
    BRAIN INFORMATICS, BI 2020, 2020, 12241 : 128 - 137
  • [23] Stacked fully convolutional networks with multi-channel learning: application to medical image segmentation
    Lei Bi
    Jinman Kim
    Ashnil Kumar
    Michael Fulham
    Dagan Feng
    The Visual Computer, 2017, 33 : 1061 - 1071
  • [24] Semantic Segmentation of Multi-Channel Polycrystalline Structure Micrographs Using Convolutional Neural Networks
    Selmaier, Andreas
    Lutz, Benjamin
    Kisskalt, Dominik
    Boernicke, Simon
    Fuerst, Jens
    Franke, Joerg
    20TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2021), 2021, : 847 - 852
  • [25] Classifying Sightseeing Tweets using Convolutional Neural Networks with Multi-Channel Distributed Representation
    Hashida, Shuichi
    Tamura, Keiichi
    Sakai, Tatsuhiro
    2018 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2018, : 178 - 183
  • [26] Multi-channel speech enhancement using early and late fusion convolutional neural networks
    S. Siva Priyanka
    T. Kishore Kumar
    Signal, Image and Video Processing, 2023, 17 : 973 - 979
  • [27] Acoustic Scene Classification Based on Dense Convolutional Networks Incorporating Multi-channel Features
    Wang, Dezhi
    Zhang, Lilun
    Xu, Kele
    Wang, Yongxian
    2018 3RD INTERNATIONAL CONFERENCE ON COMMUNICATION, IMAGE AND SIGNAL PROCESSING, 2019, 1169
  • [28] PCA-aided Fully Convolutional Networks for Semantic Segmentation of Multi-channel fMRI
    Tai, Lei
    Ye, Haoyang
    Ye, Qiong
    Liu, Ming
    2017 18TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS (ICAR), 2017, : 124 - 130
  • [29] Author Identification of Micro-Messages via Multi-Channel Convolutional Neural Networks
    Aykent, Sarp
    Dozier, Gerry
    2020 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2020, : 675 - 681
  • [30] Spatial Steganalysis Based on Non-Local Block and Multi-Channel Convolutional Networks
    Han, Xu
    Zhang, Tao
    IEEE ACCESS, 2022, 10 : 87241 - 87253