Inductive Bias of Multi-Channel Linear Convolutional Networks with BoundedWeight Norm

被引:0
|
作者
Jagadeesan, Meena [1 ]
Razenshteyn, Ilya [2 ]
Gunasekar, Suriya [3 ]
机构
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
[2] CipherMode Labs, Los Angeles, CA USA
[3] Microsoft Res, Mountain View, CA USA
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We provide a function space characterization of the inductive bias resulting from minimizing the l(2) norm of the weights in multi-channel convolutional neural networks with linear activations and empirically test our resulting hypothesis on ReLU networks trained using gradient descent. We define an induced regularizer in the function space as the minimum l(2) norm of weights of a network required to realize a function. For two layer linear convolutional networks with C output channels and kernel size K, we show the following: (a) If the inputs to the network are single channeled, the induced regularizer for any K is independent of the number of output channels C. Furthermore, we derive the regularizer is a norm given by a semidefinite program (SDP). (b) In contrast, for multi-channel inputs, multiple output channels can be necessary to merely realize all matrix-valued linear functions and thus the inductive bias does depend on C. However, for sufficiently large C, the induced regularizer is again given by an SDP that is independent of C. In particular, the induced regularizer for K = 1 and K = D (input dimension) are given in closed form as the nuclear norm and the l(2,1) group-sparse norm, respectively, of the Fourier coefficients of the linear predictor. We investigate the broader applicability of our theoretical results to implicit regularization from gradient descent on linear and ReLU networks through experiments on MNIST and CIFAR-10 datasets.
引用
收藏
页数:50
相关论文
共 50 条
  • [11] AM-GCN: Adaptive Multi-channel Graph Convolutional Networks
    Wang, Xiao
    Zhu, Meiqi
    Bo, Deyu
    Cui, Peng
    Shi, Chuan
    Pei, Jian
    KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 1243 - 1253
  • [12] Multi-Channel Graph Convolutional Networks for Graphs with Inconsistent Structures and Features
    Chang, Xinglong
    Wang, Jianrong
    Wang, Rui
    Wang, Tao
    Wang, Yingkui
    Li, Weihao
    ELECTRONICS, 2024, 13 (03)
  • [13] Multi-channel lung sound classification with convolutional recurrent neural networks
    Messner, Elmar
    Fediuk, Melanie
    Swatek, Paul
    Scheidl, Stefan
    Smolle-Juettner, Freyja-Maria
    Olschewski, Horst
    Pernkopf, Franz
    COMPUTERS IN BIOLOGY AND MEDICINE, 2020, 122
  • [14] Explicit Inductive Bias for Transfer Learning with Convolutional Networks
    Li, Xuhong
    Grandvalet, Yves
    Davoine, Franck
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [15] Multi-Channel Weather Radar Echo Extrapolation with Convolutional Recurrent Neural Networks
    Tran, Quang-Khai
    Song, Sa-kwang
    REMOTE SENSING, 2019, 11 (19)
  • [16] Classifying tweets using convolutional neural networks with multi-channel distributed representation
    Hashida, Shuichi
    Tamura, Keiichi
    Sakai, Tatsuhiro
    IAENG International Journal of Computer Science, 2019, 46 (01)
  • [17] Vehicle counting in crowded scenes with multi-channel and multi-task convolutional neural networks
    Sun, Maojin
    Wang, Yan
    Li, Teng
    Lv, Jing
    Wu, Jun
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2017, 49 : 412 - 419
  • [18] Convolutional Dictionary Learning for Multi-Channel Signals
    Garcia-Cardona, Cristina
    Wohlberg, Brendt
    2018 CONFERENCE RECORD OF 52ND ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2018, : 335 - 342
  • [19] A natural geometric norm for multi-channel image processing
    Kimmel, R
    MATHEMATICAL METHODS FOR CURVES AND SURFACES II, 1998, : 271 - 278
  • [20] Channel Grouping Architecture for Multi-Channel Networks
    Baziana, Peristera A.
    PROCEEDINGS OF THE 2017 IEEE SECOND INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND COMMUNICATION TECHNOLOGIES (ICECCT), 2017,