Is My Neural Net Driven by the MDL Principle?

被引:0
|
作者
Brandao, Eduardo [1 ]
Duffner, Stefan [2 ]
Emonet, Remi [1 ]
Habrard, Amaury [1 ,3 ]
Jacquenet, Francois [1 ]
Sebban, Marc [1 ]
机构
[1] Univ Jean Monnet St Etienne, CNRS, Inst Opt Grad Sch, Lab Hubert Curien,UMR 5516, F-42023 St Etienne, France
[2] Univ Lyon, CNRS, INSA Lyon, LIRIS,UMR5205, F-69621 Villeurbanne, France
[3] Inst Univ France IUF, Paris, France
关键词
Neural Networks; MDL; Signal-Noise; Point Jacobians; MINIMUM DESCRIPTION LENGTH;
D O I
10.1007/978-3-031-43415-0_11
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Minimum Description Length principle (MDL) is a formalization of Occam's razor for model selection, which states that a good model is one that can losslessly compress the data while including the cost of describing the model itself. While MDL can naturally express the behavior of certain models such as autoencoders (that inherently compress data) most representation learning techniques do not rely on such models. Instead, they learn representations by training on general or, for self-supervised learning, pretext tasks. In this paper, we propose a new formulation of the MDL principle that relies on the concept of signal and noise, which are implicitly defined by the learning task at hand. Additionally, we introduce ways to empirically measure the complexity of the learned representations by analyzing the spectra of the point Jacobians. Under certain assumptions, we show that the singular values of the point Jacobians of Neural Networks driven by the MDL principle should follow either a power law or a lognormal distribution. Finally, we conduct experiments to evaluate the behavior of the proposed measure applied to deep neural networks on different datasets, with respect to several types of noise. We observe that the experimental spectral distribution is in agreement with the spectral distribution predicted by our MDL principle, which suggests that neural networks trained with gradient descent on noisy data implicitly abide the MDL principle.
引用
收藏
页码:173 / 189
页数:17
相关论文
共 50 条
  • [41] Botnet Detection Based on Non-negative Matrix Factorization and the MDL Principle
    Yamauchi, Sayaka
    Kawakita, Masanori
    Takeuchi, Jun'ichi
    NEURAL INFORMATION PROCESSING, ICONIP 2012, PT V, 2012, 7667 : 400 - 409
  • [42] Hierarchical indexing of ocean survey video by mean shift clustering and MDL principle
    Luo, QM
    Khoshgoftaar, TM
    An, E
    PROCEEDINGS OF THE 2005 IEEE INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION, 2005, : 404 - 409
  • [43] REPRESENTING CLUMPS OF CELL NUCLEI AS UNIONS OF ELLIPTIC SHAPES BY USING THE MDL PRINCIPLE
    Hukkanen, Jenni
    Sabo, Edmond
    Tabus, Ioan
    19TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2011), 2011, : 1010 - 1014
  • [44] Alternating projection algorithm for detecting the number of coherent signals based on the MDL principle
    Suzuki, M
    Sanada, H
    Nagai, N
    2000 IEEE ASIA-PACIFIC CONFERENCE ON CIRCUITS AND SYSTEMS: ELECTRONIC COMMUNICATION SYSTEMS, 2000, : 699 - 702
  • [45] Regularization Using the MDL Principle: Estimation of Regularization Parameters for Regions Containing Discontinuities
    Miyajima, Koji
    Mukawa, Naoki
    Okada, Mamoru
    Systems and Computers in Japan, 1999, 30 (08): : 61 - 71
  • [46] The World Wide Web as neural net - Implications for market-driven Web enabling
    Laxton, R
    TECHNOLOGICAL FORECASTING AND SOCIAL CHANGE, 2000, 64 (01) : 55 - 70
  • [47] EEG driven model predictive position control of an artificial limb using neural net
    Roy, Rinku
    Konar, Amit
    Tibarewala, D. N.
    Janarthanan, R.
    2012 THIRD INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION & NETWORKING TECHNOLOGIES (ICCCNT), 2012,
  • [48] Learning the architectures and parameters of RBF neural network based on MDL
    Liu, Meiqin
    Chen, Jida
    Cai, Zixing
    Xiaoxing Weixing Jisuanji Xitong/Mini-Micro Systems, 2000, 21 (04): : 379 - 382
  • [49] Limitation and expansion of my intensity principle
    Schmidt, A
    PHYSIKALISCHE ZEITSCHRIFT, 1904, 5 : 528 - 529
  • [50] Learning the architectures and parameters of RBF neural network based on MDL
    Liu, Meiqin
    Chen, Jida
    Cai, Zixing
    2000, Shenyang Inst Comput Technol, China (21):