Using an MDL-based cost function with neural networks

被引:0
|
作者
Lappalainen, H [1 ]
机构
[1] Helsinki Univ Technol, Neural Networks Res Ctr, FIN-02015 HUT, Finland
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The minimum description length (MDL) principle is an information theoretically based method to learn models from data. This paper presents how to efficiently use an MDL-based cost function with neural networks. As usual, the cost function can be used to adapt the parameters in the network, but it can also include terms to measure the complexity of the structure of the network and can thus be applied to determine the optimal structure. The basic idea is to convert a conventional neural network such that each parameter and each output of the neurons is assigned a mean and a variance. This greatly simplifies the computation of the description length and its gradient with respect to the parameters, which can then be adapted using standard gradient descent.
引用
收藏
页码:2384 / 2389
页数:6
相关论文
共 50 条
  • [31] Three new MDL-based pruning techniques for robust rule induction
    Pham, D. T.
    Afify, A. A.
    [J]. PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART C-JOURNAL OF MECHANICAL ENGINEERING SCIENCE, 2006, 220 (04) : 553 - 564
  • [32] Mint: MDL-based approach for Mining INTeresting Numerical Pattern Sets
    Makhalova, Tatiana
    Kuznetsov, Sergei O.
    Napoli, Amedeo
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2022, 36 (01) : 108 - 145
  • [33] MDL-based context-dependent subword modeling for speech recognition
    Shinoda, Koichi
    Watanabe, Takao
    [J]. Journal of the Acoustical Society of Japan (E) (English translation of Nippon Onkyo Gakkaishi), 2000, 21 (02): : 79 - 86
  • [34] Enhanced multi-task compressive sensing using Laplace priors and MDL-based task classification
    Ying-Gui Wang
    Le Yang
    Liang Tang
    Zheng Liu
    Wen-Li Jiang
    [J]. EURASIP Journal on Advances in Signal Processing, 2013
  • [35] Towards a Robust Classifier: An MDL-Based Method for Generating Adversarial Examples
    Asadi, Behzad
    Varadharajan, Vijay
    [J]. 2020 IEEE 19TH INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2020), 2020, : 793 - 801
  • [36] MDL-based Posture Stabilization for Wheeled Mobile Robots with Nonholonomic Constraints
    Shi, Pu
    Zhao, Yiwen
    Hua, Jianning
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO 2009), VOLS 1-4, 2009, : 2255 - +
  • [37] Time Series Discretization via MDL-based Histogram Density Estimation
    Kameya, Yoshitaka
    [J]. 2011 23RD IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2011), 2011, : 732 - 739
  • [38] Mint: MDL-based approach for Mining INTeresting Numerical Pattern Sets
    Tatiana Makhalova
    Sergei O. Kuznetsov
    Amedeo Napoli
    [J]. Data Mining and Knowledge Discovery, 2022, 36 : 108 - 145
  • [39] GraphMDL plus : Interleaving the Generation and MDL-based Selection of Graph Patterns
    Bariatti, Francesco
    Cellier, Peggy
    Ferre, Sebastien
    [J]. 36TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2021, 2021, : 355 - 363
  • [40] A Novel Cost Function for Despeckling using Convolutional Neural Networks
    Ferraioli, Giampaolo
    Pascazio, Vito
    Vitale, Sergio
    [J]. 2019 JOINT URBAN REMOTE SENSING EVENT (JURSE), 2019,