Using an MDL-based cost function with neural networks

被引:0
|
作者
Lappalainen, H [1 ]
机构
[1] Helsinki Univ Technol, Neural Networks Res Ctr, FIN-02015 HUT, Finland
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The minimum description length (MDL) principle is an information theoretically based method to learn models from data. This paper presents how to efficiently use an MDL-based cost function with neural networks. As usual, the cost function can be used to adapt the parameters in the network, but it can also include terms to measure the complexity of the structure of the network and can thus be applied to determine the optimal structure. The basic idea is to convert a conventional neural network such that each parameter and each output of the neurons is assigned a mean and a variance. This greatly simplifies the computation of the description length and its gradient with respect to the parameters, which can then be adapted using standard gradient descent.
引用
收藏
页码:2384 / 2389
页数:6
相关论文
共 50 条
  • [1] An efficient MDL-based construction of RBF networks
    Leonardis, A
    Bischof, H
    [J]. NEURAL NETWORKS, 1998, 11 (05) : 963 - 973
  • [2] Design of vector quantization networks by MDL-based principles
    Bischof, H
    Leonardis, A
    [J]. IEEE WORLD CONGRESS ON COMPUTATIONAL INTELLIGENCE, 1998, : 2294 - 2299
  • [3] An MDL-based Hammerstein recurrent neural network for control applications
    Wang, Jeen-Shing
    Hsu, Yu-Liang
    [J]. NEUROCOMPUTING, 2010, 74 (1-3) : 315 - 327
  • [4] MDL-Based Hierarchical Clustering
    Markov, Zdravko
    [J]. 2015 IEEE 14TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2015, : 471 - 474
  • [5] MDL-based time series clustering
    Rakthanmanon, Thanawin
    Keogh, Eamonn J.
    Lonardi, Stefano
    Evans, Scott
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2012, 33 (02) : 371 - 399
  • [6] MDL-based design of vector quantizers
    Bischof, H
    Leonardis, A
    [J]. FOURTEENTH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1 AND 2, 1998, : 891 - 893
  • [7] MDL-based Fitness for Feature Construction
    Shafti, Leila S.
    Perez, Eduardo
    [J]. GECCO 2007: GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, VOL 1 AND 2, 2007, : 1875 - 1875
  • [8] MDL-based time series clustering
    Thanawin Rakthanmanon
    Eamonn J. Keogh
    Stefano Lonardi
    Scott Evans
    [J]. Knowledge and Information Systems, 2012, 33 : 371 - 399
  • [9] A new MDL-based function for feature selection for Bayesian network classifiers
    Drugan, MM
    van der Gaag, LC
    [J]. ECAI 2004: 16TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2004, 110 : 999 - 1000
  • [10] Widening for MDL-Based Retail Signature Discovery
    Gautrais, Clement
    Cellier, Peggy
    van Leeuwen, Matthijs
    Termier, Alexandre
    [J]. ADVANCES IN INTELLIGENT DATA ANALYSIS XVIII, IDA 2020, 2020, 12080 : 197 - 209