Clustgrams: an extension to histogram densities based on the minimum description length principle

Cited by: 1
Authors
Luosto, Panu [1]
Kontkanen, Petri [1, 2]
Affiliations
[1] Univ Helsinki, Dept Comp Sci, Helsinki, Finland
[2] Aalto Univ, Helsinki Inst Informat Technol, Helsinki, Finland
Funding
Academy of Finland
Keywords
density estimation; minimum description length (MDL) principle; clustering; histograms
DOI
10.2478/s13537-011-0033-x
Chinese Library Classification
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Density estimation is one of the most important problems in statistical inference and machine learning. A common approach to the problem is to use histograms, i.e., piecewise constant densities. Histograms are flexible and can adapt to any density given enough bins. However, due to the simplicity of histograms, a large number of parameters and a large sample size might be needed for learning an accurate density, especially in more complex problem instances. In this paper, we extend the histogram density estimation framework by introducing a model called clustgram, which uses arbitrary density functions as components of the density rather than just uniform components. The new model is based on finding a clustering of the sample points and determining the type of the density function for each cluster. We regard the problem of learning clustgrams as a model selection problem and use the theoretically appealing minimum description length principle for solving the task.
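As a rough illustration of treating histogram learning as model selection, the sketch below scores equal-width histograms with a simple two-part code length and picks the bin count with the shortest code. It is not the clustgram method from the paper: the penalty term `0.5 * (k - 1) * log(n)` is a BIC-style stand-in assumed here for illustration, and the function names `histogram_code_length` and `select_bins` are hypothetical.

```python
import math


def histogram_code_length(data, k, lo, hi):
    """Approximate two-part code length (in nats) of `data` under a
    k-bin equal-width histogram density on [lo, hi].

    Assumption: the model cost is a BIC-style penalty, used here only
    as a simple stand-in for an MDL criterion.
    """
    n = len(data)
    width = (hi - lo) / k
    counts = [0] * k
    for x in data:
        # Clamp so that x == hi falls into the last bin.
        idx = min(int((x - lo) / width), k - 1)
        counts[idx] += 1
    # Data cost: negative log-likelihood under the maximum-likelihood
    # piecewise constant density, count / (n * width) in each bin.
    nll = -sum(c * math.log(c / (n * width)) for c in counts if c > 0)
    # Model cost: k - 1 free bin probabilities.
    penalty = 0.5 * (k - 1) * math.log(n)
    return nll + penalty


def select_bins(data, lo, hi, max_bins):
    """Return the bin count in 1..max_bins with the shortest code length."""
    return min(
        range(1, max_bins + 1),
        key=lambda k: histogram_code_length(data, k, lo, hi),
    )
```

With evenly spread data the penalty should outweigh any gain in fit, so a single bin is chosen; with data concentrated in a few narrow regions, more bins shorten the total code. The abstract's point is that uniform components can need many bins in such cases, which motivates allowing other density types per cluster.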
Pages: 466-481
Page count: 16