Clustgrams: an extension to histogram densities based on the minimum description length principle

被引:1
|
作者
Luosto, Panu [1 ]
Kontkanen, Petri [1 ,2 ]
机构
[1] Univ Helsinki, Dept Comp Sci, Helsinki, Finland
[2] Aalto Univ, Helsinki Inst Informat Technol, Helsinki, Finland
基金
芬兰科学院;
关键词
density estimation; minimum description length (MDL) principle; clustering; histograms;
D O I
10.2478/s13537-011-0033-x
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Density estimation is one of the most important problems in statistical inference and machine learning. A common approach to the problem is to use histograms, i.e., piecewise constant densities. Histograms are flexible and can adapt to any density given enough bins. However, due to the simplicity of histograms, a large number of parameters and a large sample size might be needed for learning an accurate density, especially in more complex problem instances. In this paper, we extend the histogram density estimation framework by introducing a model called clustgram, which uses arbitrary density functions as components of the density rather than just uniform components. The new model is based on finding a clustering of the sample points and determining the type of the density function for each cluster. We regard the problem of learning clustgrams as a model selection problem and use the theoretically appealing minimum description length principle for solving the task.
引用
收藏
页码:466 / 481
页数:16
相关论文
共 50 条
  • [31] Minimum Description Length Principle for Maximum Entropy Model Selection
    Pandey, Gaurav
    Dukkipati, Ambedkar
    [J]. 2013 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY PROCEEDINGS (ISIT), 2013, : 1521 - 1525
  • [32] Towards Web Spam Filtering using a Classifier based on the Minimum Description Length Principle
    Silva, Renato M.
    Yamakami, Akebo
    Almeida, Tiago A.
    [J]. 2016 15TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2016), 2016, : 470 - 475
  • [33] Comparative analysis of structural representations of images, based on the principle of representational minimum description length
    Potapov, A. S.
    [J]. JOURNAL OF OPTICAL TECHNOLOGY, 2008, 75 (11) : 715 - 720
  • [34] Optimization Framework with Minimum Description Length Principle for Probabilistic Programming
    Potapov, Alexey
    Batishcheva, Vita
    Rodionov, Sergey
    [J]. ARTIFICIAL GENERAL INTELLIGENCE (AGI 2015), 2015, 9205 : 331 - 340
  • [35] Regression spline smoothing using the minimum description length principle
    Lee, TCM
    [J]. STATISTICS & PROBABILITY LETTERS, 2000, 48 (01) : 71 - 82
  • [36] Quantitative description of the laws of perceptual grouping by means of the principle of representational minimum description length
    Potapov, A. S.
    Petrochenko, V. G.
    [J]. JOURNAL OF OPTICAL TECHNOLOGY, 2008, 75 (08) : 509 - 513
  • [37] Adaptive Multitimescale Event Detection in Nonintrusive Load Monitoring Based on Minimum Description Length Principle
    Liu, Bo
    Zhang, Jianfeng
    Luan, Wenpeng
    Zhao, Bochao
    Liu, Zishuai
    Yu, Yixin
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 : 1 - 14
  • [38] A Minimum Description Length Principle Based Method for Signal Change Detection in Machine Condition Monitoring
    Hulkkonen, Jenni
    Heikkonen, Jukka
    [J]. 19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 3531 - 3534
  • [39] Clustering of a set of identified points on images of dynamic scenes, based on the principle of minimum description length
    Peterson, M. V.
    [J]. JOURNAL OF OPTICAL TECHNOLOGY, 2010, 77 (11) : 701 - 706
  • [40] Unified Model Selection Approach Based on Minimum Description Length Principle in Granger Causality Analysis
    Li, Fei
    Wang, Xuewei
    Lin, Qiang
    Hu, Zhenghui
    [J]. IEEE ACCESS, 2020, 8 : 68400 - 68416