Gaussian Clusters and Noise: An Approach Based on the Minimum Description Length Principle

Cited by: 0
Authors
Luosto, Panu [1]
Kivinen, Jyrki [1]
Mannila, Heikki [2]
Affiliations
[1] Univ Helsinki, Dept Comp Sci, FIN-00014 Helsinki, Finland
[2] Aalto Univ, Dept Informat & Comp Sci, Helsinki, Finland
Source
DISCOVERY SCIENCE, DS 2010 | 2010 / Vol. 6332
Keywords
STOCHASTIC COMPLEXITY; INFORMATION;
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We introduce a well-grounded minimum description length (MDL) based quality measure for a clustering consisting of either spherical or axis-aligned normally distributed clusters and a cluster with a uniform distribution in an axis-aligned rectangular box. The uniform component extends the practical usability of the model, e.g., in the presence of noise, and using the MDL principle for model selection makes it possible to compare the quality of clusterings with different numbers of clusters. We also introduce a novel search heuristic for finding the best clustering with an unknown number of clusters. The heuristic is based on the idea of moving points from the Gaussian clusters to the uniform one and using MDL to determine the optimal amount of noise. Tests with synthetic data having a clear cluster structure suggest that the search method is effective in finding the intuitively correct clustering.
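The idea of moving points from a Gaussian cluster to a uniform noise component and letting a code length decide how many to move can be illustrated with a minimal sketch. This is not the quality measure from the paper: it assumes a crude BIC-style two-part code (maximum-likelihood fit plus half a log of the sample size per parameter), a single spherical Gaussian cluster, the bounding box of the data as the noise box, and a ranking of points by distance from the coordinate-wise median. All function names are hypothetical.

```python
import math
import random

def gaussian_cost(points):
    """Nats for points under an ML-fitted spherical Gaussian (BIC-style, assumed)."""
    n, d = len(points), len(points[0])
    mean = [sum(p[j] for p in points) / n for j in range(d)]
    sq = sum((p[j] - mean[j]) ** 2 for p in points for j in range(d))
    var = max(sq / (n * d), 1e-12)  # clamp to avoid log(0)
    nll = 0.5 * n * d * (math.log(2 * math.pi * var) + 1.0)
    return nll + 0.5 * (d + 1) * math.log(n)  # d mean coordinates + 1 variance

def label_cost(sizes):
    """Nats for the cluster labels: n times the entropy of the label proportions."""
    n = sum(sizes)
    return -sum(s * math.log(s / n) for s in sizes if s > 0)

def total_cost(clusters, noise, box_volume, n_total, d):
    """Code length of a hard clustering with an optional uniform noise component."""
    cost = sum(gaussian_cost(c) for c in clusters)
    if noise:
        cost += len(noise) * math.log(box_volume)  # uniform density in the box
        cost += 0.5 * (2 * d) * math.log(n_total)  # 2d box coordinates
    return cost + label_cost([len(c) for c in clusters] + [len(noise)])

def best_noise_count(points):
    """Try moving the m farthest points to noise; return the cheapest m and all costs."""
    n, d = len(points), len(points[0])
    lo = [min(p[j] for p in points) for j in range(d)]
    hi = [max(p[j] for p in points) for j in range(d)]
    volume = math.prod(h - l for l, h in zip(lo, hi))
    center = [sorted(p[j] for p in points)[n // 2] for j in range(d)]
    ranked = sorted(points, key=lambda p: math.dist(p, center), reverse=True)
    costs = [total_cost([ranked[m:]], ranked[:m], volume, n, d)
             for m in range(n - 1)]  # keep at least two points in the cluster
    best_m = min(range(len(costs)), key=costs.__getitem__)
    return best_m, costs

# Synthetic data: one tight cluster plus scattered noise (illustrative values).
rng = random.Random(0)
cluster = [(rng.gauss(0, 0.5), rng.gauss(0, 0.5)) for _ in range(200)]
scatter = [(rng.uniform(-10, 10), rng.uniform(-10, 10)) for _ in range(20)]
best_m, costs = best_noise_count(cluster + scatter)
print(best_m, round(costs[0], 1), round(costs[best_m], 1))
```

In this toy setting, the code length with no noise component (`costs[0]`) is compared against the code lengths obtained after moving the farthest points to the uniform component, so the amount of noise is chosen by the same criterion that scores the clustering.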
Pages: 251-265
Number of pages: 15