A Clustering Method Based on the Maximum Entropy Principle

Cited by: 35
Authors
Aldana-Bobadilla, Edwin [1]
Kuri-Morales, Angel [2]
Affiliations
[1] Univ Nacl Autonoma Mexico, Inst Invest Matemat Aplicadas & Sistemas, Mexico City 04510, DF, Mexico
[2] Inst Tecnol Autonomo Mexico, Mexico City 01080, DF, Mexico
Keywords
clustering; Shannon's entropy; genetic algorithms; INFORMATION; OPTIMIZATION; NUMBER; VALIDATION; ALGORITHM;
DOI
10.3390/e17010151
Chinese Library Classification number
O4 [Physics];
Discipline classification code
0702;
Abstract
Clustering is an unsupervised process that determines which unlabeled objects in a set share interesting properties. The objects are grouped into k subsets (clusters) whose elements optimize a proximity measure. Methods based on information theory have proven to be feasible alternatives. They rest on the assumption that a cluster is a subset with the minimal possible degree of "disorder", and so they attempt to minimize the entropy of each cluster. We propose a clustering method based on the maximum entropy principle. The method explores the space of all possible probability distributions of the data to find one that maximizes the entropy subject to extra conditions based on prior information about the clusters. The prior information is based on the assumption that the elements of a cluster are "similar" to each other in accordance with some statistical measure. As a consequence of this principle, distributions of high entropy that satisfy the conditions are favored over others. Searching the space to find the optimal distribution of objects in the clusters is a hard combinatorial problem, which rules out traditional optimization techniques. Genetic algorithms are a good alternative for solving this problem. We benchmark our method against the best theoretical performance, which is given by the Bayes classifier when data are normally distributed, and against a multilayer perceptron network, which offers the best practical performance when data are not normal. In general, a supervised classification method will outperform an unsupervised one, since in the supervised case the elements of the classes are known a priori. In what follows, we show that our method's effectiveness is comparable to that of a supervised one. This clearly exhibits the superiority of our method.
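The idea in the abstract, favoring high-entropy assignments among those that satisfy a similarity condition, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the data are one-dimensional, the entropy is computed over cluster-membership proportions, the "statistical measure" of similarity is a cap on the within-cluster standard deviation (the hypothetical parameter `max_std`), and infeasible assignments simply receive a fixed penalty. The function names are hypothetical. A search procedure such as a genetic algorithm could use `fitness` to rank candidate assignments; no search is implemented here.

```python
import math
from collections import Counter


def shannon_entropy(labels):
    """Shannon entropy (in bits) of the empirical cluster-label distribution."""
    n = len(labels)
    counts = Counter(labels)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


def max_within_cluster_std(points, labels):
    """Largest per-cluster population standard deviation for 1-D points."""
    worst = 0.0
    for k in set(labels):
        members = [x for x, lab in zip(points, labels) if lab == k]
        mean = sum(members) / len(members)
        var = sum((x - mean) ** 2 for x in members) / len(members)
        worst = max(worst, math.sqrt(var))
    return worst


def fitness(points, labels, max_std):
    """Entropy of the assignment if the similarity constraint holds.

    Assumption: infeasible assignments get a fixed penalty of -1.0,
    which is below any attainable entropy value.
    """
    if max_within_cluster_std(points, labels) > max_std:
        return -1.0
    return shannon_entropy(labels)


points = [0.0, 0.1, 0.2, 5.0, 5.1, 5.2]
separated = [0, 0, 0, 1, 1, 1]    # two tight groups
interleaved = [0, 1, 0, 1, 0, 1]  # same proportions, but clusters are spread out

print(fitness(points, separated, max_std=0.5))
print(fitness(points, interleaved, max_std=0.5))
```

Both candidate assignments have the same label proportions and therefore the same entropy; only the similarity constraint separates them, which is the role the abstract gives to prior information about the clusters.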
Pages: 151-180
Number of pages: 30
Related papers
50 in total
  • [1] A clustering algorithm based on maximum entropy principle
    Zhao, Yang
    Liu, Fangai
    [J]. 2ND ANNUAL INTERNATIONAL CONFERENCE ON INFORMATION SYSTEM AND ARTIFICIAL INTELLIGENCE (ISAI2017), 2017, 887
  • [2] The principle of the maximum entropy method
    Sakata, M
    Takata, M
    [J]. HIGH PRESSURE RESEARCH, 1996, 14 (4-6) : 327 - 333
  • [3] Clustering based on density estimation Using variable kernel and maximum entropy principle
    El Fattahi, Loubna
    Lakhdar, Yissam
    Sbai, El Hassan
    [J]. 2017 INTELLIGENT SYSTEMS AND COMPUTER VISION (ISCV), 2017
  • [4] Data-driven fuzzy clustering based on maximum entropy principle and PSO
    Chen, Debao
    Zhao, Chunxia
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (01) : 625 - 633
  • [5] An improved algorithm for support vector clustering based on maximum entropy principle and kernel matrix
    Guo, Chonghui
    Li, Fang
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (07) : 8138 - 8143
  • [6] A Novel Method for Predicting Network Traffic Based on Maximum Entropy Principle
    Wang, Jingyu
    Zhao, Yang
    [J]. INTERNATIONAL JOURNAL OF FUTURE GENERATION COMMUNICATION AND NETWORKING, 2016, 9 (01) : 97 - 106
  • [7] Composition Vector Method Based on Maximum Entropy Principle for Sequence Comparison
    Chan, Raymond H.
    Chan, Tony H.
    Yeung, Hau Man
    Wang, Roger Wei
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2012, 9 (01) : 79 - 87
  • [8] Gaussian clustering method based on maximum-fuzzy-entropy interpretation
    Li, RP
    Mukaidono, M
    [J]. FUZZY SETS AND SYSTEMS, 1999, 102 (02) : 253 - 258
  • [9] UNCERTAINTY ANALYSIS METHOD BASED ON A COMBINATION OF THE MAXIMUM ENTROPY PRINCIPLE AND THE POINT ESTIMATION METHOD
    Zhang, Xiao-Ling
    Huang, Hong-Zhong
    Wang, Zhong-Lai
    Xiao, Ning-Cong
    Li, Yan-Feng
    [J]. EKSPLOATACJA I NIEZAWODNOSC-MAINTENANCE AND RELIABILITY, 2012, 14 (02) : 114 - 119
  • [10] Scale Detection Based on Maximum Entropy Principle
    Zhang, Xiaochun
    Duan, Qing
    Yang, Hongji
    [J]. 2018 24TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATION AND COMPUTING (ICAC' 18), 2018 : 448 - 453