A Clustering Method Based on the Maximum Entropy Principle

被引:36
|
作者
Aldana-Bobadilla, Edwin [1 ]
Kuri-Morales, Angel [2 ]
机构
[1] Univ Nacl Autonoma Mexico, Inst Invest Matemat Aplicadas & Sistemas, Mexico City 04510, DF, Mexico
[2] Inst Tecnol Autonomo Mexico, Mexico City 01080, DF, Mexico
关键词
clustering; Shannon's entropy; genetic algorithms; INFORMATION; OPTIMIZATION; NUMBER; VALIDATION; ALGORITHM;
D O I
10.3390/e17010151
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
Clustering is an unsupervised process to determine which unlabeled objects in a set share interesting properties. The objects are grouped into k subsets (clusters) whose elements optimize a proximity measure. Methods based on information theory have proven to be feasible alternatives. They are based on the assumption that a cluster is one subset with the minimal possible degree of "disorder". They attempt to minimize the entropy of each cluster. We propose a clustering method based on the maximum entropy principle. Such a method explores the space of all possible probability distributions of the data to find one that maximizes the entropy subject to extra conditions based on prior information about the clusters. The prior information is based on the assumption that the elements of a cluster are "similar" to each other in accordance with some statistical measure. As a consequence of such a principle, those distributions of high entropy that satisfy the conditions are favored over others. Searching the space to find the optimal distribution of object in the clusters represents a hard combinatorial problem, which disallows the use of traditional optimization techniques. Genetic algorithms are a good alternative to solve this problem. We benchmark our method relative to the best theoretical performance, which is given by the Bayes classifier when data are normally distributed, and a multilayer perceptron network, which offers the best practical performance when data are not normal. In general, a supervised classification method will outperform a non-supervised one, since, in the first case, the elements of the classes are known a priori. In what follows, we show that our method's effectiveness is comparable to a supervised one. This clearly exhibits the superiority of our method.
引用
收藏
页码:151 / 180
页数:30
相关论文
共 50 条
  • [41] IMPLICATIONS OF THE ENTROPY MAXIMUM PRINCIPLE
    KAZES, E
    CUTLER, PH
    AMERICAN JOURNAL OF PHYSICS, 1988, 56 (06) : 560 - 561
  • [42] Reinforced fuzzy neural networks based on maximum entropy clustering and conjugate gradient method
    Dong, Qingmei
    Fan, Qinwei
    Xing, Zhiwei
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 142
  • [43] Hairiness detection based on maximum entropy and density clustering
    Li P.
    Yan K.
    Zhang H.
    Jing J.
    Fangzhi Xuebao/Journal of Textile Research, 2019, 40 (07): : 158 - 162
  • [44] ENTROPY ESTIMATION USING THE PRINCIPLE OF MAXIMUM ENTROPY
    Behmardi, Behrouz
    Raich, Raviv
    Hero, Alfred O., III
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 2008 - 2011
  • [45] On the Entropy and the Maximum Entropy Principle of Uncertain Variables
    Liu, Yujun
    Ma, Guanzhong
    ENTROPY, 2023, 25 (08)
  • [46] Informational Entropy of Structure and Maximum Entropy Principle
    Chen, Jianjun
    Cao, Yibo
    Duan, Baoyan
    Ying Yong Li Xue Xue Bao/Chinese Journal of Applied Mechanics, 1998, 15 (04): : 116 - 121
  • [47] Nonsymmetric entropy and maximum nonsymmetric entropy principle
    Liu, Cheng-shi
    CHAOS SOLITONS & FRACTALS, 2009, 40 (05) : 2469 - 2474
  • [48] Remarks on the Maximum Entropy Principle with Application to the Maximum Entropy Theory of Ecology
    Favretti, Marco
    ENTROPY, 2018, 20 (01):
  • [49] Weighted Kernel Deterministic Annealing: A Maximum-Entropy Principle Approach for Shape Clustering
    Baranwal, Mayank
    Salapaka, Srinivasa M.
    2018 INDIAN CONTROL CONFERENCE (ICC), 2018, : 1 - 6
  • [50] The Method Used for Process Precision Prediction of Micro-assembly Based on Principle of Maximum Entropy
    Zhang, Xiaofeng
    Zhang, Zhijing
    Sun, Yuan
    Ye, Xin
    Ye, Zhipeng
    MANUFACTURING ENGINEERING AND AUTOMATION II, PTS 1-3, 2012, 591-593 : 409 - 413