共 50 条
Two-way clustering for contingency tables: Maximizing a dependence measure
被引:0
|作者:
Bock, HH
[1
]
机构:
[1] Univ Aachen, Inst Stat, D-52056 Aachen, Germany
来源:
关键词:
D O I:
暂无
中图分类号:
TP39 [计算机的应用];
学科分类号:
081203 ;
0835 ;
摘要:
We consider the simultaneous clustering of the rows and columns of a contingency table such that the dependence between row clusters and column clusters is maximized in the sense of maximizing a general dependence measure. We use Csiszar's phi-divergence between the given two-way distribution and the independence case with the same margins. This includes the classical chi(2) measure, Kullback-Leibler's discriminating information, and variation distance. By using the general theory of 'convexity-based clustering criteria' (Bock 1992, 2002a, 2002b) we derive a k-means-like clustering algorithm that uses 'maximum support-plane partitions' (in terms of likelihood ratio vectors) in the same way as classical SSQ clustering uses 'minimum-distance partitions'.
引用
收藏
页码:143 / 154
页数:12
相关论文