Two-way clustering for contingency tables: Maximizing a dependence measure

被引:0
|
作者
Bock, HH [1 ]
机构
[1] Univ Aachen, Inst Stat, D-52056 Aachen, Germany
关键词
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
We consider the simultaneous clustering of the rows and columns of a contingency table such that the dependence between row clusters and column clusters is maximized in the sense of maximizing a general dependence measure. We use Csiszar's phi-divergence between the given two-way distribution and the independence case with the same margins. This includes the classical chi(2) measure, Kullback-Leibler's discriminating information, and variation distance. By using the general theory of 'convexity-based clustering criteria' (Bock 1992, 2002a, 2002b) we derive a k-means-like clustering algorithm that uses 'maximum support-plane partitions' (in terms of likelihood ratio vectors) in the same way as classical SSQ clustering uses 'minimum-distance partitions'.
引用
收藏
页码:143 / 154
页数:12
相关论文
共 50 条