Directional co-clustering

被引:0
|
作者
Aghiles Salah
Mohamed Nadif
机构
[1] SIS,
[2] Singapore Management University,undefined
[3] LIPADE,undefined
[4] Paris Descartes University,undefined
关键词
Co-clustering; Directional data; von Mises-Fisher distribution; EM algorithm; Document clustering; Main 62H30; Secondary 62H11;
D O I
暂无
中图分类号
学科分类号
摘要
Co-clustering addresses the problem of simultaneous clustering of both dimensions of a data matrix. When dealing with high dimensional sparse data, co-clustering turns out to be more beneficial than one-sided clustering even if one is interested in clustering along one dimension only. Aside from being high dimensional and sparse, some datasets, such as document-term matrices, exhibit directional characteristics, and the L2\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$L_2$$\end{document} normalization of such data, so that it lies on the surface of a unit hypersphere, is useful. Popular co-clustering assumptions such as Gaussian or Multinomial are inadequate for this type of data. In this paper, we extend the scope of co-clustering to directional data. We present Diagonal Block Mixture of Von Mises–Fisher distributions (dbmovMFs), a co-clustering model which is well suited for directional data lying on a unit hypersphere. By setting the estimate of the model parameters under the maximum likelihood (ML) and classification ML approaches, we develop a class of EM algorithms for estimating dbmovMFs from data. Extensive experiments, on several real-world datasets, confirm the advantage of our approach and demonstrate the effectiveness of our algorithms.
引用
收藏
页码:591 / 620
页数:29
相关论文
共 50 条
  • [1] Directional co-clustering
    Salah, Aghiles
    Nadif, Mohamed
    [J]. ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2019, 13 (03) : 591 - 620
  • [2] Regularized bi-directional co-clustering
    Affeldt, Severine
    Labiod, Lazhar
    Nadif, Mohamed
    [J]. STATISTICS AND COMPUTING, 2021, 31 (03)
  • [3] Regularized bi-directional co-clustering
    Séverine Affeldt
    Lazhar Labiod
    Mohamed Nadif
    [J]. Statistics and Computing, 2021, 31
  • [4] Co-clustering directed graphs to discover asymmetries and directional communities
    Rohe, Karl
    Qin, Tai
    Yu, Bin
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2016, 113 (45) : 12679 - 12684
  • [5] Joint co-clustering: Co-clustering of genomic and clinical bioimaging data
    Ficarra, Elisa
    De Micheli, Giovanni
    Yoon, Sungroh
    Benini, Luca
    Macii, Enrico
    [J]. COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2008, 55 (05) : 938 - 949
  • [6] Bayesian Co-clustering
    Shan, Hanhuai
    Banerjee, Arindam
    [J]. ICDM 2008: EIGHTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2008, : 530 - 539
  • [7] Co-Clustering on Manifolds
    Gu, Quanquan
    Zhou, Jie
    [J]. KDD-09: 15TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2009, : 359 - 367
  • [8] Bayesian co-clustering
    Domeniconi, Carlotta
    Laskey, Kathryn
    [J]. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2015, 7 (05): : 347 - 356
  • [9] Spectral co-clustering ensemble
    Huang, Shudong
    Wang, Hongjun
    Li, Dingcheng
    Yang, Yan
    Li, Tianrui
    [J]. KNOWLEDGE-BASED SYSTEMS, 2015, 84 : 46 - 55
  • [10] Evolutionary Spectral Co-Clustering
    Green, Nathan
    Rege, Manjeet
    Liu, Xumin
    Bailey, Reynold
    [J]. 2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2011, : 1074 - 1081