Directional co-clustering

被引:0
|
作者
Aghiles Salah
Mohamed Nadif
机构
[1] SIS,
[2] Singapore Management University,undefined
[3] LIPADE,undefined
[4] Paris Descartes University,undefined
关键词
Co-clustering; Directional data; von Mises-Fisher distribution; EM algorithm; Document clustering; Main 62H30; Secondary 62H11;
D O I
暂无
中图分类号
学科分类号
摘要
Co-clustering addresses the problem of simultaneous clustering of both dimensions of a data matrix. When dealing with high dimensional sparse data, co-clustering turns out to be more beneficial than one-sided clustering even if one is interested in clustering along one dimension only. Aside from being high dimensional and sparse, some datasets, such as document-term matrices, exhibit directional characteristics, and the L2\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$L_2$$\end{document} normalization of such data, so that it lies on the surface of a unit hypersphere, is useful. Popular co-clustering assumptions such as Gaussian or Multinomial are inadequate for this type of data. In this paper, we extend the scope of co-clustering to directional data. We present Diagonal Block Mixture of Von Mises–Fisher distributions (dbmovMFs), a co-clustering model which is well suited for directional data lying on a unit hypersphere. By setting the estimate of the model parameters under the maximum likelihood (ML) and classification ML approaches, we develop a class of EM algorithms for estimating dbmovMFs from data. Extensive experiments, on several real-world datasets, confirm the advantage of our approach and demonstrate the effectiveness of our algorithms.
引用
收藏
页码:591 / 620
页数:29
相关论文
共 50 条
  • [41] Sleeved co-clustering of lagged data
    Eran Shaham
    David Sarne
    Boaz Ben-Moshe
    [J]. Knowledge and Information Systems, 2012, 31 : 251 - 279
  • [42] HCC: A Hierarchical Co-Clustering Algorithm
    Li, Jingxuan
    Li, Tao
    [J]. SIGIR 2010: PROCEEDINGS OF THE 33RD ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH DEVELOPMENT IN INFORMATION RETRIEVAL, 2010, : 861 - 862
  • [43] Joint cluster based co-clustering for clustering ensembles
    Hu, Tianming
    Liu, Liping
    Qu, Chao
    Sung, Sam Yuan
    [J]. ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2006, 4093 : 284 - 295
  • [44] Co-clustering for Weblogs in Semantic Space
    Zong, Yu
    Xu, Guandong
    Dolog, Peter
    Zhang, Yanchun
    Liu, Renjin
    [J]. WEB INFORMATION SYSTEM ENGINEERING-WISE 2010, 2010, 6488 : 120 - +
  • [45] Co-clustering of fuzzy lagged data
    Eran Shaham
    David Sarne
    Boaz Ben-Moshe
    [J]. Knowledge and Information Systems, 2015, 44 : 217 - 252
  • [46] Provable Convex Co-clustering of Tensors
    Chi, Eric C.
    Gaines, Brian R.
    Sun, Will Wei
    Zhou, Hua
    Yang, Jian
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2020, 21
  • [47] Coverage Constrained Spatial CO-clustering
    Ohriniuc, Roxana
    Reich, Aaron
    Yang, KwangSoo
    [J]. 26TH ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS (ACM SIGSPATIAL GIS 2018), 2018, : 492 - 495
  • [48] Co-Clustering Enterprise Social Networks
    Hu, Ruiqi
    Pan, Shirui
    Long, Guodong
    Zhu, Xingquan
    Jiang, Jing
    Zhang, Chengqi
    [J]. 2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 107 - 114
  • [49] A Spectral algorithm for Topographical Co-clustering
    Nicoleta, Rogovschi
    Labiod, Lazhar
    Nadif, Mohamed
    [J]. 2012 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2012,
  • [50] Robust fuzzy co-clustering algorithm
    Tjhi, William-Chandra
    Chen, Lihui
    [J]. 2007 6TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS & SIGNAL PROCESSING, VOLS 1-4, 2007, : 1591 - 1595