Mixed Membership Subspace Clustering

被引:4
|
作者
Guennemann, Stephan [1 ]
Faloutsos, Christos [1 ]
机构
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
关键词
HIGH-DIMENSIONAL DATA;
D O I
10.1109/ICDM.2013.109
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clustering is one of the fundamental data mining tasks. While traditional clustering techniques assign each object to a single cluster only, in many applications it has been observed that objects might belong to multiple clusters with different degrees. In this work, we present a Bayesian framework to tackle the challenge of mixed membership clustering for vector data. We exploit the ideas of subspace clustering where the relevance of dimensions might be different for each cluster. Combining the relevance of the dimensions with the cluster membership degree of the objects, we propose a novel type of mixture model able to represent data containing mixed membership subspace clusters. For learning our model, we develop an efficient algorithm based on variational inference allowing easy parallelization. In our empirical study on synthetic and real data we show the strengths of our novel clustering technique.
引用
收藏
页码:221 / 230
页数:10
相关论文
共 50 条
  • [1] Nonparametric Estimation of Probabilistic Membership for Subspace Clustering
    Lee, Jieun
    Lee, Hyeogjin
    Lee, Minsik
    Kwak, Nojun
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (03) : 1023 - 1036
  • [2] A General and Scalable Approach to Mixed Membership Clustering
    Lin, Frank
    Cohen, William W.
    [J]. 12TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2012), 2012, : 429 - 438
  • [3] Bayesian mixed membership models for soft clustering and classification
    Erosheva, EA
    Fienberg, SE
    [J]. CLASSIFICATION - THE UBIQUITOUS CHALLENGE, 2005, : 11 - 26
  • [4] Mixed Membership Graph Clustering via Systematic Edge Query
    Ibrahim, Shahana
    Fu, Xiao
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2021, 69 : 5189 - 5205
  • [5] Subspace clustering
    Kriegel, Hans-Peter
    Kroeger, Peer
    Zimek, Arthur
    [J]. WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2012, 2 (04) : 351 - 364
  • [6] Subspace Clustering
    Vidal, Rene
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2011, 28 (02) : 52 - 68
  • [7] A k-means type clustering algorithm for subspace clustering of mixed numeric and categorical datasets
    Ahmad, Amir
    Dey, Lipika
    [J]. PATTERN RECOGNITION LETTERS, 2011, 32 (07) : 1062 - 1069
  • [8] Exponential family mixed membership models for soft clustering of multivariate data
    White, Arthur
    Murphy, Thomas Brendan
    [J]. ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2016, 10 (04) : 521 - 540
  • [9] Exponential family mixed membership models for soft clustering of multivariate data
    Arthur White
    Thomas Brendan Murphy
    [J]. Advances in Data Analysis and Classification, 2016, 10 : 521 - 540
  • [10] A Dirichlet Model of Alignment Cost in Mixed-Membership Unsupervised Clustering
    Liu, Xiran
    Kopelman, Naama M.
    Rosenberg, Noah A.
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2023, 32 (03) : 1145 - 1159