Multiple Independent Subspace Clusterings

被引:0
|
作者
Wang, Xing [1 ]
Wang, Jun [1 ]
Domeniconi, Carlotta [2 ]
Yu, Guoxian [1 ,3 ]
Xiao, Guoqiang [1 ]
Guo, Maozu [4 ]
机构
[1] Southwest Univ, Coll Comp & Informat Sci, Chongqing, Peoples R China
[2] George Mason Univ, Dept Comp Sci, Fairfax, VA 22030 USA
[3] China Univ Geosci, Hubei Key Lab Intelligent Geoinformat Proc, Wuhan, Hubei, Peoples R China
[4] Beijing Univ Civil Engn & Architecture, Sch Elect & Informat Engn, Beijing, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multiple clustering aims at discovering diverse ways of organizing data into clusters. Despite the progress made, it's still a challenge for users to analyze and understand the distinctive structure of each output clustering. To ease this process, we consider diverse clusterings embedded in different subspaces, and analyze the embedding subspaces to shed light into the structure of each clustering. To this end, we provide a two-stage approach called MISC (Multiple Independent Subspace Clusterings). In the first stage, MISC uses independent subspace analysis to seek multiple and statistical independent (i.e. non-redundant) subspaces, and determines the number of subspaces via the minimum description length principle. In the second stage, to account for the intrinsic geometric structure of samples embedded in each subspace, MISC performs graph regularized semi-nonnegative matrix factorization to explore clusters. It additionally integrates the kernel trick into matrix factorization to handle non-linearly separable clusters. Experimental results on synthetic datasets show that MISC can find different interesting clusterings from the sought independent subspaces, and it also outperforms other related and competitive approaches on real-world datasets.
引用
收藏
页码:5353 / 5360
页数:8
相关论文
共 50 条
  • [1] Are clusterings of multiple data views independent?
    Gao, Lucy L.
    Bien, Jacob
    Witten, Daniela
    BIOSTATISTICS, 2020, 21 (04) : 692 - 708
  • [2] Comparing subspace clusterings
    Patrikainen, Anne
    Meila, Marina
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2006, 18 (07) : 902 - 916
  • [3] Finding multiple stable clusterings
    Hu, Juhua
    Qian, Qi
    Pei, Jian
    Jin, Rong
    Zhu, Shenghuo
    KNOWLEDGE AND INFORMATION SYSTEMS, 2017, 51 (03) : 991 - 1021
  • [4] Learning Multiple Nonredundant Clusterings
    Cui, Ying
    Fern, Xiaoli Z.
    Dy, Jennifer G.
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2010, 4 (03)
  • [5] Finding Multiple Stable Clusterings
    Hu, Juhua
    Qian, Qi
    Pei, Jian
    Jin, Rong
    Zhu, Shenghuo
    2015 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2015, : 171 - 180
  • [6] Finding multiple stable clusterings
    Juhua Hu
    Qi Qian
    Jian Pei
    Rong Jin
    Shenghuo Zhu
    Knowledge and Information Systems, 2017, 51 : 991 - 1021
  • [7] Multiple Co-Clusterings
    Wang, Xing
    Yu, Guoxian
    Domeniconi, Carlotta
    Wang, Jun
    Yu, Zhiwen
    Zhang, Zili
    2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2018, : 1308 - 1313
  • [8] Combining multiple weak clusterings
    Topchy, A
    Jain, AK
    Punch, W
    THIRD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2003, : 331 - 338
  • [9] CLICOM: Cliques for combining multiple clusterings
    Mimaroglu, Selim
    Yagci, Murat
    EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (02) : 1889 - 1901
  • [10] A framework to uncover multiple alternative clusterings
    Dang, Xuan Hong
    Bailey, James
    MACHINE LEARNING, 2015, 98 (1-2) : 7 - 30