Clustering in applications with multiple data sources-A mutual subspace clustering approach

被引:6
|
作者
Hua, Ming [2 ]
Pei, Jian [1 ]
机构
[1] Simon Fraser Univ, Sch Comp Sci, Burnaby, BC V5A 1S6, Canada
[2] Facebook Inc, Palo Alto, CA USA
基金
加拿大自然科学与工程研究理事会;
关键词
Clustering; Multiple sources;
D O I
10.1016/j.neucom.2011.08.032
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In many applications, such as bioinformatics and cross-market customer relationship management, there are data from multiple sources jointly describing the same set of objects. An important data mining task is to find interesting groups of objects that form clusters in subspaces of the data sources jointly supported by those data sources. In this paper, we study a novel problem of mining mutual subspace clusters from multiple sources. We develop two interesting models and the corresponding methods for mutual subspace clustering. The density-based model identifies dense regions in subspaces as clusters. The bottom-up method searches for density-based mutual subspace clusters systematically from low-dimensional subspaces to high-dimensional ones. The partitioning model divides points in a data set into k exclusive clusters and a signature subspace is found for each cluster, where k is the number of clusters desired by a user. The top-down method interleaves the well-known k-means clustering procedures in multiple sources. We use experimental results on synthetic data sets and real data sets to report the effectiveness and the efficiency of the methods. (c) 2012 Elsevier B.V. All rights reserved.
引用
收藏
页码:133 / 144
页数:12
相关论文
共 50 条
  • [41] Semi supervised approach towards subspace clustering
    Harikumar, Sandhya
    Akhil, A. S.
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2018, 34 (03) : 1619 - 1629
  • [42] An approach for training subspace distribution clustering HMM
    Wei, Q
    Gang, W
    International Symposium on Communications and Information Technologies 2005, Vols 1 and 2, Proceedings, 2005, : 1450 - 1453
  • [43] Subspace Discovery for Promotion: A Cell Clustering Approach
    Wu, Tianyi
    Han, Jiawei
    DISCOVERY SCIENCE, PROCEEDINGS, 2009, 5808 : 362 - 376
  • [44] Mining Differential Dependencies: A Subspace Clustering Approach
    Kwashie, Selasi
    Liu, Jixue
    Li, Jiuyong
    Ye, Feiyue
    DATABASES THEORY AND APPLICATIONS, ADC 2014, 2014, 8506 : 50 - 61
  • [45] Innovation Pursuit: A New Approach to Subspace Clustering
    Rahmani, Mostafa
    Atia, George K.
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2017, 65 (23) : 6276 - 6291
  • [46] Data Recovery Technology Based on Subspace Clustering
    Sun, Li
    Song, Bing
    SCIENTIFIC PROGRAMMING, 2022, 2022
  • [47] Subspace clustering of high dimensional data streams
    Wang, Shuyun
    Fan, Yingjie
    Zhang, Chenghong
    Xu, HeXiang
    Hao, Xiulan
    Hu, Yunfa
    7TH IEEE/ACIS INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE IN CONJUNCTION WITH 2ND IEEE/ACIS INTERNATIONAL WORKSHOP ON E-ACTIVITY, PROCEEDINGS, 2008, : 165 - +
  • [48] Deep Successive Subspace Learning for Data Clustering
    Sadeghi, Mohammadreza
    Armanfard, Narges
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [49] Analysis of Recipe Data Based on Subspace Clustering
    Liu, Shan-Zhong
    Song, Xiao-Na
    Wang, Xin-Yong
    FUZZY SYSTEM AND DATA MINING, 2016, 281 : 323 - 330
  • [50] Efficient incremental subspace clustering in data streams
    Kontaki, Maria
    Papadopoulos, Apostolos N.
    Manolopoulos, Yannis
    10TH INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM, PROCEEDINGS, 2006, : 53 - 60