Clustering in applications with multiple data sources: A mutual subspace clustering approach

Cited by: 6
Authors
Hua, Ming [2]
Pei, Jian [1]
Affiliations
[1] Simon Fraser Univ, Sch Comp Sci, Burnaby, BC V5A 1S6, Canada
[2] Facebook Inc, Palo Alto, CA USA
Funding
Natural Sciences and Engineering Research Council of Canada
Keywords
Clustering; Multiple sources
DOI
10.1016/j.neucom.2011.08.032
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In many applications, such as bioinformatics and cross-market customer relationship management, there are data from multiple sources jointly describing the same set of objects. An important data mining task is to find interesting groups of objects that form clusters in subspaces of the data sources jointly supported by those data sources. In this paper, we study a novel problem of mining mutual subspace clusters from multiple sources. We develop two interesting models and the corresponding methods for mutual subspace clustering. The density-based model identifies dense regions in subspaces as clusters. The bottom-up method searches for density-based mutual subspace clusters systematically from low-dimensional subspaces to high-dimensional ones. The partitioning model divides points in a data set into k exclusive clusters and a signature subspace is found for each cluster, where k is the number of clusters desired by a user. The top-down method interleaves the well-known k-means clustering procedures in multiple sources. We use experimental results on synthetic data sets and real data sets to report the effectiveness and the efficiency of the methods. (c) 2012 Elsevier B.V. All rights reserved.
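As a rough illustration of the top-down idea mentioned in the abstract (interleaving k-means across sources), the sketch below alternates one k-means step in each of two sources over a shared label vector. It is a minimal sketch under stated assumptions, not the authors' method: the function names (`centroids`, `assign`, `interleaved_kmeans`), the two-source setup, the squared Euclidean distance, and the stopping rule are all hypothetical choices, and the signature-subspace selection described in the abstract is omitted entirely.

```python
from typing import List, Optional, Sequence

Point = Sequence[float]


def centroids(points: List[Point], labels: List[int], k: int) -> List[Optional[List[float]]]:
    """Mean vector of each cluster; an empty cluster yields None."""
    dim = len(points[0])
    sums = [[0.0] * dim for _ in range(k)]
    counts = [0] * k
    for p, c in zip(points, labels):
        counts[c] += 1
        for j, v in enumerate(p):
            sums[c][j] += v
    return [
        [s / counts[c] for s in sums[c]] if counts[c] else None
        for c in range(k)
    ]


def assign(points: List[Point], cents: List[Optional[List[float]]]) -> List[int]:
    """Assign each point to its nearest non-empty centroid (squared Euclidean)."""
    labels = []
    for p in points:
        best, best_d = -1, float("inf")
        for c, cen in enumerate(cents):
            if cen is None:
                continue  # skip clusters that currently have no members
            d = sum((a - b) ** 2 for a, b in zip(p, cen))
            if d < best_d:
                best, best_d = c, d
        labels.append(best)
    return labels


def interleaved_kmeans(
    source_a: List[Point],
    source_b: List[Point],
    init_labels: List[int],
    k: int,
    max_rounds: int = 20,
) -> List[int]:
    """Alternate one k-means step per source on a shared labeling.

    Assumption: row i of source_a and row i of source_b describe the same object.
    """
    labels = list(init_labels)
    for _ in range(max_rounds):
        previous = labels
        for source in (source_a, source_b):
            labels = assign(source, centroids(source, labels, k))
        if labels == previous:  # a full round changed nothing
            break
    return labels


if __name__ == "__main__":
    a = [[0.0], [0.1], [5.0], [5.1]]
    b = [[1.0, 1.0], [1.1, 0.9], [9.0, 9.0], [9.2, 8.8]]
    print(interleaved_kmeans(a, b, [0, 1, 0, 1], k=2))
```

Sharing one label vector across sources is what makes the two k-means runs "mutual" in this toy version: each source refines the grouping the other just produced, so the loop settles only on a labeling that both sources accept.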
Pages: 133-144
Number of pages: 12