Clustering Large Graphs via the Singular Value Decomposition

被引:1
|
作者
P. Drineas
A. Frieze
R. Kannan
S. Vempala
V. Vinay
机构
[1] Rensselaer Polytechnic Institute,Computer Science Department
[2] Carnegie Mellon University,Department of Mathematical Sciences
[3] Yale University,Computer Science Department
[4] M.I.T.,Department of Mathematics
[5] Indian Institute of Science,undefined
来源
Machine Learning | 2004年 / 56卷
关键词
Singular Value Decomposition; randomized algorithms; -means clustering;
D O I
暂无
中图分类号
学科分类号
摘要
We consider the problem of partitioning a set of m points in the n-dimensional Euclidean space into k clusters (usually m and n are variable, while k is fixed), so as to minimize the sum of squared distances between each point and its cluster center. This formulation is usually the objective of the k-means clustering algorithm (Kanungo et al. (2000)). We prove that this problem in NP-hard even for k = 2, and we consider a continuous relaxation of this discrete problem: find the k-dimensional subspace V that minimizes the sum of squared distances to V of the m points. This relaxation can be solved by computing the Singular Value Decomposition (SVD) of the m × n matrix A that represents the m points; this solution can be used to get a 2-approximation algorithm for the original problem. We then argue that in fact the relaxation provides a generalized clustering which is useful in its own right.
引用
收藏
页码:9 / 33
页数:24
相关论文
共 50 条
  • [1] Clustering large graphs via the Singular Value Decomposition
    Drineas, P
    Frieze, A
    Kannan, R
    Vempala, S
    Vinay, V
    [J]. MACHINE LEARNING, 2004, 56 (1-3) : 9 - 33
  • [2] Community detection in graphs using singular value decomposition
    Sarkar, Somwrita
    Dong, Andy
    [J]. PHYSICAL REVIEW E, 2011, 83 (04)
  • [3] Community detection in graphs using singular value decomposition
    Sarkar, Somwrita
    Dong, Andy
    [J]. Physical Review E - Statistical, Nonlinear, and Soft Matter Physics, 2011, 83 (04):
  • [4] Properties of the singular value decomposition for efficient data clustering
    Lee, S
    Hayes, MH
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2004, 11 (11) : 862 - 866
  • [5] System identification via singular value decomposition
    Wang, SH
    Lee, TF
    Zachery, R
    [J]. ELECTRONICS LETTERS, 1996, 32 (01) : 76 - 78
  • [6] Biclustering via Sparse Singular Value Decomposition
    Lee, Mihee
    Shen, Haipeng
    Huang, Jianhua Z.
    Marron, J. S.
    [J]. BIOMETRICS, 2010, 66 (04) : 1087 - 1095
  • [7] System identification via singular value decomposition
    Wang, SH
    Lee, TF
    Zachery, R
    [J]. 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 2638 - 2641
  • [8] Solution for search result clustering by Using Singular Value Decomposition
    Snasel, Vaclav
    Abdulla, Hussam Dahwa
    [J]. 2008 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER THEORY AND ENGINEERING, 2008, : 647 - 651
  • [9] Thresholding and Clustering with Singular Value Decomposition for Alcoholic EEG Analysis
    Rajaguru, Harikumar
    Prabhakar, Sunil Kumar
    [J]. COMPUTATIONAL VISION AND BIO-INSPIRED COMPUTING, 2020, 1108 : 615 - 623
  • [10] Using Singular Value Decomposition as a Solution for Search Result Clustering
    Abdulla, Hussam D.
    Abdelrahman, Abdella S.
    Snasel, Vaclav
    Aldosari, Hamoud
    [J]. PROCEEDINGS OF THE FIFTH INTERNATIONAL CONFERENCE ON INNOVATIONS IN BIO-INSPIRED COMPUTING AND APPLICATIONS (IBICA 2014), 2014, 303 : 53 - 61