Unsupervised locally embedded clustering for automatic high-dimensional data labeling

被引:0
|
作者
Fu, Yun [1 ]
Huang, Thomas S. [1 ]
机构
[1] Univ Illinois, Beckman Inst Adv Sci & Technol, Urbana, IL 61801 USA
基金
美国国家科学基金会;
关键词
LEA; LEC; manifold; high-dimensional data clustering; dimensionality reduction;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In most machine learning and pattern recognition problems, the large number of high-dimensional sensory data, such as images and videos, are often labeled manually for training classifiers and modeling features, which is time-consuming and tedious. To automatically execute this process by machine, we present the unsupervised high-dimensional data clustering and automatic labeling algorithms, called Locally Embedded Clustering (LEC): (i) Constructing the neighborhood weighted graph with an appropriate distance metric; (ii) Tuning the regularization parameter to smooth the approximated manifold; (iii) Calculating the unified projection in a closed-form solution for the embedding and dimensionality reduction; (iv) Choosing the top or bottom coordinates of the embedded low-dimensional space for data representation; (v) Normalizing the low-dimensional representation to have unit length; (vi) Clustering and labeling the data via K-means. Experimental results demonstrate that LEC provides better data representation, more efficient dimensionality reduction and better clustering performance than many existing methods.
引用
收藏
页码:1057 / +
页数:2
相关论文
共 50 条
  • [1] Unsupervised Deep Embedded Clustering for High-Dimensional Visual Features of Fashion Images
    Malhi, Umar Subhan
    Zhou, Junfeng
    Yan, Cairong
    Rasool, Abdur
    Siddeeq, Shahbaz
    Du, Ming
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (05):
  • [2] Discriminative Embedded Clustering: A Framework for Grouping High-Dimensional Data
    Hou, Chenping
    Nie, Feiping
    Yi, Dongyun
    Tao, Dacheng
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (06) : 1287 - 1299
  • [3] Visualization and unsupervised predictive clustering of high-dimensional multimodal neuroimaging data
    Mwangi, Benson
    Soares, Jair C.
    Hasan, Khader M.
    [J]. JOURNAL OF NEUROSCIENCE METHODS, 2014, 236 : 19 - 25
  • [4] High-dimensional data clustering
    Bouveyron, C.
    Girard, S.
    Schmid, C.
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2007, 52 (01) : 502 - 519
  • [5] Clustering High-Dimensional Data
    Masulli, Francesco
    Rovetta, Stefano
    [J]. CLUSTERING HIGH-DIMENSIONAL DATA, CHDD 2012, 2015, 7627 : 1 - 13
  • [6] Clustering of High-Dimensional and Correlated Data
    McLachlan, Geoffrey J.
    Ng, Shu-Kay
    Wang, K.
    [J]. DATA ANALYSIS AND CLASSIFICATION, 2010, : 3 - 11
  • [7] Clustering in high-dimensional data spaces
    Murtagh, FD
    [J]. STATISTICAL CHALLENGES IN ASTRONOMY, 2003, : 279 - 292
  • [8] Compressive Clustering of High-dimensional Data
    Ruta, Andrzej
    Porikli, Fatih
    [J]. 2012 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2012), VOL 1, 2012, : 380 - 385
  • [9] Locally fitting hyperplanes to high-dimensional data
    M. Hou
    C. Kambhampati
    [J]. Neural Computing and Applications, 2022, 34 : 8885 - 8896
  • [10] Locally fitting hyperplanes to high-dimensional data
    Hou, M.
    Kambhampati, C.
    [J]. NEURAL COMPUTING & APPLICATIONS, 2022, 34 (11): : 8885 - 8896