TW-k-Means: Automated Two-Level Variable Weighting Clustering Algorithm for Multiview Data

被引:143
|
作者
Chen, Xiaojun [1 ,2 ]
Xu, Xiaofei [3 ]
Huang, Joshua Zhexue [2 ,4 ]
Ye, Yunming [1 ]
机构
[1] Harbin Inst Technol, Shenzhen Grad Sch, C202,HIT Campus Xili Univ Town, Shenzhen 518055, Peoples R China
[2] Shenzhen Univ, Coll Comp Sci & Software, Shenzhen 518060, Peoples R China
[3] Harbin Inst Technol, Dept Comp Sci & Engn, Harbin 150001, Peoples R China
[4] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen Key Lab High Performance Data Min, Shenzhen 518055, Peoples R China
关键词
Data mining; clustering; multiview learning; k-means; variable weighting; SELECTION; OBJECTS;
D O I
10.1109/TKDE.2011.262
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes TW-k-means, an automated two-level variable weighting clustering algorithm for multiview data, which can simultaneously compute weights for views and individual variables. In this algorithm, a view weight is assigned to each view to identify the compactness of the view and a variable weight is also assigned to each variable in the view to identify the importance of the variable. Both view weights and variable weights are used in the distance function to determine the clusters of objects. In the new algorithm, two additional steps are added to the iterative k-means clustering process to automatically compute the view weights and the variable weights. We used two real-life data sets to investigate the properties of two types of weights in TW-k-means and investigated the difference between the weights of TW-k-means and the weights of the individual variable weighting method. The experiments have revealed the convergence property of the view weights in TW-k-means. We compared TW-k-means with five clustering algorithms on three real-life data sets and the results have shown that the TW-k-means algorithm significantly outperformed the other five clustering algorithms in four evaluation indices.
引用
收藏
页码:932 / 944
页数:13
相关论文
共 50 条
  • [21] A weighting k-modes algorithm for subspace clustering of categorical data
    Cao, Fuyuan
    Liang, Jiye
    Li, Deyu
    Zhao, Xingwang
    NEUROCOMPUTING, 2013, 108 : 23 - 30
  • [22] Standardization and weighting of variables for the fuzzy K-means clustering of discontinuity data
    Hammah, RE
    Curran, JH
    PACIFIC ROCKS 2000: ROCK AROUND THE RIM, 2000, : 659 - 666
  • [23] A Two-level Moderated Latent Variable Model with Single Level Data
    Liu, Hongyun
    Yuan, Ke-Hai
    Liu, Fang
    MULTIVARIATE BEHAVIORAL RESEARCH, 2020, 55 (06) : 873 - 893
  • [24] Parallel weighting K-means clustering algorithm based on graphics processing unit
    Huang, Xiaohui
    Xiong, Liyan
    Wang, Juan
    Ye, Yunming
    Li, Chuan
    Journal of Information and Computational Science, 2015, 12 (18): : 7031 - 7040
  • [25] The SKM Algorithm: A K-Means Algorithm for Clustering Sequential Data
    Dias, Jose G.
    Cortinhal, Maria Joao
    ADVANCES IN ARTIFICIAL INTELLIGENCE - IBERAMIA 2008, PROCEEDINGS, 2008, 5290 : 173 - 182
  • [26] Comparative analysis of clustering methodology and application for market segmentation: K-means, SOM and a two-level SOM
    Lee, Sang-Chul
    Gu, Ja-Chul
    Suh, Yung-Ho
    FOUNDATIONS OF INTELLIGENT SYSTEMS, PROCEEDINGS, 2006, 4203 : 435 - 444
  • [27] A Novel Sample Weighting K-Means Clustering Algorithm based on Angles Information
    Gu, Lei
    2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 3697 - 3702
  • [28] PCA-Guided k-Means with Variable Weighting and Its Application to Document Clustering
    Honda, Katsuhiro
    Notsu, Akira
    Ichihashi, Hidetomo
    MODELING DECISIONS FOR ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2009, 5861 : 282 - 292
  • [29] Two-Stage Clustering with k-Means Algorithm
    Salman, Raied
    Kecman, Vojislav
    Li, Qi
    Strack, Robert
    Test, Erick
    RECENT TRENDS IN WIRELESS AND MOBILE NETWORKS, 2011, 162 : 110 - 122
  • [30] Parallelization of K-Means Clustering Algorithm for Data Mining
    Jiang, Hao
    Yu, Liyan
    4TH ANNUAL INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND APPLICATIONS (ITA 2017), 2017, 12