TW-k-Means: Automated Two-Level Variable Weighting Clustering Algorithm for Multiview Data

被引:143
|
作者
Chen, Xiaojun [1 ,2 ]
Xu, Xiaofei [3 ]
Huang, Joshua Zhexue [2 ,4 ]
Ye, Yunming [1 ]
机构
[1] Harbin Inst Technol, Shenzhen Grad Sch, C202,HIT Campus Xili Univ Town, Shenzhen 518055, Peoples R China
[2] Shenzhen Univ, Coll Comp Sci & Software, Shenzhen 518060, Peoples R China
[3] Harbin Inst Technol, Dept Comp Sci & Engn, Harbin 150001, Peoples R China
[4] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen Key Lab High Performance Data Min, Shenzhen 518055, Peoples R China
关键词
Data mining; clustering; multiview learning; k-means; variable weighting; SELECTION; OBJECTS;
D O I
10.1109/TKDE.2011.262
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes TW-k-means, an automated two-level variable weighting clustering algorithm for multiview data, which can simultaneously compute weights for views and individual variables. In this algorithm, a view weight is assigned to each view to identify the compactness of the view and a variable weight is also assigned to each variable in the view to identify the importance of the variable. Both view weights and variable weights are used in the distance function to determine the clusters of objects. In the new algorithm, two additional steps are added to the iterative k-means clustering process to automatically compute the view weights and the variable weights. We used two real-life data sets to investigate the properties of two types of weights in TW-k-means and investigated the difference between the weights of TW-k-means and the weights of the individual variable weighting method. The experiments have revealed the convergence property of the view weights in TW-k-means. We compared TW-k-means with five clustering algorithms on three real-life data sets and the results have shown that the TW-k-means algorithm significantly outperformed the other five clustering algorithms in four evaluation indices.
引用
收藏
页码:932 / 944
页数:13
相关论文
共 50 条
  • [1] Automated variable weighting in k-means type clustering
    Huang, JZX
    Ng, MK
    Rong, HQ
    Li, ZC
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2005, 27 (05) : 657 - 668
  • [2] TW-Co-k-means: Two-level weighted collaborative k-means for multi-view clustering
    Zhang, Guang-Yu
    Wang, Chang-Dong
    Huang, Dong
    Zheng, Wei-Shi
    Zhou, Yu-Ren
    KNOWLEDGE-BASED SYSTEMS, 2018, 150 : 127 - 138
  • [3] An iterative algorithm for optimal variable weighting in K-means clustering
    Zhang, Shaonan
    Li, Shanshan
    Hu, Jiaqiao
    Xing, Haipeng
    Zhu, Wei
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2019, 48 (05) : 1346 - 1365
  • [4] Spectral Clustering of Customer Transaction Data With a Two-Level Subspace Weighting Method
    Chen, Xiaojun
    Sun, Wenya
    Wang, Bo
    Li, Zhihui
    Wang, Xizhao
    Ye, Yunming
    IEEE TRANSACTIONS ON CYBERNETICS, 2019, 49 (09) : 3230 - 3241
  • [5] Two-level k-means clustering algorithm for k-τ relationship establishment and linear-time classification
    Chitta, Radha
    Murty, M. Narasimha
    PATTERN RECOGNITION, 2010, 43 (03) : 796 - 804
  • [6] Automated Attribute Weighting Fuzzy k-Centers Algorithm for Categorical Data Clustering
    Mau, Toan Nguyen
    Huynh, Van-Nam
    MODELING DECISIONS FOR ARTIFICIAL INTELLIGENCE (MDAI 2021), 2021, 12898 : 205 - 217
  • [7] A Heuristically Weighting K-Means Algorithm for Subspace Clustering
    Li, Boyang
    Jiang, Qingshan
    Chen, Lifei
    2008 2ND INTERNATIONAL CONFERENCE ON ANTI-COUNTERFEITING, SECURITY AND IDENTIFICATION, 2008, : 268 - +
  • [8] Co-clustering algorithms for distributional data with automated variable weighting
    De Carvalho, Francisco de A. T.
    Balzanella, Antonio
    Irpino, Antonio
    Verde, Rosanna
    INFORMATION SCIENCES, 2021, 549 : 87 - 115
  • [9] Variable Weighting in Fuzzy k-Means Clustering to Determine the Number of Clusters
    Khan, Imran
    Luo, Zongwei
    Huang, Joshua Zhexue
    Shahzad, Waseem
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2020, 32 (09) : 1838 - 1853
  • [10] A PRELIMINARY-STUDY OF OPTIMAL VARIABLE WEIGHTING IN K-MEANS CLUSTERING
    GREEN, PE
    CARMONE, FJ
    KIM, J
    JOURNAL OF CLASSIFICATION, 1990, 7 (02) : 271 - 285