Particle swarm optimizer for variable weighting in clustering high-dimensional data

Authors
Yanping Lu
Shengrui Wang
Shaozi Li
Changle Zhou
Affiliations
[1] University of Sherbrooke, Department of Computer Science
[2] Xiamen University, Department of Cognitive Science
Source
Machine Learning | 2011 / Vol. 82
Keywords
High-dimensional data; Projected clustering; Variable weighting; Particle swarm optimization; Text clustering
Abstract
In this paper, we present a particle swarm optimizer (PSO) to solve the variable weighting problem in projected clustering of high-dimensional data. Many subspace clustering algorithms fail to yield good cluster quality because they do not employ an efficient search strategy. We focus on soft projected clustering. We design a suitable k-means objective weighting function, in which a change of variable weights is exponentially reflected. We also transform the original constrained variable weighting problem into a problem with bound constraints, using a normalized representation of variable weights, and we utilize a particle swarm optimizer to minimize the objective function in order to search for global optima of the variable weighting problem in clustering. Our experimental results on both synthetic and real data show that the proposed algorithm greatly improves cluster quality. In addition, the results of the new algorithm are much less dependent on the initial cluster centroids. In an application to text clustering, we show that the algorithm can be easily adapted to other similarity measures, such as the extended Jaccard coefficient for text data, and can be very effective.
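The idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the exact objective or PSO variant, so the code below assumes a W-k-means-style cost in which each cluster's weights are normalized to sum to 1 and raised to an exponent `beta`, and it uses a plain global-best PSO with box bounds. The names `normalize`, `weighted_objective`, `pso_variable_weights`, the bounds `[1e-3, 1]`, the PSO coefficients, and the fixed centroids `Z` are all hypothetical choices made for illustration.

```python
import random


def normalize(pos, k, d):
    """Split a flat list of k*d bounded values into k weight rows summing to 1."""
    rows = [pos[l * d:(l + 1) * d] for l in range(k)]
    return [[v / sum(row) for v in row] for row in rows]


def weighted_objective(pos, X, Z, beta=2.0):
    """Assumed cost: each point pays its smallest weighted squared distance
    to any centroid, with normalized weights raised to the power beta."""
    k, d = len(Z), len(Z[0])
    W = normalize(pos, k, d)
    total = 0.0
    for x in X:
        total += min(
            sum(W[l][j] ** beta * (x[j] - Z[l][j]) ** 2 for j in range(d))
            for l in range(k)
        )
    return total


def pso_variable_weights(X, Z, n_particles=20, n_iter=60, beta=2.0, seed=0):
    """Global-best PSO over raw weights; centroids Z are kept fixed here,
    which is a simplification made for this sketch."""
    rng = random.Random(seed)
    k, d = len(Z), len(Z[0])
    dim = k * d
    # Bound constraints. The lower bound is strictly positive so that a row
    # can never become all zeros, which would make normalization undefined.
    lo, hi = 1e-3, 1.0
    pos = [[rng.uniform(lo, hi) for _ in range(dim)] for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    pbest = [p[:] for p in pos]
    pbest_cost = [weighted_objective(p, X, Z, beta) for p in pos]
    g = min(range(n_particles), key=pbest_cost.__getitem__)
    gbest, gbest_cost = pbest[g][:], pbest_cost[g]
    for _ in range(n_iter):
        for i in range(n_particles):
            for t in range(dim):
                # Inertia plus attraction to the personal and global bests.
                vel[i][t] = (0.72 * vel[i][t]
                             + 1.49 * rng.random() * (pbest[i][t] - pos[i][t])
                             + 1.49 * rng.random() * (gbest[t] - pos[i][t]))
                # Clip the new position back into the box [lo, hi].
                pos[i][t] = min(hi, max(lo, pos[i][t] + vel[i][t]))
            cost = weighted_objective(pos[i], X, Z, beta)
            if cost < pbest_cost[i]:
                pbest[i], pbest_cost[i] = pos[i][:], cost
                if cost < gbest_cost:
                    gbest, gbest_cost = pos[i][:], cost
    return normalize(gbest, k, d), gbest_cost
```

Calling `pso_variable_weights(X, Z)` with `X` as a list of points and `Z` as a list of centroids returns one normalized weight row per cluster together with the best cost found. Searching over raw values in a box and normalizing inside the objective is one way to replace the sum-to-one constraint with simple bound constraints, in the spirit of the transformation the abstract describes.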
Pages: 43-70
Page count: 27