Sample-weighted clustering methods

Cited by: 21
Authors
Yu, Jian [2]
Yang, Miin-Shen [1]
Lee, E. Stanley [3]
Affiliations
[1] Chung Yuan Christian Univ, Dept Appl Math, Chungli 32023, Taiwan
[2] Beijing Jiaotong Univ, Dept Comp Sci, Beijing 100044, Peoples R China
[3] Kansas State Univ, Dept Ind & Mfg Syst Engn, Manhattan, KS 66506 USA
Keywords
Cluster analysis; Maximum entropy principle; k-means; Fuzzy c-means; Sample weights; Robustness; FUZZY C-MEANS; CONVERGENCE PROPERTIES; MEAN SHIFT; ALGORITHM; QUANTIZATION; SELECTION;
DOI
10.1016/j.camwa.2011.07.005
Chinese Library Classification number
O29 [Applied Mathematics]
Discipline classification code
070104
Abstract
Although much research on cluster analysis has considered feature (or variable) weights, little effort has been devoted to sample weights in clustering. In practice, not every sample in a data set is equally important in cluster analysis, so it is worthwhile to obtain proper sample weights for clustering a data set. In this paper, we consider a probability distribution over a data set to represent its sample weights. We then apply the maximum entropy principle to compute these sample weights automatically for clustering. This method can generate sample-weighted versions of most clustering algorithms, such as k-means, fuzzy c-means (FCM), and expectation-maximization (EM). The proposed sample-weighted clustering algorithms are robust for data sets with noise and outliers. We also analyze the convergence properties of the proposed algorithms and use numerical and real data sets for demonstration and comparison. The experimental results demonstrate that the proposed sample-weighted clustering algorithms are effective and robust clustering methods. (C) 2011 Elsevier Ltd. All rights reserved.
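The idea in the abstract, a probability distribution over samples chosen by the maximum entropy principle, can be illustrated with a minimal sketch. It is not the authors' formulation, which this record does not give. It assumes one common entropy-regularized objective: minimize sum_i w_i d_i + beta * sum_i w_i log w_i subject to sum_i w_i = 1, where d_i is the squared distance from sample i to its nearest center. Under that assumption the weights have the closed form w_i proportional to exp(-d_i / beta). The function name `sample_weighted_kmeans` and the parameters `beta` and `init` are hypothetical.

```python
import numpy as np

def sample_weighted_kmeans(X, k, beta=1.0, n_iter=50, init=None, seed=0):
    """Illustrative k-means with entropy-regularized sample weights.

    Assumption (not taken from the paper): weights minimize
    sum_i w_i * d_i + beta * sum_i w_i * log(w_i) with sum_i w_i = 1,
    which gives w_i proportional to exp(-d_i / beta).
    """
    X = np.asarray(X, dtype=float)
    n = X.shape[0]
    if init is None:
        # Hypothetical default: pick k distinct samples as starting centers.
        rng = np.random.default_rng(seed)
        centers = X[rng.choice(n, size=k, replace=False)].copy()
    else:
        centers = np.asarray(init, dtype=float).copy()

    for _ in range(n_iter):
        # Squared Euclidean distance from every sample to every center: (n, k).
        d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = d2.argmin(axis=1)
        d_min = d2[np.arange(n), labels]

        # Closed-form weights; subtract the max logit for numerical stability.
        logits = -d_min / beta
        logits -= logits.max()
        weights = np.exp(logits)
        weights /= weights.sum()

        # Each center becomes the weighted mean of its assigned samples.
        for j in range(k):
            mask = labels == j
            if mask.any() and weights[mask].sum() > 0:
                centers[j] = np.average(X[mask], axis=0, weights=weights[mask])

    return centers, labels, weights
```

In this sketch a sample far from every center receives an exponentially small weight, so it barely moves the center it is assigned to. A larger `beta` pushes the weights toward uniform, which recovers ordinary k-means in the limit.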
Pages: 2200-2208
Number of pages: 9