Adaptive Explicit Kernel Minkowski Weighted K-means

被引:15
|
作者
Aradnia, Amir [1 ]
Haeri, Maryam Amir [2 ]
Ebadzadeh, Mohammad Mehdi [1 ]
机构
[1] Amirkabir Univ Technol, Dept Comp Engn, Tehran, Iran
[2] Univ Twente, Learning Data Analyt & Technol Dept, Enschede, Netherlands
关键词
Kernel clustering; Minkowski metric; Features map; K-means; ALGORITHM; SPACES;
D O I
10.1016/j.ins.2021.10.048
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The K-means algorithm is among the most commonly used data clustering methods. However, the regular K-means can only be applied in the input space, and it is applicable when clusters are linearly separable. The kernel K-means, which extends K-means into the kernel space, is able to capture nonlinear structures and identify arbitrarily shaped clusters. However, kernel methods often operate on the kernel matrix of the data, which scale poorly with the size of the matrix, or suffer from the high clustering cost due to the repetitive calculations of kernel values. Another issue is that algorithms access the data only through evaluation of K (x(i), x(j)), which limits many processes that can be done on data through the clustering task. This paper proposes a method to combine the advantages of the linear and nonlinear approaches by using derived corresponding approximate finite-dimensional feature maps based on spectral analysis. Applying approximate finite-dimensional feature maps have been discussed before only in the context of Support Vector Machines (SVM) problems. We suggest using this method in the kernel K-means context as it does not require storing a huge kernel matrix in memory, calculates cluster centers more efficiently, and accesses the data explicitly in the feature space; thus taking advantage of K-means extensions in that space. We demonstrate that our Explicit Kernel Minkowski Weighted K-means (Explicit KMWK-means) method is able to achieve high accuracy in terms of cluster recovery in the new space by applying additional Minkowski exponent and feature weights. The proposed method is evaluated by four benchmark data sets, and its performance is compared with the commonly used kernel clustering approaches. Experiments show the proposed method consistently achieves superior clustering performances while reducing the memory consumption. (C) 2021 The Authors. Published by Elsevier Inc.
引用
收藏
页码:503 / 518
页数:16
相关论文
共 50 条
  • [41] Entropy Weighted Power k-Means Clustering
    Chakraborty, Saptarshi
    Paul, Debolina
    Das, Swagatam
    Xu, Jason
    [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 691 - 700
  • [42] Weighted adjacent matrix for K-means clustering
    Zhou, Jukai
    Liu, Tong
    Zhu, Jingting
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (23) : 33415 - 33434
  • [43] Adaptive K-Means clustering algorithm
    Chen, Hailin
    Wu, Xiuqing
    Hu, Junhua
    [J]. MIPPR 2007: PATTERN RECOGNITION AND COMPUTER VISION, 2007, 6788
  • [44] Weighted adjacent matrix for K-means clustering
    Jukai Zhou
    Tong Liu
    Jingting Zhu
    [J]. Multimedia Tools and Applications, 2019, 78 : 33415 - 33434
  • [45] Adaptive Sampling for k-Means Clustering
    Aggarwal, Ankit
    Deshpande, Amit
    Kannan, Ravi
    [J]. APPROXIMATION, RANDOMIZATION, AND COMBINATORIAL OPTIMIZATION: ALGORITHMS AND TECHNIQUES, 2009, 5687 : 15 - +
  • [46] K-TRACE:: A kernel k-means procedure for classification
    Cifarelli, C.
    Nieddu, L.
    Seref, O.
    Pardalos, P. M.
    [J]. COMPUTERS & OPERATIONS RESEARCH, 2007, 34 (10) : 3154 - 3161
  • [47] A Combination of Particle Swarm Optimization and Minkowski Weighted K-Means Clustering: Application in Lateralization of Temporal Lobe Epilepsy
    Jamali-Dinan, Samira-Sadat
    Soltanian-Zadeh, Hamid
    Bowyer, Susan M.
    Almohri, Haidar
    Dehghani, Hamed
    Elisevich, Kost
    Nazem-Zadeh, Mohammad-Reza
    [J]. BRAIN TOPOGRAPHY, 2020, 33 (04) : 519 - 532
  • [48] A Combination of Particle Swarm Optimization and Minkowski Weighted K-Means Clustering: Application in Lateralization of Temporal Lobe Epilepsy
    Samira-Sadat Jamali-Dinan
    Hamid Soltanian-Zadeh
    Susan M. Bowyer
    Haidar Almohri
    Hamed Dehghani
    Kost Elisevich
    Mohammad-Reza Nazem-Zadeh
    [J]. Brain Topography, 2020, 33 : 519 - 532
  • [49] Under-sampling Algorithm with Weighted Distance Based on Adaptive K-Means Clustering
    Qian, Zhou
    Zhen, Yao
    Bo, Sun
    [J]. Data Analysis and Knowledge Discovery, 2022, 6 (05) : 127 - 136
  • [50] Regularized Simple Multiple Kernel k-Means With Kernel Average Alignment
    Li, Miaomiao
    Zhang, Yi
    Ma, Chuan
    Liu, Suyuan
    Liu, Zhe
    Yin, Jianping
    Liu, Xinwang
    Liao, Qing
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, : 1 - 10