Stratification-based semi-supervised clustering algorithm for arbitrary shaped datasets

被引:3
|
作者
Wang, Fei [2 ]
Li, Le [1 ]
Liu, Zhiqiang [3 ]
机构
[1] Inner Mongolia Elect Informat Vocat Tech Coll, Hohhot 010000, Peoples R China
[2] Inner Mongolia Peoples Congress, Hohhot 010000, Peoples R China
[3] Inner Mongolia Univ Technol, Hohhot 010000, Peoples R China
基金
中国国家自然科学基金;
关键词
Semi-supervised clustering; Kmeans; Seeded-Kmeans; Partitional clustering; Influence space; K-MEANS; SEARCH;
D O I
10.1016/j.ins.2023.119004
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Semi-supervised clustering is not only an important branch of semi-supervised learning but also an improvement direction for clustering. Semi-supervised clustering algorithms designed based on Kmeans, such as the classical Seeded-Kmeans and Constrained-Kmeans, where supervision information is used to guide clustering iterations, have the same disadvantages as the original Kmeans algorithm: they are confined to the assumption of isotropic spherical clusters, leading to the narrow adaptability in handling data of various characteristics. To solve the problem, we propose the scattered centroids initialization clustering algorithm based on Stratification (SCICS). First, based on the concept of influence space, a method for modeling the cluster -level location of any object is presented, according to which we can obtain well-defined cluster decision boundaries through stratification. On this basis, by extending the seed thought, we propose a semi-supervised subclustering algorithm that can break through the limitations of partitional clustering methods that rely on strict assumptions on particular cluster distributions. Experiments on artificial and real-world datasets show that the proposed algorithm gains the ability of clustering arbitrary shaped data and surpasses the competitors in terms of performance and adaptability.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] Semi-Supervised Clustering Fingerprint Positioning Algorithm Based on Distance Constraints
    Ying Xia
    Zhongzhao Zhang
    Lin Ma
    Yao Wang
    Journal of Harbin Institute of Technology(New series), 2015, (06) : 55 - 61
  • [22] A Semi-supervised Clustering Algorithm Based on Must-Link Set
    Huang, Haichao
    Cheng, Yong
    Zhao, Ruilian
    ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2008, 5139 : 492 - 499
  • [23] An improved semi-supervised clustering algorithm based on initial center points
    Xia, Z. (xiazg@cumt.edu.cn), 1600, Advanced Institute of Convergence Information Technology (07):
  • [24] K-means clustering algorithm based on semi-supervised learning
    Department of Mathematics and Computer, Shangrao Normal College, Shangrao 334001, China
    不详
    J. Comput. Inf. Syst., 2008, 5 (2007-2013):
  • [25] Regularized semi-supervised KLFDA algorithm based on density peak clustering
    Xinmin Tao
    Yixuan Bao
    Xiaohan Zhang
    Tian Liang
    Lin Qi
    Zhiting Fan
    Shan Huang
    Neural Computing and Applications, 2022, 34 : 19791 - 19817
  • [26] Semi-supervised Affinity Propagation Clustering Algorithm Based On Kernel Function
    Zhao Xiaoqiang
    Xie Yaping
    2015 27TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2015, : 3275 - 3279
  • [27] Regularized semi-supervised KLFDA algorithm based on density peak clustering
    Tao, Xinmin
    Bao, Yixuan
    Zhang, Xiaohan
    Liang, Tian
    Qi, Lin
    Fan, Zhiting
    Huang, Shan
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (22): : 19791 - 19817
  • [28] Research of semi-supervised spectral clustering algorithm based on pairwise constraints
    Shifei Ding
    Hongjie Jia
    Liwen Zhang
    Fengxiang Jin
    Neural Computing and Applications, 2014, 24 : 211 - 219
  • [29] Research of semi-supervised spectral clustering algorithm based on pairwise constraints
    Ding, Shifei
    Jia, Hongjie
    Zhang, Liwen
    Jin, Fengxiang
    NEURAL COMPUTING & APPLICATIONS, 2014, 24 (01): : 211 - 219
  • [30] Research of Immune Intrusion Detection Algorithm Based on Semi-supervised Clustering
    Wang, Xiaowei
    ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, PT II, 2011, 7003 : 69 - 74