Stratification-based semi-supervised clustering algorithm for arbitrary shaped datasets

被引:3
|
作者
Wang, Fei [2 ]
Li, Le [1 ]
Liu, Zhiqiang [3 ]
机构
[1] Inner Mongolia Elect Informat Vocat Tech Coll, Hohhot 010000, Peoples R China
[2] Inner Mongolia Peoples Congress, Hohhot 010000, Peoples R China
[3] Inner Mongolia Univ Technol, Hohhot 010000, Peoples R China
基金
中国国家自然科学基金;
关键词
Semi-supervised clustering; Kmeans; Seeded-Kmeans; Partitional clustering; Influence space; K-MEANS; SEARCH;
D O I
10.1016/j.ins.2023.119004
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Semi-supervised clustering is not only an important branch of semi-supervised learning but also an improvement direction for clustering. Semi-supervised clustering algorithms designed based on Kmeans, such as the classical Seeded-Kmeans and Constrained-Kmeans, where supervision information is used to guide clustering iterations, have the same disadvantages as the original Kmeans algorithm: they are confined to the assumption of isotropic spherical clusters, leading to the narrow adaptability in handling data of various characteristics. To solve the problem, we propose the scattered centroids initialization clustering algorithm based on Stratification (SCICS). First, based on the concept of influence space, a method for modeling the cluster -level location of any object is presented, according to which we can obtain well-defined cluster decision boundaries through stratification. On this basis, by extending the seed thought, we propose a semi-supervised subclustering algorithm that can break through the limitations of partitional clustering methods that rely on strict assumptions on particular cluster distributions. Experiments on artificial and real-world datasets show that the proposed algorithm gains the ability of clustering arbitrary shaped data and surpasses the competitors in terms of performance and adaptability.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] A Research of Data Stratification Algorithm based on Semi-supervised Clustering
    Yang, Shaobo
    Yu, Jianmin
    Liu, Yi
    PROCEEDINGS OF 2015 IEEE INTERNATIONAL CONFERENCE ON PROGRESS IN INFORMATCS AND COMPUTING (IEEE PIC), 2015, : 196 - 200
  • [2] Active Semi-Supervised Clustering Algorithm for Multi-Density Datasets
    Atwa, Walid
    Almazroi, Abdulwahab Ali
    Aldhahr, Eman A.
    Janbi, Nourah Fahad
    International Journal of Advanced Computer Science and Applications, 2024, 15 (10) : 493 - 500
  • [3] Novel Semi-supervised Clustering Algorithm for Finding Clusters of Arbitrary Shapes
    Baghshah, Mahdieh Soleymani
    Shouraki, Saeed Bagheri
    ADVANCES IN COMPUTER SCIENCE AND ENGINEERING, 2008, 6 : 876 - 879
  • [4] Semi-supervised fuzzy clustering algorithm based on QPSO
    School of IoT Engineering, Jiangnan University, Wuxi 214122, China
    J. Inf. Comput. Sci., 1 (93-101):
  • [5] A semi-supervised document clustering algorithm based on EM
    Rigutini, L
    Maggini, M
    2005 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE, PROCEEDINGS, 2005, : 200 - 206
  • [6] A Semi-supervised Clustering Algorithm Based on Rough Reduction
    Lin, Liandong
    Qu, Wei
    Yu, Xiang
    CCDC 2009: 21ST CHINESE CONTROL AND DECISION CONFERENCE, VOLS 1-6, PROCEEDINGS, 2009, : 5427 - +
  • [7] Semi-supervised clustering based on affinity propagation algorithm
    Xiao, Yu
    Yu, Jian
    Ruan Jian Xue Bao/Journal of Software, 2008, 19 (11): : 2803 - 2813
  • [8] An Improved Semi-Supervised Clustering Algorithm for Multi-Density Datasets with Fewer Constraints
    Chen, Xiaoyun
    Liu, Sha
    Chen, Tao
    Zhang, Zhengquan
    Zhang, Hairong
    2012 INTERNATIONAL WORKSHOP ON INFORMATION AND ELECTRONICS ENGINEERING, 2012, 29 : 4325 - 4329
  • [9] The Recommendation System Based on Semi-Supervised PSO Clustering Algorithm
    Zhou Wen Min
    Pan Xiu Qin
    Li Rui Xiang
    Lu Yong
    PROCEEDINGS OF THE 2016 INTERNATIONAL FORUM ON MECHANICAL, CONTROL AND AUTOMATION (IFMCA 2016), 2017, 113 : 63 - 71
  • [10] Semi-supervised clustering ensemble based on genetic algorithm model
    Bi, Sheng
    Li, Xiangli
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (18) : 55851 - 55865