Stratification-based semi-supervised clustering algorithm for arbitrary shaped datasets

Cited by: 3
Authors
Wang, Fei [2]
Li, Le [1]
Liu, Zhiqiang [3]
Affiliations
[1] Inner Mongolia Elect Informat Vocat Tech Coll, Hohhot 010000, Peoples R China
[2] Inner Mongolia Peoples Congress, Hohhot 010000, Peoples R China
[3] Inner Mongolia Univ Technol, Hohhot 010000, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Semi-supervised clustering; Kmeans; Seeded-Kmeans; Partitional clustering; Influence space; K-MEANS; SEARCH;
DOI
10.1016/j.ins.2023.119004
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Semi-supervised clustering is both an important branch of semi-supervised learning and a direction for improving clustering. Semi-supervised clustering algorithms built on Kmeans, such as the classical Seeded-Kmeans and Constrained-Kmeans, use supervision information to guide the clustering iterations, but they share the disadvantages of the original Kmeans algorithm: they are confined to the assumption of isotropic spherical clusters, which limits their adaptability to data with varied characteristics. To solve this problem, we propose the scattered centroids initialization clustering algorithm based on stratification (SCICS). First, based on the concept of influence space, a method for modeling the cluster-level location of any object is presented, according to which well-defined cluster decision boundaries can be obtained through stratification. On this basis, by extending the seeding idea, we propose a semi-supervised subclustering algorithm that overcomes the limitations of partitional clustering methods that rely on strict assumptions about particular cluster distributions. Experiments on artificial and real-world datasets show that the proposed algorithm is able to cluster arbitrarily shaped data and surpasses its competitors in performance and adaptability.
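For readers unfamiliar with the baseline named in the abstract, the following is a minimal sketch of the classical Seeded-Kmeans idea, not of SCICS itself: labelled seed points are used only to initialize the centroids, after which ordinary Kmeans iterations run. The function name `seeded_kmeans`, its parameters, and the toy data are illustrative assumptions and do not come from the paper.

```python
import numpy as np

def seeded_kmeans(X, seed_idx, seed_labels, n_iter=100):
    """Hypothetical helper: Kmeans whose centroids start at the seed means."""
    X = np.asarray(X, dtype=float)
    seed_idx = np.asarray(seed_idx)
    seed_labels = np.asarray(seed_labels)
    classes = np.unique(seed_labels)

    # Initial centroid of each cluster = mean of its labelled seed points.
    centroids = np.stack(
        [X[seed_idx[seed_labels == c]].mean(axis=0) for c in classes]
    )

    def nearest(cents):
        # Index of the closest centroid (Euclidean distance) for every point.
        dists = np.linalg.norm(X[:, None, :] - cents[None, :, :], axis=2)
        return dists.argmin(axis=1)

    for _ in range(n_iter):
        assign = nearest(centroids)
        # Recompute centroids; keep the old one if a cluster becomes empty.
        new_centroids = np.stack([
            X[assign == k].mean(axis=0) if np.any(assign == k) else centroids[k]
            for k in range(len(classes))
        ])
        if np.allclose(new_centroids, centroids):
            break
        centroids = new_centroids

    return classes[nearest(centroids)], centroids

# Toy data (assumed for illustration): two well-separated spherical blobs.
rng = np.random.default_rng(0)
blob_a = rng.normal(loc=(0.0, 0.0), scale=0.3, size=(50, 2))
blob_b = rng.normal(loc=(5.0, 5.0), scale=0.3, size=(50, 2))
X = np.vstack([blob_a, blob_b])

# Two labelled seeds per cluster.
labels, centroids = seeded_kmeans(
    X, seed_idx=[0, 1, 50, 51], seed_labels=[0, 0, 1, 1]
)
```

Because every point is assigned to its nearest centroid in Euclidean distance, a sketch like this still carves the space into convex regions, which is the spherical-cluster limitation the abstract describes for Kmeans-based methods.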
Pages: 18