A review on semi-supervised clustering

被引:38
|
作者
Cai, Jianghui [1 ,2 ]
Hao, Jing [1 ]
Yang, Haifeng [1 ,3 ]
Zhao, Xujun [1 ,3 ]
Yang, Yuqing [1 ]
机构
[1] Taiyuan Univ Sci & Technol TYUST, Taiyuan 030024, Shanxi, Peoples R China
[2] North Univ China NUC, Taiyuan 030051, Shanxi, Peoples R China
[3] Shanxi Key Lab Big Data Anal & Parallel Comp, Taiyuan, Peoples R China
基金
中国国家自然科学基金;
关键词
Semi-supervised clustering; Constraints K-means; Semi-supervised fuzzy clustering; Constraints spectral clustering; NMF-based semi-supervised clustering; Random subspace-based semi-supervised; clustering; NONNEGATIVE MATRIX FACTORIZATION; FRAMEWORK; CONSTRAINTS; ALGORITHM; EFFICIENT; ENSEMBLE; CLASSIFICATION; NETWORK; SEARCH; LEVEL;
D O I
10.1016/j.ins.2023.02.088
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Semi-supervised clustering (SSC), a technique integrating semi-supervised learning and clustering analysis, incorporates the given prior information (e.g., class labels and pairwise constraints) into clustering to guide the clustering process and improve the performance. In recent years, a large number of valuable works have emerged, focusing on theoretical research and application in different fields. In this paper, a detailed review of SSC is provided from a new perspective. Firstly, all SSC studies are organized as partition-based SSC, hierarchical-based SSC, density-based SSC, graph-based SSC, neural network-based SSC, Nonnegative Matrix Factorization-based SSC and random subspace technique-based SSC. Thus, the semi-supervised researches can be in-depth discussed in each clustering idea. Secondly, the general overviews are detailed in each category respectively, including the performance, the suitable scenarios and the way to add supervising information. Thirdly, the recent successful applications of SSC are summarized according to different backgrounds such as medical, biological, business, journalism, financial and so on. Based on this, some application caveats and development trends of SSC are particularly given in the end. This comprehensive review and analysis of SSC can provide an overall outline, the scope of research topics, and a relative complete analysis of existing SSC methods for researchers.
引用
收藏
页码:164 / 200
页数:37
相关论文
共 50 条
  • [2] SEMI-SUPERVISED SPECTRAL CLUSTERING
    Mai, Xiaoyi
    Couillet, Romain
    [J]. 2018 CONFERENCE RECORD OF 52ND ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2018, : 2012 - 2016
  • [3] Semi-supervised clustering methods
    Bair, Eric
    [J]. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2013, 5 (05): : 349 - 361
  • [4] Semi-Supervised Clustering for Architectural Modularisation
    Feist, Sofia
    Sanhudo, Luis
    Esteves, Vitor
    Pires, Miguel
    Costa, Antonio Aguiar
    [J]. BUILDINGS, 2022, 12 (03)
  • [5] Semi-supervised clustering with soft labels
    Nebu, Cynthia Marea
    Joseph, Sumy
    [J]. 2015 INTERNATIONAL CONFERENCE ON CONTROL COMMUNICATION & COMPUTING INDIA (ICCC), 2015, : 612 - 616
  • [6] Semi-supervised spectral clustering ensemble
    [J]. 1600, ICIC Express Letters Office (10):
  • [7] Spectral clustering: A semi-supervised approach
    Chen, Weifu
    Feng, Guocan
    [J]. NEUROCOMPUTING, 2012, 77 (01) : 229 - 242
  • [8] Research Progress on Semi-Supervised Clustering
    Yue Qin
    Shifei Ding
    Lijuan Wang
    Yanru Wang
    [J]. Cognitive Computation, 2019, 11 : 599 - 612
  • [9] Semi-supervised clustering of unknown expressions
    Jalal, Ahsan
    Tariq, Usman
    [J]. PATTERN RECOGNITION LETTERS, 2019, 120 : 46 - 53
  • [10] Image Annotation with Semi-Supervised Clustering
    Sayar, Ahmet
    Yannan-Vural, Fatos T.
    [J]. 2008 IEEE 16TH SIGNAL PROCESSING, COMMUNICATION AND APPLICATIONS CONFERENCE, VOLS 1 AND 2, 2008, : 517 - 520