Research Progress on Semi-Supervised Clustering

被引:54
|
作者
Qin, Yue [1 ,2 ]
Ding, Shifei [1 ,2 ]
Wang, Lijuan [1 ,2 ,3 ]
Wang, Yanru [1 ,2 ]
机构
[1] China Univ Min & Technol, Sch Comp Sci & Technol, Xuzhou 221116, Jiangsu, Peoples R China
[2] Minist Educ Peoples Republ China, Mine Digitizat Engn Res Ctr, Xuzhou 221116, Jiangsu, Peoples R China
[3] Xu Zhou Coll Ind Technol, Sch Informat & Elect Engn, Xuzhou 221400, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
Semi-supervised learning; Clustering; Semi-supervised clustering; Pairwise constraints; Labeled; CLASSIFICATION; SAMPLES;
D O I
10.1007/s12559-019-09664-w
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semi-supervised clustering is a new learning method which combines semi-supervised learning (SSL) and cluster analysis. It is widely valued and applied to machine learning. Traditional unsupervised clustering algorithm based on data partition does not need any property; however, there are a small amount of independent class labels or pair constraint information data samples in practice; in order to obtain better clustering results, scholars have proposed a semi-supervised clustering. Compared with traditional clustering methods, it can effectively improve clustering performance through a small number of supervised information, and it has been used widely in machine learning. Firstly, this paper introduces the research status and classification of semi-supervised learning and compares the four classification methods as follows: decentralized model, support vector machine, graph, and collaborative training. Secondly, the semi-supervised clustering is described in detail, the current status of semi-supervised clustering is analyzed, and the Cop-kmeans algorithm, Lcop-kmeans algorithm, Seeded-kmeans algorithm, SC-kmeans algorithm, and other algorithms are introduced. The introduction of several semi-supervised clustering methods in this paper can show the advantages of semi-supervised clustering over traditional clustering, and the related literature in recent years is summarized. This paper summarized the latest development of semi-supervised learning and semi-supervised clustering and discussed the application of semi-supervised clustering and the future research direction.
引用
收藏
页码:599 / 612
页数:14
相关论文
共 50 条
  • [41] Convergence Analysis of Semi-supervised Clustering Ensemble
    Chen, Dahai
    Yang, Yan
    Wang, Hongjun
    Mahmood, Amjad
    2013 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST), 2013, : 783 - 788
  • [42] A semi-supervised clustering algorithm for data exploration
    Bouchachia, A
    Pedrycz, W
    FUZZY SETS AND SYSTEMS - IFSA 2003, PROCEEDINGS, 2003, 2715 : 328 - 337
  • [43] MVS-based Semi-Supervised Clustering
    Yan, Yang
    Chen, Lihui
    Chan, Chee Keong
    2013 9TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS AND SIGNAL PROCESSING (ICICS), 2013,
  • [44] Semi-supervised discriminative clustering with graph regularization
    Smieja, Marek
    Myronov, Oleksandr
    Tabor, Jacek
    KNOWLEDGE-BASED SYSTEMS, 2018, 151 : 24 - 36
  • [45] Text Classification Using Semi-Supervised Clustering
    Zhang, Wen
    Yoshida, Taketoshi
    Tang, Xijin
    2009 INTERNATIONAL CONFERENCE ON BUSINESS INTELLIGENCE AND FINANCIAL ENGINEERING, PROCEEDINGS, 2009, : 197 - 200
  • [46] Semi-supervised Clustering Using Heterogeneous Dissimilarities
    Martin-Merino, Manuel
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, 2010, 6218 : 375 - 384
  • [47] A Novel Initialization Method for Semi-supervised Clustering
    Dang, Yanzhong
    Xuan, Zhaoguo
    Rong, Lili
    Liu, Ming
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, 2010, 6291 : 317 - 328
  • [48] Semi-supervised Clustering with Pairwise and Size Constraints
    Zhang, Shaohong
    Wong, Hau-San
    Xie, Dongqing
    PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, : 2450 - 2457
  • [49] Semi-supervised Clustering with Deep Metric Learning
    Li, Xiaocui
    Yin, Hongzhi
    Zhou, Ke
    Chen, Hongxu
    Sadiq, Shazia
    Zhou, Xiaofang
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2019, 11448 : 383 - 386
  • [50] Semi-Supervised Clustering for Short Answer Scoring
    Horbach, Andrea
    Pinkal, Manfred
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 4065 - 4071