Semi-supervised clustering methods

被引:110
|
作者
Bair, Eric [1 ,2 ]
机构
[1] Univ N Carolina, Dept Endodont, Chapel Hill, NC 27599 USA
[2] Univ N Carolina, Dept Biostat, Chapel Hill, NC 27599 USA
关键词
cluster analysis; high-dimensional data; semi-supervised methods; machine learning;
D O I
10.1002/wics.1270
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Cluster analysis methods seek to partition a data set into homogeneous subgroups. It is useful in a wide variety of applications, including document processing and modern genetics. Conventional clustering methods are unsupervised, meaning that there is no outcome variable nor is anything known about the relationship between the observations in the data set. In many situations, however, information about the clusters is available in addition to the values of the features. For example, the cluster labels of some observations may be known, or certain observations may be known to belong to the same cluster. In other cases, one may wish to identify clusters that are associated with a particular outcome variable. This review describes several clustering algorithms (known as 'semi-supervised clustering' methods) that can be applied in these situations. The majority of these methods are modifications of the popular k-means clustering method, and several of them will be described in detail. A brief description of some other semi-supervised clustering algorithms is also provided. (C) 2013 Wiley Periodicals, Inc.
引用
收藏
页码:349 / 361
页数:13
相关论文
共 50 条
  • [1] SEMI-SUPERVISED SPECTRAL CLUSTERING
    Mai, Xiaoyi
    Couillet, Romain
    [J]. 2018 CONFERENCE RECORD OF 52ND ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2018, : 2012 - 2016
  • [2] A review on semi-supervised clustering
    Cai, Jianghui
    Hao, Jing
    Yang, Haifeng
    Zhao, Xujun
    Yang, Yuqing
    [J]. INFORMATION SCIENCES, 2023, 632 : 164 - 200
  • [3] Semi-supervised clustering of large data sets with kernel methods
    Fausser, Stefan
    Schwenker, Friedhelm
    [J]. PATTERN RECOGNITION LETTERS, 2014, 37 : 78 - 84
  • [4] Semi-Supervised Clustering for Architectural Modularisation
    Feist, Sofia
    Sanhudo, Luis
    Esteves, Vitor
    Pires, Miguel
    Costa, Antonio Aguiar
    [J]. BUILDINGS, 2022, 12 (03)
  • [5] Semi-supervised clustering with soft labels
    Nebu, Cynthia Marea
    Joseph, Sumy
    [J]. 2015 INTERNATIONAL CONFERENCE ON CONTROL COMMUNICATION & COMPUTING INDIA (ICCC), 2015, : 612 - 616
  • [6] Spectral clustering: A semi-supervised approach
    Chen, Weifu
    Feng, Guocan
    [J]. NEUROCOMPUTING, 2012, 77 (01) : 229 - 242
  • [7] Research Progress on Semi-Supervised Clustering
    Yue Qin
    Shifei Ding
    Lijuan Wang
    Yanru Wang
    [J]. Cognitive Computation, 2019, 11 : 599 - 612
  • [8] Image Annotation with Semi-Supervised Clustering
    Sayar, Ahmet
    Yannan-Vural, Fatos T.
    [J]. 2008 IEEE 16TH SIGNAL PROCESSING, COMMUNICATION AND APPLICATIONS CONFERENCE, VOLS 1 AND 2, 2008, : 517 - 520
  • [9] Semi-supervised clustering of unknown expressions
    Jalal, Ahsan
    Tariq, Usman
    [J]. PATTERN RECOGNITION LETTERS, 2019, 120 : 46 - 53
  • [10] Composite kernels for semi-supervised clustering
    Domeniconi, Carlotta
    Peng, Jing
    Yan, Bojun
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2011, 28 (01) : 99 - 116