Semi-supervised model-based clustering with positive and negative constraints

被引:0
|
作者
Volodymyr Melnykov
Igor Melnykov
Semhar Michael
机构
[1] University of Alabama,Department of Information Systems, Statistics, and Management Science
[2] Colorado State University-Pueblo,Department of Mathematics and Physics
关键词
Semi-supervised clustering; Model-based clustering ; Finite mixture models; Positive and negative constraints; BIC; 62H30;
D O I
暂无
中图分类号
学科分类号
摘要
Cluster analysis is a popular technique in statistics and computer science with the objective of grouping similar observations in relatively distinct groups generally known as clusters. Semi-supervised clustering assumes that some additional information about group memberships is available. Under the most frequently considered scenario, labels are known for some portion of data and unavailable for the rest of observations. In this paper, we discuss a general type of semi-supervised clustering defined by so called positive and negative constraints. Under positive constraints, some data points are required to belong to the same cluster. On the contrary, negative constraints specify that particular points must represent different data groups. We outline a general framework for semi-supervised clustering with constraints naturally incorporating the additional information into the EM algorithm traditionally used in mixture modeling and model-based clustering. The developed methodology is illustrated on synthetic and classification datasets. A dendrochronology application is considered and thoroughly discussed.
引用
收藏
页码:327 / 349
页数:22
相关论文
共 50 条
  • [1] Semi-supervised model-based clustering with positive and negative constraints
    Melnykov, Volodymyr
    Melnykov, Igor
    Michael, Semhar
    [J]. ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2016, 10 (03) : 327 - 349
  • [2] Semi-supervised projected model-based clustering
    Guerra, Luis
    Bielza, Concha
    Robles, Victor
    Larranaga, Pedro
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2014, 28 (04) : 882 - 917
  • [3] Semi-supervised projected model-based clustering
    Luis Guerra
    Concha Bielza
    Víctor Robles
    Pedro Larrañaga
    [J]. Data Mining and Knowledge Discovery, 2014, 28 : 882 - 917
  • [4] Semi-supervised model-based document clustering: A comparative study
    Zhong, Shi
    [J]. MACHINE LEARNING, 2006, 65 (01) : 3 - 29
  • [5] Semi-supervised model-based document clustering: A comparative study
    Shi Zhong
    [J]. Machine Learning, 2006, 65 : 3 - 29
  • [6] Semi-supervised model-based clustering with controlled clusters leakage
    Smieja, Marek
    Struski, Lukasz
    Tabor, Jacek
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2017, 85 : 146 - 157
  • [7] Semi-Supervised Clustering Based on Exemplars Constraints
    Wang, Sailan
    Yang, Zhenzhi
    Yang, Jin
    Wang, Hongjun
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2017, E100D (06) : 1231 - 1241
  • [8] Classifying GABAergic interneurons with semi-supervised projected model-based clustering
    Mihaljevic, Bojan
    Benavides-Piccione, Ruth
    Guerra, Luis
    DeFelipe, Javier
    Larranaga, Pedro
    Bielza, Concha
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, 2015, 65 (01) : 49 - 59
  • [9] Research of semi-supervised spectral clustering based on constraints expansion
    Ding, Shifei
    Qi, Bingjuan
    Jia, Hongjie
    Zhu, Hong
    Zhang, Liwen
    [J]. NEURAL COMPUTING & APPLICATIONS, 2013, 22 : S405 - S410
  • [10] Research of semi-supervised spectral clustering based on constraints expansion
    Shifei Ding
    Bingjuan Qi
    Hongjie Jia
    Hong Zhu
    Liwen Zhang
    [J]. Neural Computing and Applications, 2013, 22 : 405 - 410