Semi-Supervised Clustering with Partial Background Information

Authors
Gao, Jing [1]
Tan, Pang-Ning [1]
Cheng, Haibin [1]
Affiliations
[1] Michigan State Univ, E Lansing, MI 48824 USA
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Incorporating background knowledge into unsupervised clustering algorithms has been the subject of extensive research in recent years. Nevertheless, existing algorithms implicitly assume that the background information, typically specified in the form of labeled examples or pairwise constraints, has the same feature space as the unlabeled data to be clustered. In this paper, we address the new problem of incorporating partial background knowledge into clustering, where the features of the labeled examples only moderately overlap with those of the unlabeled data. We formulate this as a constrained optimization problem and propose two learning algorithms to solve it, one based on hard clustering and one on fuzzy clustering. An empirical study performed on a variety of real data sets shows that our proposed algorithms improve the quality of clustering results with limited labeled examples.
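
The setting described in the abstract can be illustrated with a small sketch. This is not the authors' algorithm, whose details are not given here. It is a penalized k-means variant, assumed purely for illustration, in which labeled examples are observed only on a subset of the columns of the unlabeled data. The function name `partial_seeded_kmeans`, the parameters `shared` and `lam`, and the toy data are all hypothetical.

```python
import numpy as np

def partial_seeded_kmeans(X, shared, class_means, lam=1.0, n_iter=20):
    """Hard clustering of X guided by class means known only on `shared` columns.

    Illustrative objective (an assumption, not taken from the paper):
      sum_i ||x_i - c_{z_i}||^2 + lam * sum_k ||c_k[shared] - class_means[k]||^2
    Requires lam > 0 so that empty clusters remain well defined.
    """
    X = np.asarray(X, dtype=float)
    class_means = np.asarray(class_means, dtype=float)
    k = class_means.shape[0]
    # Start every centroid at the global mean, then overwrite the shared
    # coordinates with the labeled class means.
    centroids = np.tile(X.mean(axis=0), (k, 1))
    centroids[:, shared] = class_means
    for _ in range(n_iter):
        # Assignment step: nearest centroid in the full feature space.
        dists = ((X[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
        labels = dists.argmin(axis=1)
        # Update step: closed-form minimizer of the penalized objective.
        for j in range(k):
            members = X[labels == j]
            n_j = len(members)
            if n_j > 0:
                centroids[j] = members.mean(axis=0)
            # Shared coordinates are pulled toward the labeled class mean.
            total = members[:, shared].sum(axis=0)
            centroids[j, shared] = (total + lam * class_means[j]) / (n_j + lam)
    return labels, centroids

# Toy data: two well-separated groups in three features (hypothetical).
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0.0, 0.3, (50, 3)), rng.normal(5.0, 0.3, (50, 3))])
# Labeled examples are observed only on columns 0 and 1 of X.
labeled = np.array([[0.1, -0.1], [5.2, 4.9], [0.0, 0.2], [4.8, 5.1]])
y = np.array([0, 1, 0, 1])
class_means = np.vstack([labeled[y == c].mean(axis=0) for c in (0, 1)])
labels, centroids = partial_seeded_kmeans(X, shared=[0, 1], class_means=class_means)
```

In this sketch, the weight `lam` controls how strongly the shared coordinates of each centroid are tied to the labeled class means. A fuzzy variant would replace the hard assignment step with soft membership weights.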
Pages: 489-493
Page count: 5