An efficient semi-supervised graph based clustering

Cited by: 7
Author
Viet-Vu Vu [1]
Affiliation
[1] Vietnam Natl Univ, Informat Technol Inst, 144 Xuan Thuy St, Hanoi, Vietnam
Keywords
Semi-supervised clustering; seed; k-nearest neighbors graph; ALGORITHM; SELECTION; NEIGHBORS
DOI
10.3233/IDA-163296
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification code
081104; 0812; 0835; 1405
Abstract
Clustering is one of the most important tools in data mining and knowledge discovery from data. In recent years, semi-supervised clustering, which integrates side information (seeds or constraints) into the clustering process, has become known as a good strategy for improving clustering results. In this article, a new semi-supervised graph-based clustering algorithm (SSGC) is presented. Using a k-nearest neighbors graph and a measure of local density as the similarity between vertices, SSGC integrates the seeds into the process of building clusters and hence can improve the quality of clustering. Moreover, SSGC can deal with noise and with data of differing densities, and it uses only one parameter (i.e., the number of nearest neighbors). Experiments conducted on real data sets from UCI show that our method can produce good clustering results compared with related techniques such as semi-supervised density-based clustering (SSDBSCAN). In addition, the computational cost of SSGC is much lower than that of SSDBSCAN.
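The abstract does not spell out the algorithm, so the following is only a rough sketch of the general idea (a k-nearest neighbors graph, a local-density style edge weight, and clusters grown from seeds), not the paper's actual procedure. It assumes the edge weight is the number of nearest neighbors two points share, and it assumes clusters are formed by merging along edges from strongest to weakest while refusing to join two groups that carry different seed labels. All function names are hypothetical.

```python
import math


def knn_sets(points, k):
    """For each point, return the set of indices of its k nearest neighbors."""
    result = []
    for i, p in enumerate(points):
        others = sorted(
            (j for j in range(len(points)) if j != i),
            key=lambda j: math.dist(p, points[j]),
        )
        result.append(set(others[:k]))
    return result


def shared_neighbor_graph(points, k):
    """Build an undirected k-NN graph as {(i, j): weight} with i < j.

    ASSUMPTION: the weight is the count of shared nearest neighbors,
    used here as a stand-in for a local-density similarity.
    """
    nn = knn_sets(points, k)
    edges = {}
    for i in range(len(points)):
        for j in nn[i]:
            edges[(min(i, j), max(i, j))] = len(nn[i] & nn[j])
    return edges


def seeded_clusters(points, seeds, k):
    """Grow clusters from seeds over the weighted k-NN graph.

    seeds maps a point index to its known label. Edges are processed from
    highest to lowest weight; two components are merged unless they already
    carry different seed labels (ASSUMED merging rule). Points that are
    never connected to a seed get the label None.
    """
    n = len(points)
    parent = list(range(n))
    label = {i: seeds.get(i) for i in range(n)}  # label stored at each root

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path compression
            x = parent[x]
        return x

    edges = shared_neighbor_graph(points, k)
    for (i, j), _weight in sorted(edges.items(), key=lambda e: -e[1]):
        ri, rj = find(i), find(j)
        if ri == rj:
            continue
        li, lj = label[ri], label[rj]
        if li is not None and lj is not None and li != lj:
            continue  # never merge groups seeded with different labels
        parent[rj] = ri
        label[ri] = li if li is not None else lj
    return [label[find(i)] for i in range(n)]
```

As in the abstract, the only tuning parameter in this sketch is `k`, the number of nearest neighbors; the brute-force neighbor search is quadratic and is meant for illustration only.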
Pages: 297-307
Number of pages: 11