Semi-Supervised Density Peaks Clustering Based on Constraint Projection

被引:3
|
作者
Yan, Shan [1 ]
Wang, Hongjun [1 ]
Li, Tianrui [1 ]
Chu, Jielei [1 ]
Guo, Jin [1 ]
机构
[1] Southwest Jiaotong Univ, Sch Informat Sci & Technol, Chengdu, Sichuan, Peoples R China
基金
中国国家自然科学基金;
关键词
Semi-supervised learning; Density peaks clustering; Pairwise constraint; Constraint projection; PAIRWISE CONSTRAINTS; FAST SEARCH; FIND; CLASSIFICATION; ALGORITHM;
D O I
10.2991/ijcis.d.201102.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clustering by fast searching and finding density peaks (DPC) method can rapidly identify the centers of clusters which have relatively high densities and high distances according to a decision graph. Various methods have been introduced to extend the DPC model over the past five years. DPC was originally presented as an unsupervised learning algorithm, and the thought of adding some prior information to DPC emerges as an alternative approach for improving its performance. It is extravagant to collect labeled data in real applications, and annotation of class labels is a nontrivial work, while pairwise constraint information is easier to get. Furthermore, the class label information can be converted into pairwise constraint information. Thus, we can take full advantage of pairwise constraints (or prior information) as much as possible. So this paper presents a new semi-supervised density peaks clustering algorithm (SSDPC) that uses constraint projection, which is flexible in loosening a few constraints over the learning stage. In the first stage, instances involving instance-level constraints and the remaining instances are concurrently projected to a lower dimensional data space led by the pairwise constraints, where viewing the distribution of data instances more clearly is available. Subsequently, traditional DPC is executed on the new lower dimensional dataset. Lastly, a few datasets from the Microsoft Research Asia Multimedia (MSRA-MM) image and UCI machine learning repository datasets are adopted in the experimental validation. The experimental results demonstrate that the proposed SSDPC achieves better performance than other three semi-supervised clustering algorithms. (C) 2021 The Authors. Published by Atlantis Press B.V.
引用
收藏
页码:140 / 147
页数:8
相关论文
共 50 条
  • [41] An efficient semi-supervised graph based clustering
    Viet-Vu Vu
    [J]. INTELLIGENT DATA ANALYSIS, 2018, 22 (02) : 297 - 307
  • [42] Semi-Supervised Clustering Based on Exemplars Constraints
    Wang, Sailan
    Yang, Zhenzhi
    Yang, Jin
    Wang, Hongjun
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2017, E100D (06) : 1231 - 1241
  • [43] Constraint Co-Projections for Semi-Supervised Co-Clustering
    Huang, Shudong
    Wang, Hongjun
    Li, Tao
    Yang, Yan
    Li, Tianrui
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2016, 46 (12) : 3047 - 3058
  • [44] Semi-supervised cross-entropy clustering with information bottleneck constraint
    Smieja, Marek
    Geiger, Bernhard C.
    [J]. INFORMATION SCIENCES, 2017, 421 : 254 - 271
  • [45] A unified view of density-based methods for semi-supervised clustering and classification
    Gertrudes, Jadson Castro
    Zimek, Arthur
    Sander, Jorg
    Campello, Ricardo J. G. B.
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2019, 33 (06) : 1894 - 1952
  • [46] A unified view of density-based methods for semi-supervised clustering and classification
    Jadson Castro Gertrudes
    Arthur Zimek
    Jörg Sander
    Ricardo J. G. B. Campello
    [J]. Data Mining and Knowledge Discovery, 2019, 33 : 1894 - 1952
  • [47] Semi-supervised clustering methods
    Bair, Eric
    [J]. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2013, 5 (05): : 349 - 361
  • [48] SEMI-SUPERVISED SPECTRAL CLUSTERING
    Mai, Xiaoyi
    Couillet, Romain
    [J]. 2018 CONFERENCE RECORD OF 52ND ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2018, : 2012 - 2016
  • [49] A review on semi-supervised clustering
    Cai, Jianghui
    Hao, Jing
    Yang, Haifeng
    Zhao, Xujun
    Yang, Yuqing
    [J]. INFORMATION SCIENCES, 2023, 632 : 164 - 200
  • [50] Semi-supervised Clustering Method for Multi-density Data
    Atwa, Walid
    Li, Kan
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2015, 2015, 9052 : 313 - 319