Stratified Feature Sampling for Semi-Supervised Ensemble Clustering

Cited by: 3
Authors
Tian, Jialin [1 ]
Ren, Yazhou [1 ]
Cheng, Xiang [2 ]
Affiliations
[1] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 611731, Sichuan, Peoples R China
[2] Virginia Tech, Dept Comp Sci, Blacksburg, VA 24060 USA
Funding
National Natural Science Foundation of China; China Postdoctoral Science Foundation
Keywords
Constraint propagation; ensemble clustering; high dimensional data; semi-supervised learning; stratified feature sampling;
DOI
10.1109/ACCESS.2019.2939581
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Ensemble Clustering (EC), which seeks to generate a consensus clustering by integrating multiple base clusterings, has attracted increasing attention. However, traditional EC methods typically have three main limitations: (1) High-dimensional data pose a major challenge to ensemble clustering methods. (2) Most EC algorithms cannot use prior information, e.g., pairwise constraints, to improve clustering performance. (3) Even in existing semi-supervised ensemble clustering methods, prior information is underused, e.g., it is used only when generating base clusterings. To alleviate these problems, we propose Stratified Feature Sampling for Semi-Supervised Ensemble Clustering (SFS³EC). First, we develop a novel stratified feature sampling method, which can cope with high-dimensional data, guarantee the diversity of base clusterings, and, at the same time, reduce the risk that some features are not selected. Second, semi-supervised clustering, i.e., constraint propagation, is applied to obtain base clusterings. Finally, we propose to use prior information in both the base clustering generation process and the consensus process, which ensures that prior information is sufficiently used. We conduct a series of experiments on ten real-world data sets to demonstrate the effectiveness of the proposed model.
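The abstract does not spell out how the stratified feature sampling works, so the following Python sketch is only one plausible reading, not the authors' procedure. It assumes that strata are random, near-equal partitions of the feature indices and that each base clustering draws a fixed number of features from every stratum, cycling through a stratum before reusing any of its features. The function name `stratified_feature_subsets` and all of its parameters are hypothetical. Constraint propagation and the consensus step are not shown.

```python
import random


def stratified_feature_subsets(n_features, n_strata, n_subsets, per_stratum, seed=0):
    """Return n_subsets sorted lists of feature indices.

    Hypothetical helper: each subset takes per_stratum features from every stratum.
    """
    rng = random.Random(seed)
    indices = list(range(n_features))
    rng.shuffle(indices)
    # Assumption: strata are random, near-equal partitions of the features.
    strata = [indices[i::n_strata] for i in range(n_strata)]
    if per_stratum > min(len(s) for s in strata):
        raise ValueError("per_stratum exceeds the smallest stratum size")

    # One pool per stratum holding the features not yet used in the current pass.
    pools = [[] for _ in strata]
    subsets = []
    for _ in range(n_subsets):
        subset = []
        for stratum, pool in zip(strata, pools):
            chosen = []
            while len(chosen) < per_stratum:
                if not pool:
                    # Start a new pass over the stratum, skipping features
                    # already chosen for this subset to avoid duplicates.
                    pool.extend(f for f in stratum if f not in chosen)
                    rng.shuffle(pool)
                chosen.append(pool.pop())
            subset.extend(chosen)
        subsets.append(sorted(subset))
    return subsets


if __name__ == "__main__":
    # 20 features, 4 strata of 5 features each, 5 subsets with 2 features per stratum.
    for s in stratified_feature_subsets(20, 4, 5, 2, seed=1):
        print(s)
```

Under these assumptions, every subset spans all strata, different subsets differ (which supports diversity among base clusterings), and every feature is used at least once as soon as the number of subsets times `per_stratum` reaches the size of the largest stratum.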
Pages: 128669-128675
Page count: 7
Related papers
50 in total
  • [21] Feature selection and semi-supervised clustering using multiobjective optimization
    Saha, Sriparna
    Ekbal, Asif
    Alok, Abhay Kumar
    Spandana, Rachamadugu
    SPRINGERPLUS, 2014, 3
  • [22] Semi-supervised affinity propagation clustering algorithm based on stratified combination
    Zhang, Z. (zhangzhen2096@163.com), 2013, Science Press (35):
  • [23] Semi-supervised sentiment classification based on sentiment feature clustering
    Li, Suke
    Jiang, Yanbing
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2013, 50 (12): : 2570 - 2577
  • [24] Semi-Supervised Clustering Algorithm Based on Deep Feature Mapping
    Xu, Xiong
    Zhou, Chun
    Wang, Chenggang
    Zhang, Xiaoyan
    Meng, Hua
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2023, 37 (01): : 815 - 831
  • [25] A Feature Space Learning Model Based on Semi-Supervised Clustering
    Guan, Renchu
    Wang, Xu
    Marchese, Maurizio
    Liang, Yanchun
    Yang, Chen
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (CSE) AND IEEE/IFIP INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (EUC), VOL 1, 2017, : 403 - 409
  • [26] Feature Selection and Semi-supervised Clustering Using Multiobjective Optimization
    Alok, Abhay Kumar
    Saha, Sriparna
    Ekbal, Asif
    2014 INTERNATIONAL CONFERENCE ON SOFT COMPUTING & MACHINE INTELLIGENCE ISCMI 2014, 2014, : 126 - 129
  • [27] SEMI-SUPERVISED ENSEMBLE TRACKING
    Liu, Huaping
    Sun, Fuchun
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 1645 - +
  • [28] Eigenvectors selection for spectral clustering based on semi-supervised selective ensemble
    Wang, X. (wangxingliang0911@163.com), 1600, Binary Information Press (10):
  • [29] A Kernel Probabilistic Model for Semi-supervised Co-clustering Ensemble
    Zhang, Yinghui
    JOURNAL OF INTELLIGENT SYSTEMS, 2020, 29 (01) : 143 - 153
  • [30] Semi-Supervised Selective Affinity Propagation Ensemble Clustering With Active Constraints
    Lei, Qi
    Li, Ting
    IEEE ACCESS, 2020, 8 : 46255 - 46266