Probability of large-scale data set EM clustering algorithms based on partial information constraints

被引:0
|
作者
Liu, Xiaoyan [1 ]
机构
[1] Changchun Univ Sci & Technol, Changchun 130600, Jilin Province, Peoples R China
关键词
Some constraint information; Clustering; The data set; The clustering quality; The probability of clustering algorithm;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The current situation, the need for clustering of data is very large, and the use of traditional algorithm for clustering process often tedious and time consuming is very long, the effect is not obvious. Based on this, this paper proposes a data sets EM probability based on some constraint information clustering algorithm, the detailed implementation process of the whole algorithm is described. Through experiment contrast scalable EM, positive_PC_SEM and full_PC_SEM clustering quality and efficiency of execution of the algorithm, the results show that the positive_PC_SEM algorithm and scalable EM algorithm compared to the clustering quality and efficiency is higher, although full_PC_SEM clustering quality is very high, but requires a lot of time.
引用
收藏
页码:1748 / 1751
页数:4
相关论文
共 50 条
  • [31] Large-scale clustering of CAGE tag expression data
    Kazuro Shimokawa
    Yuko Okamura-Oho
    Takio Kurita
    Martin C Frith
    Jun Kawai
    Piero Carninci
    Yoshihide Hayashizaki
    BMC Bioinformatics, 8
  • [32] Boosting Algorithms for Large-Scale Data and Data Batch Stream
    Yoon, Young Joo
    KOREAN JOURNAL OF APPLIED STATISTICS, 2010, 23 (01) : 197 - 206
  • [33] Large-Scale 802.11 Wireless Networks Data Analysis Based on Graph Clustering
    Capdehourat, German
    Bermolen, Paola
    Fiori, Marcelo
    Frevenza, Nicolas
    Larroca, Federico
    Morales, Gaston
    Rattaro, Claudina
    Zunino, Gianina
    WIRELESS PERSONAL COMMUNICATIONS, 2021, 120 (02) : 1791 - 1819
  • [34] An improved clustering method for large-scale data based on artificial immune system
    Li, Zhonghua
    Tan, Hongzhou
    Yan, Xiaoke
    DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES B-APPLICATIONS & ALGORITHMS, 2006, 13 : 920 - 924
  • [35] MapReduce-based Dragonfly Algorithm for large-scale Data-Clustering
    Tripathi, Ashish Kumar
    Saxena, Pranav
    Gupta, Siddharth
    2019 FIFTH INTERNATIONAL CONFERENCE ON IMAGE INFORMATION PROCESSING (ICIIP 2019), 2019, : 171 - 175
  • [36] A MapReduce-based artificial bee colony for large-scale data clustering
    Banharnsakun, Anan
    PATTERN RECOGNITION LETTERS, 2017, 93 : 78 - 84
  • [37] Large-Scale 802.11 Wireless Networks Data Analysis Based on Graph Clustering
    Germán Capdehourat
    Paola Bermolen
    Marcelo Fiori
    Nicolás Frevenza
    Federico Larroca
    Gastón Morales
    Claudina Rattaro
    Gianina Zunino
    Wireless Personal Communications, 2021, 120 : 1791 - 1819
  • [38] Large-Scale Data Clustering Algorithm Based on Quantum Immune Regulation Network
    Li, Yangyang
    Bai, Xiaoyu
    Hou, Xiaoju
    Jiao, Licheng
    2017 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2017,
  • [39] A Sampling-Based Density Peaks Clustering Algorithm for Large-Scale Data
    Ding, Shifei
    Li, Chao
    Xu, Xiao
    Ding, Ling
    Zhang, Jian
    Guo, Lili
    Shi, Tianhao
    PATTERN RECOGNITION, 2023, 136
  • [40] Analysis of large-scale power quality monitoring data based on quantum clustering
    Zhong, Qing
    Liang, Jiahao
    Xu, Zhong
    Meyer, Jan
    Wang, Longjun
    Wang, Gang
    ELECTRIC POWER SYSTEMS RESEARCH, 2023, 220