Probability of large-scale data set EM clustering algorithms based on partial information constraints

被引:0
|
作者
Liu, Xiaoyan [1 ]
机构
[1] Changchun Univ Sci & Technol, Changchun 130600, Jilin Province, Peoples R China
关键词
Some constraint information; Clustering; The data set; The clustering quality; The probability of clustering algorithm;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The current situation, the need for clustering of data is very large, and the use of traditional algorithm for clustering process often tedious and time consuming is very long, the effect is not obvious. Based on this, this paper proposes a data sets EM probability based on some constraint information clustering algorithm, the detailed implementation process of the whole algorithm is described. Through experiment contrast scalable EM, positive_PC_SEM and full_PC_SEM clustering quality and efficiency of execution of the algorithm, the results show that the positive_PC_SEM algorithm and scalable EM algorithm compared to the clustering quality and efficiency is higher, although full_PC_SEM clustering quality is very high, but requires a lot of time.
引用
收藏
页码:1748 / 1751
页数:4
相关论文
共 50 条
  • [1] A EM Probabilistic Clustering Algorithm for Large Scale Data Sets based on Partial Constraints Information
    Yan S.
    Shunlin S.
    Yuquan Z.
    Advances in Information Sciences and Service Sciences, 2011, 3 (10): : 20 - 29
  • [2] Affinity propagation clustering algorithm based on large-scale data-set
    Wang L.
    Zheng K.
    Tao X.
    Han X.
    International Journal of Computers and Applications, 2018, 40 (03) : 1 - 6
  • [3] Large-scale spectral clustering based on pairwise constraints
    Semertzidis, T.
    Rafailidis, D.
    Strintzis, M. G.
    Daras, P.
    INFORMATION PROCESSING & MANAGEMENT, 2015, 51 (05) : 616 - 624
  • [4] A study of large-scale data clustering based on fuzzy clustering
    Li, Yangyang
    Yang, Guoli
    He, Haiyang
    Jiao, Licheng
    Shang, Ronghua
    SOFT COMPUTING, 2016, 20 (08) : 3231 - 3242
  • [5] A study of large-scale data clustering based on fuzzy clustering
    Yangyang Li
    Guoli Yang
    Haiyang He
    Licheng Jiao
    Ronghua Shang
    Soft Computing, 2016, 20 : 3231 - 3242
  • [6] Integrating Large-Scale Soft Data by Simulated Annealing and Probability Constraints
    C. V. Deutsch
    X. H. Wen
    Mathematical Geology, 2000, 32 : 49 - 67
  • [7] Integrating large-scale soft data by simulated annealing and probability constraints
    Deutsch, CV
    Wen, XH
    MATHEMATICAL GEOLOGY, 2000, 32 (01): : 49 - 67
  • [8] PROBABILITY FUNCTIONS AND SYSTEMATICS OF LARGE-SCALE CLUSTERING
    MO, HJ
    INTERNATIONAL JOURNAL OF MODERN PHYSICS A, 1988, 3 (06): : 1373 - 1383
  • [9] Large-Scale Information Extraction from Emails with Data Constraints
    Gupta, Rajeev
    Kondapally, Ranganath
    Guha, Siddharth
    BIG DATA ANALYTICS (BDA 2019), 2019, 11932 : 124 - 139
  • [10] Genetic algorithms for large-scale clustering problems
    Franti, P
    Kivijarvi, J
    Kaukoranta, T
    Nevalainen, O
    COMPUTER JOURNAL, 1997, 40 (09): : 547 - 554