SMIP an incremental and parallel clustering algorithm based on statistics and morphology

被引:0
|
作者
Qiang, Zhang [1 ]
Zheng, Zhao [1 ]
Shu, Yantai [1 ]
机构
[1] Tianjin Univ, Sch Elect Informat Engn, Tianjin 300072, Peoples R China
关键词
mathematics morphology; incremental clustering; distribution parallel clustering; statistics method;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a new clustering algorithm called SMIP. It uses a statistics method to obtain the clustering parameters automatically. Mathematics morphology theory is introduced into clustering to acquire high speed and accuracy. Based on it, we realize incremental clustering and distribution parallel clustering. Our incremental clustering can yield significant speed-up factors for new coming data in an already processed database. Our distribution parallel clustering can be run on a number of workstations connected via network. It is robust and efficient with low overhead We realized SMIP by JAVA language. The tests show that SAEP is very efficient with a complexity of O(N), N being the number of points in databases; it is much faster than DBSCAN; it is effective in discovering clusters of arbitrary shape; it is not sensitive to noise; It has some ability to deal with high dimensional points; incremental clustering can speed up the process over 30 times than complete re-clustering; the total overhead of parallel clustering on four workstations is below 13% SMIP is an ideal clustering method for very large databases.
引用
收藏
页码:430 / +
页数:2
相关论文
共 50 条
  • [41] A New Incremental Pairwise Clustering Algorithm
    Seo, Sambu
    Mohr, Johannes
    Obermayer, Klaus
    [J]. EIGHTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2009, : 223 - 228
  • [42] An Incremental Algorithm for Clustering Search Results
    Liu, Yongli
    Ouyang, Yuanxin
    Sheng, Hao
    Xiong, Zhang
    [J]. SITIS 2008: 4TH INTERNATIONAL CONFERENCE ON SIGNAL IMAGE TECHNOLOGY AND INTERNET BASED SYSTEMS, PROCEEDINGS, 2008, : 112 - 117
  • [43] An efficient parallel direction-based clustering algorithm
    Zhong, Kai
    Zhou, Xu
    Zhou, Liqian
    Yang, Zhibang
    Liu, Chubo
    Xiao, Na
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2020, 145 : 24 - 33
  • [44] Parallel Diffrential Evolution Clustering Algorithm based on MapReduce
    Daoudi, Meroua
    Hamena, Soumiya
    Benmounah, Zakaria
    Batouche, Mohamed
    [J]. 2014 6TH INTERNATIONAL CONFERENCE OF SOFT COMPUTING AND PATTERN RECOGNITION (SOCPAR), 2014, : 337 - 341
  • [45] A parallel text document clustering algorithm based on neighbors
    Yanjun Li
    Congnan Luo
    Soon M. Chung
    [J]. Cluster Computing, 2015, 18 : 933 - 948
  • [46] A parallel Clustering algorithm implementation based on Apache Mahout
    Xia Daoping
    Zhong Alin
    Long Yubo
    [J]. PROCEEDINGS OF 2016 SIXTH INTERNATIONAL CONFERENCE ON INSTRUMENTATION & MEASUREMENT, COMPUTER, COMMUNICATION AND CONTROL (IMCCC 2016), 2016, : 790 - 795
  • [47] PSubCLUS: A Parallel Subspace Clustering Algorithm Based On Spark
    Wen, Xiao
    Juan, Hu
    [J]. IEEE ACCESS, 2021, 9 : 2535 - 2544
  • [48] A parallel text document clustering algorithm based on neighbors
    Li, Yanjun
    Luo, Congnan
    Chung, Soon M.
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2015, 18 (02): : 933 - 948
  • [49] A Binary Morphology-Based Clustering Algorithm Directed by Genetic Algorithm
    Pedrino, E. C.
    Nicoletti, M. C.
    Saito, J. H.
    Cura, L. M. V.
    Roda, V. O.
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2013), 2013, : 409 - 414
  • [50] REPRESENTATIVE POINTS AND CLUSTER ATTRIBUTES BASED INCREMENTAL SEQUENCE CLUSTERING ALGORITHM
    Wu, Di
    Ren, Jiadong
    [J]. COMPUTING AND INFORMATICS, 2017, 36 (06) : 1361 - 1384