PFClust: an optimised implementation of a parameter-free clustering algorithm

被引:7
|
作者
Musayeva, Khadija [1 ]
Henderson, Tristan [1 ]
Mitchell, John Bo [2 ]
Mavridis, Lazaros [2 ]
机构
[1] Univ St Andrews, Sch Comp Sci, St Andrews KY16 9SX, Fife, Scotland
[2] Univ St Andrews, EaStCHEM Sch Chem & Biomed Sci Res Complex, North Haugh, St Andrews KY16 9ST, Scotland
来源
关键词
Clustering; Cluster analysis; Number of clusters;
D O I
10.1186/1751-0473-9-5
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: A well-known problem in cluster analysis is finding an optimal number of clusters reflecting the inherent structure of the data. PFClust is a partitioning-based clustering algorithm capable, unlike many widely-used clustering algorithms, of automatically proposing an optimal number of clusters for the data. Results: The results of tests on various types of data showed that PFClust can discover clusters of arbitrary shapes, sizes and densities. The previous implementation of the algorithm had already been successfully used to cluster large macromolecular structures and small druglike compounds. We have greatly improved the algorithm by a more efficient implementation, which enables PFClust to process large data sets acceptably fast. Conclusions: In this paper we present a new optimized implementation of the PFClust algorithm that runs considerably faster than the original.
引用
收藏
页数:4
相关论文
共 50 条
  • [1] PFClust: a novel parameter free clustering algorithm
    Mavridis, Lazaros
    Nath, Neetika
    Mitchell, John B. O.
    [J]. BMC BIOINFORMATICS, 2013, 14
  • [2] PFClust: a novel parameter free clustering algorithm
    Lazaros Mavridis
    Neetika Nath
    John BO Mitchell
    [J]. BMC Bioinformatics, 14
  • [3] A Parameter-Free Clustering Algorithm Based on Density Model
    Mu, Jun
    Fei, Hongxiao
    Dong, Xin
    [J]. PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE FOR YOUNG COMPUTER SCIENTISTS, VOLS 1-5, 2008, : 1825 - 1831
  • [4] DSets-DBSCAN: A Parameter-Free Clustering Algorithm
    Hou, Jian
    Gao, Huijun
    Li, Xuelong
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (07) : 3182 - 3193
  • [5] PARAMETER-FREE CLUSTERING MODEL
    GITMAN, I
    [J]. PATTERN RECOGNITION, 1972, 4 (03) : 307 - &
  • [6] A Hybrid and Parameter-Free Clustering Algorithm for Large Data Sets
    Shao, Hengkang
    Zhang, Ping
    Chen, Xinye
    Li, Fang
    Du, Guanglong
    [J]. IEEE ACCESS, 2019, 7 : 24806 - 24818
  • [7] A Parameter-free Clustering Algorithm based K-means
    Slaoui, Said
    Dafir, Zineb
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (03) : 612 - 619
  • [8] A Parameter-Free Hybrid Clustering algorithm used for Malware Categorization
    Han, ZhiXue
    Feng, Shaorong
    Ye, Yanfang
    Jiang, Qingshan
    [J]. PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON ANTI-COUNTERFEITING, SECURITY, AND IDENTIFICATION IN COMMUNICATION, 2009, : 480 - +
  • [9] A parameter-free affinity based clustering
    Mukhoty, Bhaskar
    Gupta, Ruchir
    Lakshmanan, K.
    Kumar, Mayank
    [J]. APPLIED INTELLIGENCE, 2020, 50 (12) : 4543 - 4556
  • [10] Parameter-Free Minimum Spanning Tree (PFMST) Based Clustering Algorithm
    Raju, B. H. V. S. Ramakrisbnam
    Kumari, V. Valli
    [J]. ADVANCES IN PARALLEL, DISTRIBUTED COMPUTING, 2011, 203 : 552 - +