Assessment of microarray data clustering results based on a new geometrical index for cluster validity

被引:11
|
作者
Lam, Benson S. Y.
Yan, Hong
机构
[1] City Univ Hong Kong, Dept Elect Engn, Kowloon, Hong Kong, Peoples R China
[2] Univ Sydney, Sch Elect & Informat Engn, Sydney, NSW 2006, Australia
关键词
cluster validity; clustering; data classification;
D O I
10.1007/s00500-006-0087-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A measurement of cluster quality is often needed for DNA microarray data analysis. In this paper, we introduce a new cluster validity index, which measures geometrical features of the data. The essential concept of this index is to evaluate the ratio between the squared total length of the data eigen-axes with respect to the between-cluster separation. We show that this cluster validity index works well for data that contain clusters closely distributed or with different sizes. We verify the method using three simulated data sets, two real world data sets and two microarray data sets. The experiment results show that the proposed index is superior to five other cluster validity indices, including partition coefficients (PC), General silhouette index (GS), Dunn's index (DI), CH Index and I-Index. Also, we have given a theorem to show for what situations the proposed index works well.
引用
收藏
页码:341 / 348
页数:8
相关论文
共 50 条
  • [31] A new efficient fuzzy cluster validity index: Application to images clustering
    Haouas, Fatma
    Ben Dhiaf, Zouhour
    Hammouda, Atef
    Solaiman, Basel
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2017,
  • [32] A new clustering algorithm based on cluster validity indices
    Kim, M
    Ramakrishna, RS
    [J]. DISCOVERY SCIENCE, PROCEEDINGS, 2004, 3245 : 322 - 329
  • [33] Cluster validity index for adaptive clustering algorithms
    Cui, Hongyan
    Xie, Mingzhi
    Cai, Yunlong
    Huang, Xu
    Liu, Yunjie
    [J]. IET COMMUNICATIONS, 2014, 8 (13) : 2256 - 2263
  • [34] Improved cluster validity index for fuzzy clustering
    Kwon, Soon Hak
    Kim, Jihong
    Son, Seo Ho
    [J]. ELECTRONICS LETTERS, 2021, 57 (21) : 792 - 794
  • [35] A novel cluster validity index for fuzzy clustering based on bipartite modularity
    Zhang, Dawei
    Ji, Min
    Yang, Jun
    Zhang, Yong
    Xie, Fuding
    [J]. FUZZY SETS AND SYSTEMS, 2014, 253 : 122 - 137
  • [36] An effective partitional clustering algorithm based on new clustering validity index
    Zhu, Erzhou
    Ma, Ruhui
    [J]. APPLIED SOFT COMPUTING, 2018, 71 : 608 - 621
  • [37] Determination of cluster number in clustering microarray data
    Shen, JD
    Chang, SI
    Lee, ES
    Deng, YP
    Brown, SJ
    [J]. APPLIED MATHEMATICS AND COMPUTATION, 2005, 169 (02) : 1172 - 1185
  • [38] Cluster Validity Measures Based Agglomerative Hierarchical Clustering for Network Data
    Hamasuna, Yukihiro
    Nakano, Shusuke
    Ozaki, Ryo
    Endo, Yasunori
    [J]. JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2019, 23 (03) : 577 - 583
  • [39] A Data Clustering Tool with Cluster Validity Indices
    Qiao, Haiyan
    Edwards, Brandon
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTING, ENGINEERING AND INFORMATION, 2009, : 303 - 309
  • [40] A new validity index for fuzzy clustering
    Zhang, Fangfang
    Qian, Xuezhong
    [J]. Journal of Computational Information Systems, 2012, 8 (14): : 5875 - 5883