Determination of the minimum sample size in microarray experiments to cluster genes using K-means clustering

被引:0
|
作者
Wu, FX [1 ]
Zhang, WJ [1 ]
Kusalik, AJ [1 ]
机构
[1] Univ Saskatchewan, Div Biomed Engn, Saskatoon, SK S7N 5A9, Canada
关键词
D O I
暂无
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Gene expression profiles obtained from time-series microarray experiments can reveal important information about biological processes. However, conducting such experiments is costly and time consuming. The cost and time required are linearly proportional to sample size. Therefore, it is worthwhile to provide a way to determine the minimal number of samples or trials required in a microarray experiment. One of the uses of microarray hybridization experiments is to group together genes with similar patterns of the expression using clustering techniques. In this paper, the k -means clustering technique is used. The basic idea of our approach is an incremental process in which testing, analysis and evaluation are integrated and iterated. The process is terminated when the evaluation of the results of two consecutive experiments shows they are sufficiently close. Two measures of "closeness" are proposed and two real microarray datasets are used to validate our approach. The results show that the sample size required to cluster genes in these two datasets can be reduced; i.e. the same results can be achieved with less cost. The approach can be used with other clustering techniques as well.
引用
收藏
页码:401 / 406
页数:6
相关论文
共 50 条
  • [1] Algorithm for the k-means clustering based on minimum cluster size
    Wang, Shou-Qiang
    Zhu, Da-Ming
    Tongxin Xuebao/Journal on Communications, 2010, 31 (07): : 46 - 52
  • [2] Local Search Algorithm for k-means Clustering Based on Minimum Sub-cluster Size
    Wang, Shouqiang
    Wang, Xiaomei
    PROCEEDINGS OF THE 2009 CHINESE CONFERENCE ON PATTERN RECOGNITION AND THE FIRST CJK JOINT WORKSHOP ON PATTERN RECOGNITION, VOLS 1 AND 2, 2009, : 1 - +
  • [3] Data Clustering with Cluster Size Constraints Using a Modified k-means Algorithm
    Ganganath, Nuwan
    Cheng, Chi-Tsun
    Tse, Chi K.
    2014 INTERNATIONAL CONFERENCE ON CYBER-ENABLED DISTRIBUTED COMPUTING AND KNOWLEDGE DISCOVERY (CYBERC), 2014, : 158 - 161
  • [4] Using K-Means Clustering to Cluster Provinces in Indonesia
    Ahmar, Ansari Saleh
    Napitupulu, Darmawan
    Rahim, Robbi
    Hidayat, Rahmat
    Sonatha, Yance
    Azmi, Meri
    2ND INTERNATIONAL CONFERENCE ON STATISTICS, MATHEMATICS, TEACHING, AND RESEARCH 2017, 2018, 1028
  • [5] Minimum entropy, k-means, spectral clustering
    Lee, Y
    Choi, S
    2004 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2004, : 117 - 122
  • [6] Fuzzy K-Means Clustering on Infrasound Sample
    Wang, Wei
    Wei, Shimin
    Liao, Qizheng
    Xia, Yaqin
    Li, Danlin
    Li, Junzi
    2008 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-5, 2008, : 756 - +
  • [7] Effect of cluster size distribution on clustering: a comparative study of k-means and fuzzy c-means clustering
    Kaile Zhou
    Shanlin Yang
    Pattern Analysis and Applications, 2020, 23 : 455 - 466
  • [8] Effect of cluster size distribution on clustering: a comparative study of k-means and fuzzy c-means clustering
    Zhou, Kaile
    Yang, Shanlin
    PATTERN ANALYSIS AND APPLICATIONS, 2020, 23 (01) : 455 - 466
  • [9] Determination of Rice Quality Using the K-Means Clustering Method
    Fahlevi, Muhammad Reza
    Putri, Dini Ridha Dwiki
    Putri, Fetty Ade
    Rahman, Maulia
    Sipahutar, Lahmudin
    Muhatri
    PROCEEDINGS OF ICORIS 2020: 2020 THE 2ND INTERNATIONAL CONFERENCE ON CYBERNETICS AND INTELLIGENT SYSTEM (ICORIS), 2020, : 174 - 179
  • [10] Clustering microarray data using model-based double K-means
    Martella, Francesca
    Vichi, Maurizio
    JOURNAL OF APPLIED STATISTICS, 2012, 39 (09) : 1853 - 1869