A data structure and function classification based method to evaluate clustering models for gene expression data

被引:0
|
作者
易东
杨梦苏
黄明辉
李辉智
王文昌
机构
[1] Applied Research Centre for Genomics Technology
[2] Department of Electronic Technology
[3] 83 Tat Chee Avenue
[4] Kowloon
[5] Department of Biology & Chemistry
[6] Department of Medical Statistics
[7] Chongqing 400031
[8] Southwest University of Politics and Law Science
[9] China
[10] Third Military Medical University
[11] City University of Hong Kong
[12] Chongqing 400038
关键词
gene expression; evaluation of clustering; adjust-; FOM; entropy;
D O I
暂无
中图分类号
R311 [医用数学];
学科分类号
1001 ;
摘要
Objective: To establish a systematic framework for selecting the best clustering algorithm and provide an evaluation method for clustering analyses of gene expression data. Methods: Based on data structure (internal information) and function classification (external information), the evaluation of gene expression data analyses were carried out by using 2 approaches. Firstly, to assess the predictive power of clustering algorithms, Entropy was introduced to measure the consistency between the clustering results from different algorithms and the known and validated functional classifications. Secondly, a modified method of figure of merit (adjust-FOM) was used as internal assessment method. In this method, one clustering algorithm was used to analyze all data but one experimental condition, the remaining condition was used to assess the predictive power of the resulting clusters. This method was applied on 3 gene expression data sets (2 from the Lyer’s Serum Data Sets, and 1 from the Ferea’s Saccharomyces
引用
收藏
页码:312 / 317
页数:6
相关论文
共 50 条
  • [41] A Sequential Gene Expression Data Bi-clustering Method: Clustering and Verification
    Zhang Yanjie
    Hu Zhanyi
    2009 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, VOL I, PROCEEDINGS, 2009, : 591 - +
  • [42] Consensus clustering of gene expression data and its application to gene function prediction
    Xiao, Guanghua
    Pan, Wei
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2007, 16 (03) : 733 - 751
  • [43] Event models for tumor classification with SAGE gene expression data
    Jin, Xin
    Xu, Anbang
    Zhao, Guoxing
    Ma, Jixin
    Bie, Rongfang
    COMPUTATIONAL SCIENCE - ICCS 2006, PT 2, PROCEEDINGS, 2006, 3992 : 775 - 782
  • [44] Latent structure models for the analysis of gene expression data
    Hua, D
    Chen, DC
    Cheng, XZ
    Youssef, A
    PROCEEDINGS OF THE 2003 IEEE BIOINFORMATICS CONFERENCE, 2003, : 496 - 499
  • [45] Covariance structure models for gene expression microarray data
    Xie, J
    Bentler, PM
    STRUCTURAL EQUATION MODELING-A MULTIDISCIPLINARY JOURNAL, 2003, 10 (04) : 566 - 582
  • [46] Analysis of a Gibbs sampler method for model-based clustering of gene expression data
    Joshi, Anagha
    Van de Peer, Yves
    Michoel, Tom
    BIOINFORMATICS, 2008, 24 (02) : 176 - 183
  • [47] Scaling Method for Batch Effect Correction of Gene Expression Data Based on Spectral Clustering
    Matsuda, Momo
    Ye, Xiucai
    Sakurai, Tetsuya
    CURRENT BIOINFORMATICS, 2021, 16 (04) : 505 - 511
  • [48] Selecting a classification function for class prediction with gene expression data
    Jong, Victor L.
    Novianti, Putri W.
    Roes, Kit C. B.
    Eijkemans, Marinus J. C.
    BIOINFORMATICS, 2016, 32 (12) : 1814 - 1822
  • [49] HMMGEP: clustering gene expression data using hidden Markov models
    Ji, XL
    Yuan, Y
    Li, YD
    Sun, ZR
    BIOINFORMATICS, 2004, 20 (11) : 1799 - 1800
  • [50] An Incremental Clustering of Gene Expression data
    Das, Rosy
    Bhattacharyya, Dhruba K.
    Kalita, Jugal K.
    2009 WORLD CONGRESS ON NATURE & BIOLOGICALLY INSPIRED COMPUTING (NABIC 2009), 2009, : 741 - +