A data structure and function classification based method to evaluate clustering models for gene expression data

被引:0
|
作者
易东
杨梦苏
黄明辉
李辉智
王文昌
机构
[1] Applied Research Centre for Genomics Technology
[2] Department of Electronic Technology
[3] 83 Tat Chee Avenue
[4] Kowloon
[5] Department of Biology & Chemistry
[6] Department of Medical Statistics
[7] Chongqing 400031
[8] Southwest University of Politics and Law Science
[9] China
[10] Third Military Medical University
[11] City University of Hong Kong
[12] Chongqing 400038
关键词
gene expression; evaluation of clustering; adjust-; FOM; entropy;
D O I
暂无
中图分类号
R311 [医用数学];
学科分类号
1001 ;
摘要
Objective: To establish a systematic framework for selecting the best clustering algorithm and provide an evaluation method for clustering analyses of gene expression data. Methods: Based on data structure (internal information) and function classification (external information), the evaluation of gene expression data analyses were carried out by using 2 approaches. Firstly, to assess the predictive power of clustering algorithms, Entropy was introduced to measure the consistency between the clustering results from different algorithms and the known and validated functional classifications. Secondly, a modified method of figure of merit (adjust-FOM) was used as internal assessment method. In this method, one clustering algorithm was used to analyze all data but one experimental condition, the remaining condition was used to assess the predictive power of the resulting clusters. This method was applied on 3 gene expression data sets (2 from the Lyer’s Serum Data Sets, and 1 from the Ferea’s Saccharomyces
引用
收藏
页码:312 / 317
页数:6
相关论文
共 50 条
  • [21] Clustering Based Classification in Data Mining Method Recommendation
    Kazik, Ondrej
    Peskova, Klara
    Smid, Jakub
    Neruda, Roman
    2013 12TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2013), VOL 2, 2013, : 356 - 361
  • [22] Ensemble clustering method based on the resampling similarity measure for gene expression data
    Kim, Seo Young
    Lee, Jae Won
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2007, 16 (06) : 539 - 564
  • [23] A data-driven clustering method for time course gene expression data
    Ma, P
    Castillo-Davis, CI
    Zhong, WX
    Liu, JS
    NUCLEIC ACIDS RESEARCH, 2006, 34 (04) : 1261 - 1269
  • [24] Gene-Ontology-based clustering of gene expression data
    Adryan, B
    Schuh, R
    BIOINFORMATICS, 2004, 20 (16) : 2851 - 2852
  • [25] Fuzzy Rule Based Clustering for Gene Expression Data
    Sinaee, Mehrnoosh
    Mansoori, Eghbal G.
    FOURTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS, MODELLING AND SIMULATION (ISMS 2013), 2013, : 7 - 11
  • [26] Clustering of Gene Expression Data Based on Shape Similarity
    Hestilow, Travis J.
    Huang, Yufei
    EURASIP JOURNAL ON BIOINFORMATICS AND SYSTEMS BIOLOGY, 2009, (01)
  • [27] Dynamic core based clustering of gene expression data
    1600, ICIC International (10):
  • [28] DYNAMIC CORE BASED CLUSTERING OF GENE EXPRESSION DATA
    Bocicor, Maria-Iuliana
    Sirbu, Adela
    Czibula, Gabriela
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2014, 10 (03): : 1051 - 1069
  • [29] Combining gene annotations and gene expression data in model-based clustering: Weighted method
    Huang, Desheng
    Wei, Peng
    Pan, Wei
    OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY, 2006, 10 (01) : 28 - 39
  • [30] A fuzzy approach to clustering and selecting features for classification of gene expression data
    Chitsaz, Elham
    Taheri, Mohammad
    Katebi, Seraj D.
    WORLD CONGRESS ON ENGINEERING 2008, VOLS I-II, 2008, : 1650 - 1655