A data structure and function classification based method to evaluate clustering models for gene expression data

Cited by: 0
Authors
易东
杨梦苏
黄明辉
李辉智
王文昌
Affiliations
[1] Applied Research Centre for Genomics Technology
[2] Department of Electronic Technology
[3] 83 Tat Chee Avenue
[4] Kowloon
[5] Department of Biology & Chemistry
[6] Department of Medical Statistics
[7] Chongqing 400031
[8] Southwest University of Politics and Law Science
[9] China
[10] Third Military Medical University
[11] City University of Hong Kong
[12] Chongqing 400038
Keywords
gene expression; evaluation of clustering; adjust-FOM; entropy
DOI
Not available
Chinese Library Classification (CLC) number
R311 [Medical mathematics]
Subject classification code
1001
Abstract
Objective: To establish a systematic framework for selecting the best clustering algorithm and to provide an evaluation method for clustering analyses of gene expression data. Methods: Clustering results for gene expression data were evaluated with two approaches, one based on function classification (external information) and one based on data structure (internal information). First, to assess the predictive power of clustering algorithms, entropy was introduced to measure the consistency between the clustering results from different algorithms and known, validated functional classifications. Second, a modified figure of merit (adjust-FOM) was used as the internal assessment method: a clustering algorithm was applied to the data from all experimental conditions but one, and the remaining condition was used to assess the predictive power of the resulting clusters. The method was applied to 3 gene expression data sets (2 from Iyer's Serum Data Sets, and 1 from Ferea's Saccharomyces
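The two evaluation ideas described in the abstract can be illustrated with a minimal Python sketch. The abstract does not give the formulas, so everything here is an assumption: the entropy is computed as the size-weighted class entropy within each cluster, the figure of merit as the root-mean-square deviation of the held-out condition from its cluster means, and the adjustment factor sqrt((n - k) / n) is one commonly used correction that may differ from the paper's adjust-FOM. The names `clustering_entropy`, `leave_one_out_fom`, and `mean_sign_clusters` are hypothetical.

```python
import numpy as np


def clustering_entropy(cluster_labels, class_labels):
    """Size-weighted entropy (in bits) of known classes within each cluster.

    0 means every cluster holds genes from a single functional class.
    Assumed definition; the abstract does not state the exact formula.
    """
    cluster_labels = np.asarray(cluster_labels)
    class_labels = np.asarray(class_labels)
    n = len(cluster_labels)
    total = 0.0
    for c in np.unique(cluster_labels):
        members = class_labels[cluster_labels == c]
        _, counts = np.unique(members, return_counts=True)
        p = counts / counts.sum()
        total += (len(members) / n) * -(p * np.log2(p)).sum()
    return total


def leave_one_out_fom(data, cluster_fn, k, adjust=True):
    """Sum of per-condition figures of merit; lower is better.

    data: genes x conditions matrix.
    cluster_fn(training, k) must return one cluster label per gene.
    """
    data = np.asarray(data, dtype=float)
    n_genes, n_conditions = data.shape
    total = 0.0
    for e in range(n_conditions):
        # Cluster on every condition except e, then score on condition e.
        training = np.delete(data, e, axis=1)
        labels = np.asarray(cluster_fn(training, k))
        held_out = data[:, e]
        sq_dev = 0.0
        for c in np.unique(labels):
            values = held_out[labels == c]
            sq_dev += ((values - values.mean()) ** 2).sum()
        fom_e = np.sqrt(sq_dev / n_genes)
        if adjust:
            # Assumed correction for the number of clusters (needs n_genes > k).
            fom_e /= np.sqrt((n_genes - k) / n_genes)
        total += fom_e
    return total


def mean_sign_clusters(training, k):
    """Toy stand-in for a real clustering algorithm: split genes by mean sign."""
    return (training.mean(axis=1) > 0).astype(int)


data = np.array([[1.0, 1.0, 1.0],
                 [1.0, 1.0, 1.0],
                 [-1.0, -1.0, -1.0],
                 [-1.0, -1.0, -1.0]])
print(leave_one_out_fom(data, mean_sign_clusters, k=2))
print(clustering_entropy([0, 0, 1, 1], ["a", "a", "b", "b"]))
```

In practice `cluster_fn` would wrap whichever clustering algorithm is being compared, and both scores would be computed over a range of cluster numbers k.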
Pages: 312-317
Number of pages: 6