Comprehensive evaluation of matrix factorization methods for the analysis of DNA microarray gene expression data

被引:12
|
作者
Kim, Mi Hyeon [1 ,2 ]
Seo, Hwa Jeong [3 ]
Joung, Je-Gun [1 ,2 ,4 ]
Kim, Ju Han [1 ,2 ]
机构
[1] SNUBI, Syst Biomed Informat Res Ctr, Seoul 110799, South Korea
[2] Seoul Natl Univ, Interdisciplinary Program Med Informat, Div Biomed Informat, Coll Med, Seoul 110799, South Korea
[3] Gachon Univ Med & Sci, Grad Sch Publ Hlth, Inchon 40576, South Korea
[4] Seoul Natl Univ, Inst Endem Dis, Coll Med, Seoul 110799, South Korea
来源
BMC BIOINFORMATICS | 2011年 / 12卷
基金
新加坡国家研究基金会;
关键词
CLASS DISCOVERY; CLASSIFICATION; CANCER; PREDICTION; VALIDATION; ARRAYXPATH; SYSTEM;
D O I
10.1186/1471-2105-12-S13-S8
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Clustering-based methods on gene-expression analysis have been shown to be useful in biomedical applications such as cancer subtype discovery. Among them, Matrix factorization (MF) is advantageous for clustering gene expression patterns from DNA microarray experiments, as it efficiently reduces the dimension of gene expression data. Although several MF methods have been proposed for clustering gene expression patterns, a systematic evaluation has not been reported yet. Results: Here we evaluated the clustering performance of orthogonal and non-orthogonal MFs by a total of nine measurements for performance in four gene expression datasets and one well-known dataset for clustering. Specifically, we employed a non-orthogonal MF algorithm, BSNMF (Bi-directional Sparse Non-negative Matrix Factorization), that applies bi-directional sparseness constraints superimposed on non-negative constraints, comprising a few dominantly co-expressed genes and samples together. Non-orthogonal MFs tended to show better clustering-quality and prediction-accuracy indices than orthogonal MFs as well as a traditional method, K-means. Moreover, BSNMF showed improved performance in these measurements. Non-orthogonal MFs including BSNMF showed also good performance in the functional enrichment test using Gene Ontology terms and biological pathways. Conclusions: In conclusion, the clustering performance of orthogonal and non-orthogonal MFs was appropriately evaluated for clustering microarray data by comprehensive measurements. This study showed that non-orthogonal MFs have better performance than orthogonal MFs and K-means for clustering microarray data.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Comprehensive evaluation of matrix factorization methods for the analysis of DNA microarray gene expression data
    Mi Hyeon Kim
    Hwa Jeong Seo
    Je-Gun Joung
    Ju Han Kim
    [J]. BMC Bioinformatics, 12
  • [2] Differential analysis of DNA microarray gene expression data
    Hatfield, GW
    Hung, SP
    Baldi, P
    [J]. MOLECULAR MICROBIOLOGY, 2003, 47 (04) : 871 - 877
  • [3] Gene expression (microarray) data analysis by chemometric methods
    Zhu, David X.
    Goeke, Richard J.
    Baker, David L.
    Hamburg, James H.
    Booth, David E.
    Booth, Stephane E.
    [J]. CURRENT ANALYTICAL CHEMISTRY, 2007, 3 (03) : 233 - 237
  • [4] A comprehensive evaluation of multicategory classification methods for microarray gene expression cancer diagnosis
    Statnikov, A
    Aliferis, CF
    Tsamardinos, I
    Hardin, D
    Levy, S
    [J]. BIOINFORMATICS, 2005, 21 (05) : 631 - 643
  • [5] An evaluation of statistical methods for DNA methylation microarray data analysis
    Dongmei Li
    Zidian Xie
    Marc Le Pape
    Timothy Dye
    [J]. BMC Bioinformatics, 16 (1)
  • [6] An evaluation of statistical methods for DNA methylation microarray data analysis
    Li, Dongmei
    Xie, Zidian
    Le Pape, Marc
    Dye, Timothy
    [J]. BMC BIOINFORMATICS, 2015, 16
  • [7] Impact of DNA microarray data transformation on gene expression analysis - comparison of two normalization methods
    Schmidt, Marcin T.
    Handschuh, Luiza
    Zyprych, Joanna
    Szabelska, Alicja
    Olejnik-Schmidt, Agnieszka K.
    Siatkowski, Idzi
    Figlerowicz, Marek
    [J]. ACTA BIOCHIMICA POLONICA, 2011, 58 (04) : 573 - 580
  • [8] A comprehensive evaluation of module detection methods for gene expression data
    Wouter Saelens
    Robrecht Cannoodt
    Yvan Saeys
    [J]. Nature Communications, 9
  • [9] A comprehensive evaluation of module detection methods for gene expression data
    Saelens, Wouter
    Cannoodt, Robrecht
    Saeys, Yvan
    [J]. NATURE COMMUNICATIONS, 2018, 9
  • [10] Non-negative Matrix and Tensor Factorization Based Classification of Clinical Microarray Gene Expression Data
    Li, Yifeng
    Ngom, Alioune
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2010, : 438 - 443