Permutation-validated principal components analysis of microarray data

被引:0
|
作者
Landgrebe, Jobst [2 ]
Wurst, Wolfgang [2 ,3 ]
Welzl, Gerhard [1 ]
机构
[1] GSF Natl Res Ctr Environm & Hlth, Inst Biomath & Biometry, D-85764 Neuherberg, Germany
[2] Max Planck Inst Psychiat, D-80804 Munich, Germany
[3] GSF Natl Res Ctr Environm & Hlth, Inst Mammalian Genet, D-85764 Neuherberg, Germany
来源
GENOME BIOLOGY | 2002年 / 3卷 / 04期
关键词
D O I
暂无
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: In microarray data analysis, the comparison of gene-expression profiles with respect to different conditions and the selection of biologically interesting genes are crucial tasks. Multivariate statistical methods have been applied to analyze these large datasets. Less work has been published concerning the assessment of the reliability of gene-selection procedures. Here we describe a method to assess reliability in multivariate microarray data analysis using permutation-validated principal components analysis (PCA). The approach is designed for microarray data with a group structure. Results: We used PCA to detect the major sources of variance underlying the hybridization conditions followed by gene selection based on PCA-derived and permutation-based test statistics. We validated our method by applying it to well characterized yeast cell-cycle data and to two datasets from our laboratory. We could describe the major sources of variance, select informative genes and visualize the relationship of genes and arrays. We observed differences in the level of the explained variance and the interpretability of the selected genes. Conclusions: Combining data visualization and permutation-based gene selection, permutation-validated PCA enables one to illustrate gene-expression variance between several conditions and to select genes by taking into account the relationship of between-group to within-group variance of genes. The method can be used to extract the leading sources of variance from microarray data, to visualize relationships between genes and hybridizations and to select informative genes in a statistically reliable manner. This selection accounts for the level of reproducibility of replicates or group structure as well as gene-specific scatter. Visualization of the data can support a straightforward biological interpretation.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Permutation-validated principal components analysis of microarray data
    Jobst Landgrebe
    Wolfgang Wurst
    Gerhard Welzl
    Genome Biology, 3 (4)
  • [2] Interpretation of results of SOM analysis of microarray data by principal components
    Efimov, V. M.
    Badratinov, M. S.
    Katokhin, A. V.
    PROCEEDINGS OF THE FIFTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE, VOL 1, 2006, : 44 - +
  • [3] A permutation test motivated by microarray data analysis
    Klebanov, L.
    Gordon, A.
    Xiao, Y.
    Land, H.
    Yakovlev, A.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2006, 50 (12) : 3619 - 3628
  • [4] Principal components analysis and the reported low intrinsic dimensionality of gene expression microarray data
    Lenz, Michael
    Mueller, Franz-Josef
    Zenke, Martin
    Schuppert, Andreas
    SCIENTIFIC REPORTS, 2016, 6
  • [5] Principal components analysis and the reported low intrinsic dimensionality of gene expression microarray data
    Michael Lenz
    Franz-Josef Müller
    Martin Zenke
    Andreas Schuppert
    Scientific Reports, 6
  • [6] Intelligent system for the analysis of microarray data using principal components and estimation of distribution algorithms
    Cano, C.
    Garcia, F.
    Lopez, F. J.
    Blanco, A.
    EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (03) : 4654 - 4663
  • [7] Penalized Principal Component Analysis of Microarray Data
    Nikulin, Vladimir
    McLachlan, Geoffrey J.
    COMPUTATIONAL INTELLIGENCE METHODS FOR BIOINFORMATICS AND BIOSTATISTICS, 2010, 6160 : 82 - 96
  • [8] ANALYSIS OF PRINCIPAL COMPONENTS OF ORIENTATION DATA
    MARKLAND, J
    INTERNATIONAL JOURNAL OF ROCK MECHANICS AND MINING SCIENCES, 1974, 11 (05): : 157 - 163
  • [9] Robust Principal Components for Hyperspectral Data Analysis
    Lucini, Maria M.
    Frery, Alejandro C.
    IMAGE ANALYSIS AND RECOGNITION, PROCEEDINGS, 2009, 5627 : 126 - +
  • [10] PRINCIPAL COMPONENTS ANALYSIS FOR RIGHT CENSORED DATA
    Langworthy, Benjamin W.
    Cai, Jianwen
    Corty, Robert W.
    Kosorok, Michael R.
    Fine, Jason P.
    STATISTICA SINICA, 2023, 33 (03) : 1985 - 2016