Independent component analysis of Alzheimer's DNA microarray gene expression data

被引:61
|
作者
Kong, Wei [1 ,2 ,3 ,4 ]
Mou, Xiaoyang [1 ,2 ,3 ]
Liu, Qingzhong [5 ,6 ]
Chen, Zhongxue [7 ]
Vanderburg, Charles R. [8 ,9 ,10 ]
Rogers, Jack T. [10 ,11 ]
Huang, Xudong [1 ,2 ,3 ,10 ,11 ]
机构
[1] Brigham & Womens Hosp, Biomed Informat & Cheminformat Grp, Conjugate & Med Chem Lab, Div Nucl Med & Mol Imaging, Boston, MA 02115 USA
[2] Brigham & Womens Hosp, Dept Radiol, Ctr Adv Med Imaging, Boston, MA 02115 USA
[3] Harvard Univ, Sch Med, Boston, MA 02115 USA
[4] Shanghai Maritime Univ, Informat Engn Coll, Shanghai 200135, Peoples R China
[5] New Mexico Inst Min & Technol, Dept Comp Sci, Socorro, NM 87801 USA
[6] New Mexico Inst Min & Technol, Inst Complex Addit Syst Anal, Socorro, NM 87801 USA
[7] Childrens Hosp Philadelphia, Joseph Stokes Jr Res Inst, Philadelphia, PA 19104 USA
[8] Massachusetts Gen Hosp, Dept Neurol, Charlestown, MA 02129 USA
[9] Massachusetts Gen Hosp, Harvard NeuroDiscovery Ctr, Adv Tissue Resource Ctr, Charlestown, MA 02129 USA
[10] Harvard Univ, Sch Med, Charlestown, MA 02129 USA
[11] Massachusetts Gen Hosp, Dept Psychiat, Neurochem Lab, Charlestown, MA 02129 USA
基金
中国国家自然科学基金;
关键词
NERVOUS-SYSTEM; RHO GTPASES; KINASE-II; DISEASE; ACTIN; PROFILES; MODEL; RAC;
D O I
10.1186/1750-1326-4-5
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
Background: Gene microarray technology is an effective tool to investigate the simultaneous activity of multiple cellular pathways from hundreds to thousands of genes. However, because data in the colossal amounts generated by DNA microarray technology are usually complex, noisy, high-dimensional, and often hindered by low statistical power, their exploitation is difficult. To overcome these problems, two kinds of unsupervised analysis methods for microarray data: principal component analysis (PCA) and independent component analysis (ICA) have been developed to accomplish the task. PCA projects the data into a new space spanned by the principal components that are mutually orthonormal to each other. The constraint of mutual orthogonality and second-order statistics technique within PCA algorithms, however, may not be applied to the biological systems studied. Extracting and characterizing the most informative features of the biological signals, however, require higher-order statistics. Results: ICA is one of the unsupervised algorithms that can extract higher-order statistical structures from data and has been applied to DNA microarray gene expression data analysis. We performed FastICA method on DNA microarray gene expression data from Alzheimer's disease (AD) hippocampal tissue samples and consequential gene clustering. Experimental results showed that the ICA method can improve the clustering results of AD samples and identify significant genes. More than 50 significant genes with high expression levels in severe AD were extracted, representing immunity-related protein, metal-related protein, membrane protein, lipoprotein, neuropeptide, cytoskeleton protein, cellular binding protein, and ribosomal protein. Within the aforementioned categories, our method also found 37 significant genes with low expression levels. Moreover, it is worth noting that some oncogenes and phosphorylation- related proteins are expressed in low levels. In comparison to the PCA and support vector machine recursive feature elimination (SVM-RFE) methods, which are widely used in microarray data analysis, ICA can identify more AD-related genes. Furthermore, we have validated and identified many genes that are associated with AD pathogenesis. Conclusion: We demonstrated that ICA exploits higher-order statistics to identify gene expression profiles as linear combinations of elementary expression patterns that lead to the construction of potential AD-related pathogenic pathways. Our computing results also validated that the ICA model outperformed PCA and the SVM-RFE method. This report shows that ICA as a microarray data analysis tool can help us to elucidate the molecular taxonomy of AD and other multifactorial and polygenic complex diseases.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Independent component analysis of Alzheimer's DNA microarray gene expression data
    Wei Kong
    Xiaoyang Mou
    Qingzhong Liu
    Zhongxue Chen
    Charles R Vanderburg
    Jack T Rogers
    Xudong Huang
    [J]. Molecular Neurodegeneration, 4
  • [2] Study DNA microarray gene expression data of Alzheimer's disease by independent component analysis
    Kong, Wei
    Mou, Xiaoyang
    Bin Yang
    [J]. 2009 INTERNATIONAL JOINT CONFERENCE ON BIOINFORMATICS, SYSTEMS BIOLOGY AND INTELLIGENT COMPUTING, PROCEEDINGS, 2009, : 44 - +
  • [3] A review of independent component analysis application to microarray gene expression data
    Kong, Wei
    Vanderburg, Charles R.
    Gunshin, Hiromi
    Rogers, Jack T.
    Huang, Xudong
    [J]. BIOTECHNIQUES, 2008, 45 (05) : 501 - +
  • [4] Independent component analysis: Mining microarray data for fundamental human gene expression modules
    Engreitz, Jesse M.
    Daigle, Bernie J., Jr.
    Marshall, Jonathan J.
    Altman, Russ B.
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2010, 43 (06) : 932 - 944
  • [5] Studying DNA microarray data using independent component analysis
    Berger, JA
    Mitra, SK
    Edgren, H
    [J]. ISCCSP : 2004 FIRST INTERNATIONAL SYMPOSIUM ON CONTROL, COMMUNICATIONS AND SIGNAL PROCESSING, 2004, : 747 - 750
  • [6] Differential analysis of DNA microarray gene expression data
    Hatfield, GW
    Hung, SP
    Baldi, P
    [J]. MOLECULAR MICROBIOLOGY, 2003, 47 (04) : 871 - 877
  • [7] A novel dimensionality reduction technique based on independent component analysis for modeling microarray gene expression data
    Liu, H
    Kustra, R
    Zhang, J
    [J]. IC-AI '04 & MLMTA'04 , VOL 1 AND 2, PROCEEDINGS, 2004, : 1133 - 1139
  • [8] Application of independent component analysis to microarray data
    Suri, RE
    [J]. INTERNATIONAL CONFERENCE ON INTEGRATION OF KNOWLEDGE INTENSIVE MULTI-AGENT SYSTEMS: KIMAS'03: MODELING, EXPLORATION, AND ENGINEERING, 2003, : 375 - 378
  • [9] An Ensemble Classifier Based on Selective Independent Component Analysis of DNA Microarray Data
    Wang Xuesong
    Gu Yangyang
    Cheng Yuhu
    Zhang Zheng
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2009, 18 (04) : 645 - 649
  • [10] Independent component analysis algorithms for microarray data analysis
    Malutan, Raul
    Gomez Vilda, Pedro
    Borda, Monica
    [J]. INTELLIGENT DATA ANALYSIS, 2010, 14 (02) : 193 - 206