Vector algebra in the analysis of genome-wide expression data

被引:0
|
作者
Kuruvilla, Finny G. [1 ]
Park, Peter J. [2 ]
Schreiber, Stuart L. [1 ]
机构
[1] Harvard Univ, Howard Hughes Med Inst, Bauer Ctr Genom Res, Dept Chem & Chem Biol, Cambridge, MA 02138 USA
[2] Harvard Univ, Sch Med, Dept Biostat,Childrens Hosp, Harvard Sch Publ Hlth,Informat Program, Boston, MA 02115 USA
来源
GENOME BIOLOGY | 2002年 / 3卷 / 03期
关键词
D O I
暂无
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Data from thousands of transcription-profiling experiments in organisms ranging from yeast to humans are now publicly available. How best to analyze these data remains an important challenge. A variety of tools have been used for this purpose, including hierarchical clustering, self-organizing maps and principal components analysis. In particular, concepts from vector algebra have proven useful in the study of genome-wide expression data. Results: Here we present a framework based on vector algebra for the analysis of transcription profiles that is geometrically intuitive and computationally efficient. Concepts in vector algebra such as angles, magnitudes, subspaces, singular value decomposition, bases and projections have natural and powerful interpretations in the analysis of microarray data. Angles in particular offer a rigorous method of defining 'similarity' and are useful in evaluating the claims of a microarray-based study. We present a sample analysis of cells treated with rapamycin, an immunosuppressant whose effects have been extensively studied with microarrays. In addition, the algebraic concept of a basis for a space affords the opportunity to simplify data analysis and uncover a limited number of expression vectors to span the transcriptional range of cell behavior. Conclusions: This framework represents a compact, powerful and scalable construction for analysis and computation. As the amount of microarray data in the public domain grows, these vector-based methods are relevant in determining statistical significance. These approaches are also well suited to extract biologically meaningful information in the analysis of signaling networks.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Vector algebra in the analysis of genome-wide expression data
    Finny G Kuruvilla
    Peter J Park
    Stuart L Schreiber
    [J]. Genome Biology, 3 (3):
  • [2] Biochemical systems analysis of genome-wide expression data
    Voit, EO
    Radivoyevitch, T
    [J]. BIOINFORMATICS, 2000, 16 (11) : 1023 - 1037
  • [3] Cluster analysis of genome-wide expression data for feature extraction
    Lin, Kuo-Sheng
    Chien, Chen-Fu
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (02) : 3327 - 3335
  • [4] Convergence of genome-wide expression analysis and genome-wide linkage analysis identifies candidate genes for atherosclerosis
    Hauser, ER
    Gregory, S
    Seo, D
    Dobra, A
    Iversen, E
    Karra, R
    Haynes, C
    Stenger, J
    Xu, H
    Wang, LY
    Huang, LL
    Sketch, M
    Vance, J
    Kraus, WE
    Goldschmidt, P
    [J]. CIRCULATION, 2004, 110 (17) : 823 - 823
  • [5] Bicluster analysis of genome-wide gene expression
    Chen, Kuanchung
    Hu, Yuh-Jyh
    [J]. PROCEEDINGS OF THE 2006 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2006, : 225 - +
  • [6] Simultaneous analysis of genome-wide SNP data
    Hoggart, C. J.
    De Iorio, M.
    Whittaker, J. C.
    Balding, D. J.
    [J]. GENETIC EPIDEMIOLOGY, 2007, 31 (06) : 609 - 609
  • [7] Reuse of public genome-wide gene expression data
    Johan Rung
    Alvis Brazma
    [J]. Nature Reviews Genetics, 2013, 14 : 89 - 99
  • [8] Reuse of public genome-wide gene expression data
    Rung, Johan
    Brazma, Alvis
    [J]. NATURE REVIEWS GENETICS, 2013, 14 (02) : 89 - 99
  • [9] Semiparametric methods for genome-wide linkage analysis of human gene expression data
    Guoqing Diao
    DY Lin
    [J]. BMC Proceedings, 1 (Suppl 1)
  • [10] An integrated approach for genome-wide gene expression analysis
    Hu, YJ
    [J]. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2001, 65 (03) : 163 - 174