Content-based microarray search using differential expression profiles

Cited by: 31
Authors
Engreitz, Jesse M. [2]
Morgan, Alexander A. [1]
Dudley, Joel T. [1, 3, 4]
Chen, Rong [3, 4]
Thathoo, Rahul [5]
Altman, Russ B. [2, 3, 6]
Butte, Atul J. [1, 3, 4, 7]
Affiliations
[1] Stanford Univ, Sch Med, Biomed Informat Training Program, Stanford, CA 94305 USA
[2] Stanford Univ, Sch Med, Dept Bioengn, Stanford, CA 94305 USA
[3] Stanford Univ, Dept Med, Sch Med, Stanford, CA 94305 USA
[4] Stanford Univ, Dept Pediat, Sch Med, Stanford, CA 94305 USA
[5] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
[6] Stanford Univ, Dept Genet, Sch Med, Stanford, CA 94305 USA
[7] Stanford Univ, Lucile Packard Childrens Hosp, Stanford, CA 94305 USA
Source
BMC BIOINFORMATICS | 2010 / Vol. 11
Keywords
EMBRYONIC STEM-CELLS; INDEPENDENT COMPONENT ANALYSIS; FOXO TRANSCRIPTION FACTORS; GENE-EXPRESSION; LUNG-CANCER; HOMEOSTASIS; DISCOVERY; PLURIPOTENCY; HYPOXIA; NANOG;
DOI
10.1186/1471-2105-11-603
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Background: With the expansion of public repositories such as the Gene Expression Omnibus (GEO), we are rapidly cataloging cellular transcriptional responses to diverse experimental conditions. Methods that query these repositories based on gene expression content, rather than textual annotations, may enable more effective experiment retrieval as well as the discovery of novel associations between drugs, diseases, and other perturbations.
Results: We develop methods to retrieve gene expression experiments that differentially express the same transcriptional programs as a query experiment. Avoiding thresholds, we generate differential expression profiles that include a score for each gene measured in an experiment. We use existing and novel dimension reduction and correlation measures to rank relevant experiments in an entirely data-driven manner, allowing emergent features of the data to drive the results. A combination of matrix decomposition and p-weighted Pearson correlation proves the most suitable for comparing differential expression profiles. We apply this method to index all GEO DataSets, and demonstrate the utility of our approach by identifying pathways and conditions relevant to transcription factors Nanog and FoxO3.
Conclusions: Content-based gene expression search generates relevant hypotheses for biological inquiry. Experiments across platforms, tissue types, and protocols inform the analysis of new datasets.
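The abstract names a p-weighted Pearson correlation for comparing differential expression profiles but does not define it. The sketch below is only an illustration of the general idea: it ranks a small library of profiles against a query using a weighted Pearson correlation. The weighting scheme (summed negative log10 p-values from both profiles), the profile layout (`scores` and `pvals` lists over a shared gene order), and the function names `p_weights` and `rank_experiments` are all assumptions made for this example, not the authors' formulation. The matrix decomposition step mentioned in the abstract is omitted.

```python
import math


def weighted_pearson(x, y, w):
    """Weighted Pearson correlation of x and y with non-negative weights w."""
    if not (len(x) == len(y) == len(w)):
        raise ValueError("x, y, and w must have the same length")
    total = sum(w)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    # Weighted means of each profile.
    mx = sum(wi * xi for wi, xi in zip(w, x)) / total
    my = sum(wi * yi for wi, yi in zip(w, y)) / total
    # Weighted covariance and variances (the common 1/total factor cancels).
    cov = sum(wi * (xi - mx) * (yi - my) for wi, xi, yi in zip(w, x, y))
    vx = sum(wi * (xi - mx) ** 2 for wi, xi in zip(w, x))
    vy = sum(wi * (yi - my) ** 2 for wi, yi in zip(w, y))
    if vx == 0 or vy == 0:
        return 0.0  # a constant profile carries no correlation signal
    return cov / math.sqrt(vx * vy)


def p_weights(p_query, p_target, floor=1e-300):
    """ASSUMED weighting: genes with small p-values in either profile count more.

    This is a hypothetical choice for illustration, not the paper's definition.
    """
    return [
        -math.log10(max(pq, floor)) - math.log10(max(pt, floor))
        for pq, pt in zip(p_query, p_target)
    ]


def rank_experiments(query, library):
    """Return (name, correlation) pairs sorted from most to least similar.

    `query` and each library value are dicts with per-gene 'scores' and
    'pvals' lists in the same gene order (a hypothetical layout).
    """
    results = []
    for name, profile in library.items():
        w = p_weights(query["pvals"], profile["pvals"])
        r = weighted_pearson(query["scores"], profile["scores"], w)
        results.append((name, r))
    return sorted(results, key=lambda item: item[1], reverse=True)


# Toy data: three genes, two library experiments (values are made up).
query = {"scores": [2.0, -1.5, 0.3], "pvals": [0.001, 0.01, 0.8]}
library = {
    "similar": {"scores": [1.8, -1.2, 0.1], "pvals": [0.002, 0.02, 0.9]},
    "opposite": {"scores": [-2.0, 1.4, -0.2], "pvals": [0.001, 0.03, 0.7]},
}
ranked = rank_experiments(query, library)
```

Because no score threshold is applied, every gene contributes to the correlation, and the weights only modulate how much. This mirrors the abstract's "avoiding thresholds" framing, though the actual weighting used in the paper may differ.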
Pages: 12