Content-based microarray search using differential expression profiles

被引:31
|
作者
Engreitz, Jesse M. [2 ]
Morgan, Alexander A. [1 ]
Dudley, Joel T. [1 ,3 ,4 ]
Chen, Rong [3 ,4 ]
Thathoo, Rahul [5 ]
Altman, Russ B. [2 ,3 ,6 ]
Butte, Atul J. [1 ,3 ,4 ,7 ]
机构
[1] Stanford Univ, Sch Med, Biomed Informat Training Program, Stanford, CA 94305 USA
[2] Stanford Univ, Sch Med, Dept Bioengn, Stanford, CA 94305 USA
[3] Stanford Univ, Dept Med, Sch Med, Stanford, CA 94305 USA
[4] Stanford Univ, Dept Pediat, Sch Med, Stanford, CA 94305 USA
[5] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
[6] Stanford Univ, Dept Genet, Sch Med, Stanford, CA 94305 USA
[7] Stanford Univ, Lucile Packard Childrens Hosp, Stanford, CA 94305 USA
来源
BMC BIOINFORMATICS | 2010年 / 11卷
关键词
EMBRYONIC STEM-CELLS; INDEPENDENT COMPONENT ANALYSIS; FOXO TRANSCRIPTION FACTORS; GENE-EXPRESSION; LUNG-CANCER; HOMEOSTASIS; DISCOVERY; PLURIPOTENCY; HYPOXIA; NANOG;
D O I
10.1186/1471-2105-11-603
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: With the expansion of public repositories such as the Gene Expression Omnibus (GEO), we are rapidly cataloging cellular transcriptional responses to diverse experimental conditions. Methods that query these repositories based on gene expression content, rather than textual annotations, may enable more effective experiment retrieval as well as the discovery of novel associations between drugs, diseases, and other perturbations. Results: We develop methods to retrieve gene expression experiments that differentially express the same transcriptional programs as a query experiment. Avoiding thresholds, we generate differential expression profiles that include a score for each gene measured in an experiment. We use existing and novel dimension reduction and correlation measures to rank relevant experiments in an entirely data-driven manner, allowing emergent features of the data to drive the results. A combination of matrix decomposition and p-weighted Pearson correlation proves the most suitable for comparing differential expression profiles. We apply this method to index all GEO DataSets, and demonstrate the utility of our approach by identifying pathways and conditions relevant to transcription factors Nanog and FoxO3. Conclusions: Content-based gene expression search generates relevant hypotheses for biological inquiry. Experiments across platforms, tissue types, and protocols inform the analysis of new datasets.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Content-based microarray search using differential expression profiles
    Jesse M Engreitz
    Alexander A Morgan
    Joel T Dudley
    Rong Chen
    Rahul Thathoo
    Russ B Altman
    Atul J Butte
    BMC Bioinformatics, 11
  • [2] Content-based search of gene expression databases using binary fingerprints of differential expression profiles
    Bell F.
    Sacan A.
    Network Modeling Analysis in Health Informatics and Bioinformatics, 2015, 4 (01)
  • [3] Content-Based Search on Time-Series Microarray Databases Using Cluster-Based Fingerprints
    Ozkoc, Esma Erguner
    Ogul, Hasan
    CURRENT BIOINFORMATICS, 2017, 12 (05) : 398 - 405
  • [4] Content-based retrieval using heuristic search
    Papadias, D
    Mantzourogiannis, M
    Kalnis, P
    Mamoulis, N
    Ahmad, I
    SIGIR'99: PROCEEDINGS OF 22ND INTERNATIONAL CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 1999, : 168 - +
  • [5] Content-Based Image Retrieval Using Iterative Search
    Zhou, Zhengzhong
    Zhang, Liqing
    NEURAL PROCESSING LETTERS, 2018, 47 (03) : 907 - 919
  • [6] Content-Based Image Retrieval Using Iterative Search
    Zhengzhong Zhou
    Liqing Zhang
    Neural Processing Letters, 2018, 47 : 907 - 919
  • [7] Content-Based Image Retrieval Using Deep Search
    Zhou, Zhengzhong
    Zhang, Liqing
    NEURAL INFORMATION PROCESSING, ICONIP 2016, PT II, 2016, 9948 : 627 - 634
  • [8] DNA Sequence Search Using Content-Based Image Search Approach
    Ramampiaro, Heri
    Grande, Aleksander
    5TH INTERNATIONAL CONFERENCE ON PRACTICAL APPLICATIONS OF COMPUTATIONAL BIOLOGY & BIOINFORMATICS (PACBB 2011), 2011, 93 : 191 - 199
  • [9] Author Profiles Prediction Using Syntactic and Content-Based Features
    Reddy, T. Raghunadha
    Srilatha, M.
    Sreenivas, M.
    Rajasekhar, N.
    DATA ENGINEERING AND COMMUNICATION TECHNOLOGY, ICDECT-2K19, 2020, 1079 : 265 - 273
  • [10] 3D content-based search using sketches
    Konstantinos Moustakas
    Georgios Nikolakis
    Dimitrios Tzovaras
    Sebastien Carbini
    Olivier Bernier
    Jean Emmanuel Viallet
    Personal and Ubiquitous Computing, 2009, 13 : 59 - 67