Content-based microarray search using differential expression profiles

被引:31
|
作者
Engreitz, Jesse M. [2 ]
Morgan, Alexander A. [1 ]
Dudley, Joel T. [1 ,3 ,4 ]
Chen, Rong [3 ,4 ]
Thathoo, Rahul [5 ]
Altman, Russ B. [2 ,3 ,6 ]
Butte, Atul J. [1 ,3 ,4 ,7 ]
机构
[1] Stanford Univ, Sch Med, Biomed Informat Training Program, Stanford, CA 94305 USA
[2] Stanford Univ, Sch Med, Dept Bioengn, Stanford, CA 94305 USA
[3] Stanford Univ, Dept Med, Sch Med, Stanford, CA 94305 USA
[4] Stanford Univ, Dept Pediat, Sch Med, Stanford, CA 94305 USA
[5] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
[6] Stanford Univ, Dept Genet, Sch Med, Stanford, CA 94305 USA
[7] Stanford Univ, Lucile Packard Childrens Hosp, Stanford, CA 94305 USA
来源
BMC BIOINFORMATICS | 2010年 / 11卷
关键词
EMBRYONIC STEM-CELLS; INDEPENDENT COMPONENT ANALYSIS; FOXO TRANSCRIPTION FACTORS; GENE-EXPRESSION; LUNG-CANCER; HOMEOSTASIS; DISCOVERY; PLURIPOTENCY; HYPOXIA; NANOG;
D O I
10.1186/1471-2105-11-603
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: With the expansion of public repositories such as the Gene Expression Omnibus (GEO), we are rapidly cataloging cellular transcriptional responses to diverse experimental conditions. Methods that query these repositories based on gene expression content, rather than textual annotations, may enable more effective experiment retrieval as well as the discovery of novel associations between drugs, diseases, and other perturbations. Results: We develop methods to retrieve gene expression experiments that differentially express the same transcriptional programs as a query experiment. Avoiding thresholds, we generate differential expression profiles that include a score for each gene measured in an experiment. We use existing and novel dimension reduction and correlation measures to rank relevant experiments in an entirely data-driven manner, allowing emergent features of the data to drive the results. A combination of matrix decomposition and p-weighted Pearson correlation proves the most suitable for comparing differential expression profiles. We apply this method to index all GEO DataSets, and demonstrate the utility of our approach by identifying pathways and conditions relevant to transcription factors Nanog and FoxO3. Conclusions: Content-based gene expression search generates relevant hypotheses for biological inquiry. Experiments across platforms, tissue types, and protocols inform the analysis of new datasets.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] System profiles in content-based image indexing and retrieval
    Esin Guldogan
    Moncef Gabbouj
    Signal, Image and Video Processing, 2010, 4 : 463 - 480
  • [42] System profiles in content-based image indexing and retrieval
    Guldogan, Esin
    Gabbouj, Moncef
    SIGNAL IMAGE AND VIDEO PROCESSING, 2010, 4 (04) : 463 - 480
  • [43] A New Content-Based Search Mechanism for Image Retrieval Search Engine
    Jasmine, K. S.
    Raj, Rishav
    Naik, Mahalakshmi Mabla
    INTERNATIONAL JOURNAL OF INFORMATION RETRIEVAL RESEARCH, 2022, 12 (01)
  • [44] Content-based search and annotations in multimedia digital libraries
    Arias, JA
    Sánchez, JA
    PROCEEDINGS OF THE FOURTH MEXICAN INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE (ENC 2003), 2003, : 109 - 116
  • [45] Content-Based Sensor Search with a Matching Estimation Mechanism
    Zhang, Puning
    Liu, Yuan-an
    Wu, Fan
    Fan, Wenhao
    Tang, Bihua
    IEICE TRANSACTIONS ON COMMUNICATIONS, 2016, E99B (09) : 1949 - 1957
  • [46] Relevance feedback and learning in content-based image search
    Zhang, HJ
    Chen, Z
    Li, MJ
    Su, Z
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2003, 6 (02): : 131 - 155
  • [47] A Framework for Content-Based Search in Large Music Collections
    Zhu, Tiange
    Fournier-S'niehotta, Raphael
    Rigaux, Philippe
    Travers, Nicolas
    BIG DATA AND COGNITIVE COMPUTING, 2022, 6 (01)
  • [48] Content-based music similarity search and emotion detection
    Li, T
    Ogihara, M
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL V, PROCEEDINGS: DESIGN AND IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS INDUSTRY TECHNOLOGY TRACKS MACHINE LEARNING FOR SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING SIGNAL PROCESSING FOR EDUCATION, 2004, : 705 - 708
  • [49] Exploiting Correlations for Efficient Content-based Sensor Search
    Mietz, Richard
    Roemer, Kay
    2011 IEEE SENSORS, 2011, : 187 - 190
  • [50] Content-Based Element Search for Presentation Slide Reuse
    Zhang, Jie
    Xiao, Chuan
    Watanabe, Toyohide
    Ishikawa, Yoshiharu
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2014, E97D (10): : 2685 - 2696