Reuse of public genome-wide gene expression data

被引:0
|
作者
Johan Rung
Alvis Brazma
机构
[1] EMBL–EBI,
[2] Wellcome Trust Genome Campus,undefined
来源
Nature Reviews Genetics | 2013年 / 14卷
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Over the past decade, high-throughput gene expression experiments have generated data from millions of assays. Data sets linked to publications are stored in functional genomics data archives: ArrayExpress at the European Bioinformatics Institute, Gene Expression Omnibus at the US National Center for Biotechnology Information and at the DNA Databank of Japan Omics Archive.Secondary added-value and topical databases process data from the primary archives, adding analysis and annotation to make these data accessible to every biologist by allowing queries such as 'in which tissue is a particular gene expressed?' or 'which genes are differentially expressed between a particular disease and normal samples?'Public gene expression data are commonly reused to study biological questions, both by reanalysis of primary data and by queries to secondary resources. Approximately half of the studies that use public gene expression data rely solely on existing data without adding newly generated data, and half of them use the public data in combination with new data.The reproducibility of published microarray-based studies is limited, mostly owing to insufficient experiment annotation and sometimes to unavailability of the raw or processed data. A stricter enforcement of Minimum Information About a Microarray Experiment (MIAME) requirements and also development of easy-to-use experiment annotation tools are needed to achieve a better reproducibility.Although most of the public gene expression data still are based on microarray experiments, the contribution of high-throughput-sequencing-based expression studies, known as RNA sequencing (RNA-seq), are growing rapidly.Reuse of RNA-seq data can potentially be even more valuable than reuse of microarray data, partly owing to the costs of experiments and data storage but even more importantly because of a more quantitative nature of sequencing-based expression data. Community standards such as Minimum Information about Sequencing Experiments (MINSEQE) should be adopted to make RNA-seq data maximally reusable.The bioinformatics resources that store and manage public data are sensitive to short-term funding changes, complicating the maintenance of important databases. The development of long-term infrastructure in bioinformatics, such as the ELIXIR project in Europe, is needed to ensure the long term availability of public data.
引用
收藏
页码:89 / 99
页数:10
相关论文
共 50 条
  • [1] Reuse of public genome-wide gene expression data
    Rung, Johan
    Brazma, Alvis
    [J]. NATURE REVIEWS GENETICS, 2013, 14 (02) : 89 - 99
  • [2] Reuse of public, genome-wide, murine eosinophil expression data for hypotheses development
    Grace, Jillian O.
    Malik, Astha
    Reichman, Hadar
    Munitz, Ariel
    Barski, Artem
    Fulkerson, Patricia C.
    [J]. JOURNAL OF LEUKOCYTE BIOLOGY, 2018, 104 (01) : 185 - 193
  • [3] Genome-wide in silico prediction of gene expression
    McLeay, Robert C.
    Lesluyes, Tom
    Partida, Gabriel Cuellar
    Bailey, Timothy L.
    [J]. BIOINFORMATICS, 2012, 28 (21) : 2789 - 2796
  • [4] GENOME-WIDE GENE EXPRESSION SIGNATURE OF DEPRESSION
    Ciobanu, Liliana
    Sachdev, Perminder S.
    Trollor, Julian N.
    Reppermund, Simone
    Thalamuthu, Anbupalam
    Mather, Karen A.
    Cohen-Woods, Sarah
    Stacey, David
    Toben, Catherine
    Baune, Bernhard
    [J]. EUROPEAN NEUROPSYCHOPHARMACOLOGY, 2017, 27 : S484 - S484
  • [5] Genome-wide patterns of gene expression in cancer
    Botstein, D
    Brown, PO
    [J]. MOLECULAR BIOLOGY OF THE CELL, 1999, 10 : 1A - 1A
  • [6] GENOME-WIDE GENE EXPRESSION IN IGA NEPHROPATHY
    Lau, Y. K.
    Woo, K. T.
    Zhao, Y.
    Puong, K. Y.
    Aw, S. E.
    Wong, K. S.
    [J]. NEPHROLOGY, 2005, 10 : A11 - A12
  • [7] Genome-wide survey of gene expression in vitiligo
    Ogbogu, P
    Pan, H
    Xiang, J
    Sinha, A
    [J]. JOURNAL OF INVESTIGATIVE DERMATOLOGY, 2002, 119 (01) : 265 - 265
  • [8] Bicluster analysis of genome-wide gene expression
    Chen, Kuanchung
    Hu, Yuh-Jyh
    [J]. PROCEEDINGS OF THE 2006 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2006, : 225 - +
  • [9] Reverse Engineering of Genome-wide Gene Regulatory Networks from Gene Expression Data
    Liu, Zhi-Ping
    [J]. CURRENT GENOMICS, 2015, 16 (01) : 3 - 22
  • [10] Semiparametric methods for genome-wide linkage analysis of human gene expression data
    Guoqing Diao
    DY Lin
    [J]. BMC Proceedings, 1 (Suppl 1)