A graphical model method for integrating multiple sources of genome-scale data

被引:5
|
作者
Dvorkin, Daniel [1 ]
Biehs, Brian [2 ,3 ]
Kechris, Katerina [1 ,4 ]
机构
[1] Univ Colorado, Sch Med, Computat Biosci Program, Aurora, CO 80045 USA
[2] Univ Calif San Francisco, Cardiovasc Res Inst, San Francisco, CA 94143 USA
[3] Univ Calif San Francisco, Dept Biochem & Biophys, San Francisco, CA 94143 USA
[4] Colorado Sch Publ Hlth, Dept Biostat & Informat, Aurora, CO 80045 USA
关键词
data integration; genomics; graphical models; mixture models; GENE-EXPRESSION DATA; MIXTURE-MODELS; DNA-BINDING; CHIP-CHIP; DISCOVERY; IDENTIFICATION; TRANSCRIPTION; TARGETS; SAMPLES; DORSAL;
D O I
10.1515/sagmb-2012-0051
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Making effective use of multiple data sources is a major challenge in modern bioinformatics. Genome-wide data such as measures of transcription factor binding, gene expression, and sequence conservation, which are used to identify binding regions and genes that are important to major biological processes such as development and disease, can be difficult to use together due to the different biological meanings and statistical distributions of the heterogeneous data types, but each can provide valuable information for understanding the processes under study. Here we present methods for integrating multiple data sources to gain a more complete picture of gene regulation and expression. Our goal is to identify genes and cis-regulatory regions which play specific biological roles. We describe a graphical mixture model approach for data integration, examine the effect of using different model topologies, and discuss methods for evaluating the effectiveness of the models. Model fitting is computationally efficient and produces results which have clear biological and statistical interpretations. The Hedgehog and Dorsal signaling pathways in Drosophila, which are critical in embryonic development, are used as examples.
引用
收藏
页码:469 / 487
页数:19
相关论文
共 50 条
  • [31] BiGG Models: A platform for integrating, standardizing and sharing genome-scale models
    King, Zachary A.
    Lu, Justin
    Draeger, Andreas
    Miller, Philip
    Federowicz, Stephen
    Lerman, Joshua A.
    Ebrahim, Ali
    Palsson, Bernhard O.
    Lewis, Nathan E.
    NUCLEIC ACIDS RESEARCH, 2016, 44 (D1) : D515 - D522
  • [32] Integrating multiple data sources and multi-scale land-cover data to model the distribution of a declining amphibian
    Rose, Jonathan P.
    Halstead, Brian J.
    Fisher, Robert N.
    BIOLOGICAL CONSERVATION, 2020, 241
  • [33] Inferring Balancing Selection From Genome-Scale Data
    Bitarello, Barbara D.
    Brandt, Debora Y. C.
    Meyer, Diogo
    Andres, Aida M.
    GENOME BIOLOGY AND EVOLUTION, 2023, 15 (03):
  • [34] Estimating phylogenetic trees from genome-scale data
    Liu, Liang
    Xi, Zhenxiang
    Wu, Shaoyuan
    Davis, Charles C.
    Edwards, Scott V.
    YEAR IN EVOLUTIONARY BIOLOGY, 2015, 1360 : 36 - 53
  • [35] Integration and analysis of genome-scale data from gliomas
    Gregory Riddick
    Howard A. Fine
    Nature Reviews Neurology, 2011, 7 : 439 - 450
  • [36] Integration and analysis of genome-scale data from gliomas
    Riddick, Gregory
    Fine, Howard A.
    NATURE REVIEWS NEUROLOGY, 2011, 7 (08) : 439 - 450
  • [37] Genome-Scale Data Call for a Taxonomic Rearrangement of Geodermatophilaceae
    Montero-Calasanz, Maria del Carmen
    Meier-Kolthoff, Jan P.
    Zhang, Dao-Feng
    Yaramis, Adnan
    Rohde, Manfred
    Woyke, Tanja
    Kyrpides, Nikos C.
    Schumann, Peter
    Li, Wen-Jun
    Goeker, Markus
    FRONTIERS IN MICROBIOLOGY, 2017, 8
  • [38] Analysis of omics data with genome-scale models of metabolism
    Hyduke, Daniel R.
    Lewis, Nathan E.
    Palsson, Bernhard O.
    MOLECULAR BIOSYSTEMS, 2013, 9 (02) : 167 - 174
  • [39] High throughput barcoding method for genome-scale phasing
    Redin, David
    Frick, Tobias
    Aghelpasand, Hooman
    Kaller, Max
    Borgstrom, Erik
    Olsen, Remi-Andre
    Ahmadian, Afshin
    SCIENTIFIC REPORTS, 2019, 9 (1)
  • [40] Reframed Genome-Scale Metabolic Model to Facilitate Genetic Design and Integration with Expression Data
    Gu, Deqing
    Jian, Xingxing
    Zhang, Cheng
    Hua, Qiang
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2017, 14 (06) : 1410 - 1418