covRNA: discovering covariate associations in large-scale gene expression data

被引:0
|
作者
Urban, Lara [1 ,2 ]
Remmele, Christian W. [1 ]
Dittrich, Marcus [1 ,3 ]
Schwarz, Roland F. [4 ]
Mueller, Tobias [1 ]
机构
[1] Univ Wurzburg, Dept Bioinformat, Bioctr, Wurzburg, Germany
[2] European Mol Biol Lab, European Bioinformat Inst, Wellcome Genome Campus, Cambridge, England
[3] Univ Wurzburg, Inst Human Genet, Wurzburg, Germany
[4] Max Delbruck Ctr, Berlin Inst Med Syst Biol, Berlin, Germany
关键词
Multivariate analysis; Fourthcorner analysis; RLQ analysis; Transcriptomics; High-throughput data; Visualization; Ordination methods; RNA-Seq analysis; Microarray analysis; 4TH-CORNER;
D O I
10.1186/s13104-020-04946-1
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
ObjectiveThe biological interpretation of gene expression measurements is a challenging task. While ordination methods are routinely used to identify clusters of samples or co-expressed genes, these methods do not take sample or gene annotations into account. We aim to provide a tool that allows users of all backgrounds to assess and visualize the intrinsic correlation structure of complex annotated gene expression data and discover the covariates that jointly affect expression patterns.ResultsThe Bioconductor package covRNA provides a convenient and fast interface for testing and visualizing complex relationships between sample and gene covariates mediated by gene expression data in an entirely unsupervised setting. The relationships between sample and gene covariates are tested by statistical permutation tests and visualized by ordination. The methods are inspired by the fourthcorner and RLQ analyses used in ecological research for the analysis of species abundance data, that we modified to make them suitable for the distributional characteristics of both, RNA-Seq read counts and microarray intensities, and to provide a high-performance parallelized implementation for the analysis of large-scale gene expression data on multi-core computational systems. CovRNA provides additional modules for unsupervised gene filtering and plotting functions to ensure a smooth and coherent analysis workflow.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] covRNA: discovering covariate associations in large-scale gene expression data
    Lara Urban
    Christian W. Remmele
    Marcus Dittrich
    Roland F. Schwarz
    Tobias Müller
    [J]. BMC Research Notes, 13
  • [2] Discovering Pictorial Brand Associations from Large-Scale Online Image Data
    Kim, Gunhee
    Xing, Eric P.
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2013, : 404 - 411
  • [3] Analysis of large-scale gene expression data
    Sherlock, G
    [J]. CURRENT OPINION IN IMMUNOLOGY, 2000, 12 (02) : 201 - 205
  • [4] Interactive visualization of large-scale gene expression data
    Riveiro, Maria
    Lebram, Mikael
    Andersson, Christian X.
    Sartipy, Peter
    Synnergren, Jane
    [J]. PROCEEDINGS 2016 20TH INTERNATIONAL CONFERENCE INFORMATION VISUALISATION IV 2016, 2016, : 348 - 354
  • [5] GENE DISCOVERY METHODS FROM LARGE-SCALE GENE EXPRESSION DATA
    Shimizu, Akifumi
    Yano, Kentaro
    [J]. QUANTUM BIO-INFORMATICS III: FROM QUANTUM INFORMATION TO BIO-INFORMATICS, 2010, 26 : 489 - +
  • [6] Discovering Missing Links in Large-Scale Linked Data
    Nam Hau
    Ichise, Ryutaro
    Le, Bac
    [J]. INTELLIGENT INFORMATION AND DATABASE SYSTEMS (ACIIDS 2013), PT II, 2013, 7803 : 468 - 477
  • [7] Challenges and prospects in the analysis of large-scale gene expression data
    Ihmeis, JH
    Bergmann, S
    [J]. BRIEFINGS IN BIOINFORMATICS, 2004, 5 (04) : 313 - 327
  • [8] Automated Protocol for Large-Scale Modeling of Gene Expression Data
    Hall, Michelle Lynn
    Calkins, David
    Sherman, Woody
    [J]. Journal of Chemical Information and Modeling, 2016, 56 (11) : 2216 - 2224
  • [9] Iterative signature algorithm for the analysis of large-scale gene expression data
    Bergmann, S
    Ihmels, J
    Barkai, N
    [J]. PHYSICAL REVIEW E, 2003, 67 (03):
  • [10] Exploiting Scientific Workflows for Large-scale Gene Expression Data Analysis
    De Stasio, Alessandro
    Ertelt, Marcus
    Kemmner, Wolfgang
    Leser, Ulf
    Ceccarelli, Michele
    [J]. 2009 24TH INTERNATIONAL SYMPOSIUM ON COMPUTER AND INFORMATION SCIENCES, 2009, : 447 - +