EVE: Cloud-Based Annotation of Human Genetic Variants

Cited by: 1
Authors
Cole, Brian S. [1]
Moore, Jason H. [1]
Affiliations
[1] Univ Penn, Inst Biomed Informat, Perelman Sch Med, Dept Biostat & Epidemiol, Philadelphia, PA 19104 USA
Keywords
Annotation; GWAS; Cloud computing; Reproducibility; Infrastructure-as-Code; GENOME-WIDE ASSOCIATION; SCIENCE
DOI
10.1007/978-3-319-55849-3_6
Chinese Library Classification
TP301 [Theory, Methods]
Subject classification code
081202
Abstract
Annotation of human genetic variants enables genotype-phenotype association studies at the gene, pathway, and tissue level. Annotation results are difficult to reproduce across study sites due to shifting software versions and a lack of a unified hardware interface between study sites. Cloud computing offers a promising solution by integrating hardware and software into reproducible virtual appliances which may be utilized on-demand and shared across institutions. We developed ENSEMBL VEP on EC2 (EVE), a cloud-based virtual appliance for annotation of human genetic variants built around the ENSEMBL Variant Effect Predictor. We integrated virtual hardware infrastructure, open-source software, and publicly available genomic datasets to provide annotation capability for genetic variants in the context of genes/transcripts, Gene Ontology pathways, tissue-specific expression from the Gene Expression Atlas, miRNA annotations, minor allele frequencies from the 1000 Genomes Project and the Exome Aggregation Consortium, and deleteriousness scores from Combined Annotation Dependent Depletion. We demonstrate the utility of EVE by annotating the genetic variants in a case-control study of glaucoma. Cloud computing can reduce the difficulty of replicating complex software pipelines such as annotation pipelines across study sites. We provide a publicly available CloudFormation template of the EVE virtual appliance which can automatically provision and deploy a parameterized, preconfigured hardware/software stack ready for annotation of human genetic variants (github.com/epistasislab/EVE). This approach offers increased reproducibility in human genetic studies by providing a unified appliance to researchers across the world.
Pages: 83-95
Page count: 13