The Ensembl computing architecture

被引:12
|
作者
Cuff, JA
Coates, GMP
Cutts, TJR
Rae, M
机构
[1] Wellcome Trust Sanger Inst, Cambridge CB10 1SA, England
[2] Broad Inst, Cambridge, MA 02141 USA
关键词
D O I
10.1101/gr.1866304
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Ensembl is a software project to automatically annotate large eukaryotic genomes and release them freely into the public domain. The project currently automatically annotates 10 complete genomes. This makes very large demands on compute resources, due to the vast number of sequence comparisons that need to be executed. To circumvent the financial outlay often associated with classical supercomputing environments, farms of multiple, lower-cost machines have now become the norm and have been deployed successfully with this project. The architecture and design of farms containing hundreds of compute nodes is complex and nontrivial to implement. This study will define and explain some of the essential elements to consider when designing such systems. Server architecture and network infrastructure are discussed with a particular emphasis on solutions that worked and those that did not (often with fairly spectacular consequences). The aim of the study is to give the reader, who may be implementing a large-scale biocompute project, an insight into some of the pitfalls that may be waiting ahead.
引用
收藏
页码:971 / 975
页数:5
相关论文
共 50 条
  • [31] Cloud computing architecture
    Kim, Won
    INTERNATIONAL JOURNAL OF WEB AND GRID SERVICES, 2013, 9 (03) : 287 - 303
  • [32] THE IMPACTS OF COMPUTING ON ARCHITECTURE
    STEVENS, G
    BUILDING AND ENVIRONMENT, 1991, 26 (01) : 3 - 11
  • [33] The Ensembl Regulatory Build
    Zerbino, Daniel R.
    Wilder, Steven P.
    Johnson, Nathan
    Juettemann, Thomas
    Flicek, Paul R.
    GENOME BIOLOGY, 2015, 16
  • [34] Ensembl variation resources
    Yuan Chen
    Fiona Cunningham
    Daniel Rios
    William M McLaren
    James Smith
    Bethan Pritchard
    Giulietta M Spudich
    Simon Brent
    Eugene Kulesha
    Pablo Marin-Garcia
    Damian Smedley
    Ewan Birney
    Paul Flicek
    BMC Genomics, 11
  • [35] The Ensembl Regulatory Build
    Daniel R Zerbino
    Steven P Wilder
    Nathan Johnson
    Thomas Juettemann
    Paul R Flicek
    Genome Biology, 16
  • [36] The Ensembl Variant Effect Predictor
    McLaren, William
    Gil, Laurent
    Hunt, Sarah E.
    Riat, Harpreet Singh
    Ritchie, Graham R. S.
    Thormann, Anja
    Flicek, Paul
    Cunningham, Fiona
    GENOME BIOLOGY, 2016, 17
  • [37] Ensembl gets a Wellcome boost
    Butler, D
    NATURE, 2000, 406 (6794) : 333 - 333
  • [38] Genome information resources - developments at Ensembl
    Hammond, MP
    Birney, E
    TRENDS IN GENETICS, 2004, 20 (06) : 268 - 272
  • [39] The Ensembl Variant Effect Predictor
    William McLaren
    Laurent Gil
    Sarah E. Hunt
    Harpreet Singh Riat
    Graham R. S. Ritchie
    Anja Thormann
    Paul Flicek
    Fiona Cunningham
    Genome Biology, 17
  • [40] Accessing Livestock Resources in Ensembl
    Martin, Fergal J.
    Gall, Astrid
    Szpak, Michal
    Flicek, Paul
    FRONTIERS IN GENETICS, 2021, 12