The Ensembl computing architecture

被引:12
|
作者
Cuff, JA
Coates, GMP
Cutts, TJR
Rae, M
机构
[1] Wellcome Trust Sanger Inst, Cambridge CB10 1SA, England
[2] Broad Inst, Cambridge, MA 02141 USA
关键词
D O I
10.1101/gr.1866304
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Ensembl is a software project to automatically annotate large eukaryotic genomes and release them freely into the public domain. The project currently automatically annotates 10 complete genomes. This makes very large demands on compute resources, due to the vast number of sequence comparisons that need to be executed. To circumvent the financial outlay often associated with classical supercomputing environments, farms of multiple, lower-cost machines have now become the norm and have been deployed successfully with this project. The architecture and design of farms containing hundreds of compute nodes is complex and nontrivial to implement. This study will define and explain some of the essential elements to consider when designing such systems. Server architecture and network infrastructure are discussed with a particular emphasis on solutions that worked and those that did not (often with fairly spectacular consequences). The aim of the study is to give the reader, who may be implementing a large-scale biocompute project, an insight into some of the pitfalls that may be waiting ahead.
引用
收藏
页码:971 / 975
页数:5
相关论文
共 50 条
  • [41] Accessing Livestock Resources in Ensembl
    Martin, Fergal J.
    Gall, Astrid
    Szpak, Michal
    Flicek, Paul
    FRONTIERS IN GENETICS, 2021, 12
  • [42] Triticeae Resources in Ensembl Plants
    Bolser, Dan M.
    Kerhornou, Arnaud
    Walts, Brandon
    Kersey, Paul
    PLANT AND CELL PHYSIOLOGY, 2015, 56 (01) : e3
  • [43] The ensembl core software libraries
    Stabenau, A
    McVicker, G
    Melsopp, C
    Proctor, G
    Clamp, M
    Birney, E
    GENOME RESEARCH, 2004, 14 (05) : 929 - 933
  • [44] The Ensembl gene annotation system
    Aken, Bronwen L.
    Ayling, Sarah
    Barrell, Daniel
    Clarke, Laura
    Curwen, Valery
    Fairley, Susan
    Banet, Julio Fernandez
    Billis, Konstantinos
    Giron, Carlos Garcia
    Hourlier, Thibaut
    Howe, Kevin
    Kahari, Andreas
    Kokocinski, Felix
    Martin, Fergal J.
    Murphy, Daniel N.
    Nag, Rishi
    Ruffier, Magali
    Schuster, Michael
    Tang, Y. Amy
    Vogel, Jan-Hinnerk
    White, Simon
    Zadissa, Amonida
    Flicek, Paul
    Searle, Stephen M. J.
    DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2016,
  • [45] The Ensembl genome database project
    Hubbard, T
    Barker, D
    Birney, E
    Cameron, G
    Chen, Y
    Clark, L
    Cox, T
    Cuff, J
    Curwen, V
    Down, T
    Durbin, R
    Eyras, E
    Gilbert, J
    Hammond, M
    Huminiecki, L
    Kasprzyk, A
    Lehvaslaiho, H
    Lijnzaad, P
    Melsopp, C
    Mongin, E
    Pettett, R
    Pocock, M
    Potter, S
    Rust, A
    Schmidt, E
    Searle, S
    Slater, G
    Smith, J
    Spooner, W
    Stabenau, A
    Stalker, J
    Stupka, E
    Ureta-Vidal, A
    Vastrik, I
    Clamp, M
    NUCLEIC ACIDS RESEARCH, 2002, 30 (01) : 38 - 41
  • [46] HP scalable computing architecture
    Wright, R
    Kumar, A
    USENIX ASSOCIATION PROCEEDINGS OF THE FIRST WORKSHOP ON INDUSTRIAL EXPERIENCES WITH SYSTEMS SOFTWARE (WIESS 2000), 2000, : 21 - 30
  • [47] Grid computing architecture: A roadmap
    Tuthill, Henri B.
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2006, 231
  • [48] The social architecture of community computing
    Batteau, AW
    MAKING UNIVERSAL SERVICE POLICY: ENHANCING THE PROCESS THROUGH MULTIDISCIPLINARY EVALUATION, 1999, : 85 - 98
  • [49] Architecture of a quantum computing platform
    Ismagilov, Marat
    Sayfutdinov, Rustam
    Vasiliev, Alexander
    INTERNATIONAL CONFERENCE ON COMPUTER SIMULATION IN PHYSICS AND BEYOND, 2019, 1163
  • [50] A network architecture for mobile computing
    Brown, K
    Singh, S
    IEEE INFOCOM '96 - FIFTEENTH ANNUAL JOINT CONFERENCE OF THE IEEE COMPUTER AND COMMUNICATIONS SOCIETIES: NETWORKING THE NEXT GENERATION, PROCEEDINGS VOLS 1-3, 1996, : 1388 - 1396