Grid workflow software for a high-throughput proteome annotation pipeline

Cited by: 0
Authors
Birnbaum, A [1]
Hayes, J
Li, WW
Miller, MA
Arzberger, PW
Bourne, PE
Casanova, H
Affiliations
[1] San Diego Supercomp Ctr, La Jolla, CA 92093 USA
[2] Univ Calif San Diego, La Jolla, CA 92093 USA
Source
Keywords
DOI
Not available
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
The goal of the Encyclopedia of Life (EOL) Project is to predict structural information for all proteins in all organisms. This calculation presents challenges both in the scale of the computational resources required (approximately 1.8 million CPU hours) and in data and workflow management. While tools are available that solve some subsets of these problems, it was necessary for us to build software to integrate and manage the overall Grid application execution. In this paper, we present this workflow system, detail its components, and report on the performance of our initial prototype implementation for runs over a large-scale Grid platform during the SC'03 conference.
Pages: 68-81
Page count: 14
Related papers
50 in total
  • [1] Grid portal interface for interactive use and monitoring of high-throughput Proteome annotation
    Shahab, A
    Chuon, D
    Suzumua, T
    Li, WW
    Byrnes, RW
    Tanaka, K
    Ang, L
    Matsuoka, S
    Bourne, PE
    Miller, MA
    Arzberger, PW
    [J]. GRID COMPUTING IN LIFE SCIENCE, 2005, 3370 : 53 - 67
  • [2] Simple high-throughput annotation pipeline (SHAP)
    DeMaere, Matthew Z.
    Lauro, Federico M.
    Thomas, Torsten
    Yau, Sheree
    Cavicchioli, Ricardo
    [J]. BIOINFORMATICS, 2011, 27 (17) : 2431 - 2432
  • [3] PIPA: A High-Throughput Pipeline for Protein Function Annotation
    Yu, Chenggang
    Desai, Valmik
    Zavaljevski, Nela
    Reifman, Jaques
    [J]. PROCEEDINGS OF THE HPCMP USERS GROUP CONFERENCE 2008, 2008, : 241 - 246
  • [4] Protein surface analysis for function annotation in high-throughput structural genomics pipeline
    Binkowski, TA
    Joachimiak, A
    Liang, J
    [J]. PROTEIN SCIENCE, 2005, 14 (12) : 2972 - 2981
  • [5] Structural genomics of the Thermotoga maritima proteome implemented in a high-throughput structure determination pipeline
    Lesley, SA
    Kuhn, P
    Godzik, A
    Deacon, AM
    Mathews, I
    Kreusch, A
    Spraggon, G
    Klock, HE
    McMullan, D
    Shin, T
    Vincent, J
    Robb, A
    Brinen, LS
    Miller, MD
    McPhillips, TM
    Miller, MA
    Scheibe, D
    Canaves, JM
    Guda, C
    Jaroszewski, L
    Selby, TL
    Elsliger, MA
    Wooley, J
    Taylor, SS
    Hodgson, KO
    Wilson, IA
    Schultz, PG
    Stevens, RC
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (18) : 11664 - 11669
  • [6] A high-throughput pipeline for validation of antibodies
    Sikorski, Krzysztof
    Mehta, Adi
    Inngjerdingen, Marit
    Thakor, Flourina
    Kling, Simon
    Kalina, Tomas
    Nyman, Tuula A.
    Stensland, Maria Ekman
    Zhou, Wei
    de Souza, Gustavo A.
    Holden, Lars
    Stuchly, Jan
    Templin, Markus
    Lund-Johansen, Fridtjof
    [J]. NATURE METHODS, 2018, 15 (11) : 909 - 912
  • [7] iTree: a high-throughput phylogenomic pipeline
    Moustafa, Ahmed
    Bhattacharya, Debashish
    Allen, Andrew E.
    [J]. 2010 5TH CAIRO INTERNATIONAL BIOMEDICAL ENGINEERING CONFERENCE (CIBEC 2010), 2010, : 103 - 107
  • [8] A high-throughput pipeline for validation of antibodies
    Krzysztof Sikorski
    Adi Mehta
    Marit Inngjerdingen
    Flourina Thakor
    Simon Kling
    Tomas Kalina
    Tuula A. Nyman
    Maria Ekman Stensland
    Wei Zhou
    Gustavo A. de Souza
    Lars Holden
    Jan Stuchly
    Markus Templin
    Fridtjof Lund-Johansen
    [J]. Nature Methods, 2018, 15 : 909 - 912
  • [9] WGSSAT: A High-Throughput Computational Pipeline for Mining and Annotation of SSR Markers From Whole Genomes
    Pandey, Manmohan
    Kumar, Ravindra
    Srivastava, Prachi
    Agarwal, Suyash
    Srivastava, Shreya
    Nagpure, Naresh S.
    Jena, Joy K.
    Kushwaha, Basdeo
    [J]. JOURNAL OF HEREDITY, 2018, 109 (03) : 339 - 343
  • [10] VISPA2: a scalable pipeline for high-throughput identification and annotation of vector integration sites
    Spinozzi, Giulio
    Calabria, Andrea
    Brasca, Stefano
    Beretta, Stefano
    Merelli, Ivan
    Milanesi, Luciano
    Montini, Eugenio
    [J]. BMC BIOINFORMATICS, 2017, 18