Scaling Ab Initio Predictions of 3D Protein Structures in Microsoft Azure Cloud

被引:25
|
作者
Mrozek, Dariusz [1 ]
Gosk, Pawel [1 ]
Malysiak-Mrozek, Bozena [1 ]
机构
[1] Silesian Tech Univ, Inst Informat, Akad 16, PL-44100 Gliwice, Poland
关键词
Bioinformatics; Proteins; 3D protein structure; Protein structure prediction; Tertiary structure prediction; Ab initio; Protein structure modeling; Cloud computing; Distributed computing; Scalability; Microsoft Azure; SECONDARY STRUCTURE; SIMULATION; WEB; CHALLENGES; ALGORITHM; FRAMEWORK; ACIDS;
D O I
10.1007/s10723-015-9353-8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Computational methods for protein structure prediction allow us to determine a three-dimensional structure of a protein based on its pure amino acid sequence. These methods are a very important alternative to costly and slow experimental methods, like X-ray crystallography or Nuclear Magnetic Resonance. However, conventional calculations of protein structure are time-consuming and require ample computational resources, especially when carried out with the use of ab initio methods that rely on physical forces and interactions between atoms in a protein. Fortunately, at the present stage of the development of computer science, such huge computational resources are available from public cloud providers on a pay-as-you-go basis. We have designed and developed a scalable and extensible system, called Cloud4PSP, which enables predictions of 3D protein structures in the Microsoft Azure commercial cloud. The system makes use of the Warecki-Znamirowski method as a sample procedure for protein structure prediction, and this prediction method was used to test the scalability of the system. The results of the efficiency tests performed proved good acceleration of predictions when scaling the system vertically and horizontally. In the paper, we show the system architecture that allowed us to achieve such good results, the Cloud4PSP processing model, and the results of the scalability tests. At the end of the paper, we try to answer which of the scaling techniques, scaling out or scaling up, is better for solving such computational problems with the use of Cloud computing.
引用
收藏
页码:561 / 585
页数:25
相关论文
共 50 条
  • [1] Scaling Ab Initio Predictions of 3D Protein Structures in Microsoft Azure Cloud
    Dariusz Mrozek
    Paweł Gosk
    Bożena Małysiak-Mrozek
    Journal of Grid Computing, 2015, 13 : 561 - 585
  • [2] Efficient 3D Protein Structure Alignment on Large Hadoop Clusters in Microsoft Azure Cloud
    Malysiak-Mrozek, Bozena
    Danilowicz, Pawel
    Mrozek, Dariusz
    BEYOND DATABASES, ARCHITECTURES AND STRUCTURES: FACING THE CHALLENGES OF DATA PROLIFERATION AND GROWING VARIETY, 2018, 928 : 33 - 46
  • [3] Accelerating 3D Protein Structure Similarity Searching on Microsoft Azure Cloud with Local Replicas of Macromolecular Data
    Mrozek, Dariusz
    Kutyla, Tomasz
    Malysiak-Mrozek, Bozena
    PARALLEL PROCESSING AND APPLIED MATHEMATICS, PPAM 2015, PT II, 2016, 9574 : 254 - 265
  • [4] HDInsight4PSi: Boosting performance of 3D protein structure similarity searching with HDInsight clusters in Microsoft Azure cloud
    Mrozek, Dariusz
    Danilowicz, Pawel
    Malysiak-Mrozek, Bozena
    INFORMATION SCIENCES, 2016, 349 : 77 - 101
  • [5] Scaling laws of graphs of 3D protein structures
    Praznikar, Jure
    JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2021, 19 (03)
  • [6] Database Under Pressure - Scaling Database Performance Tests in Microsoft Azure Public Cloud
    Mrozek, Dariusz
    Paliga, Anna
    Malysiak-Mrozek, Bozena
    Kozielski, Stanislaw
    BEYOND DATABASES, ARCHITECTURES AND STRUCTURES, BDAS 2015, 2015, 521 : 69 - 81
  • [7] CryoDRGN2: Ab initio neural reconstruction of 3D protein structures from real cryo-EM images
    Zhong, Ellen D.
    Lerer, Adam
    Davis, Joseph H.
    Berger, Bonnie
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 4046 - 4055
  • [8] Ab Initio Quantum Chemistry for Protein Structures
    Kulik, Heather J.
    Luehr, Nathan
    Ufimtsev, Ivan S.
    Martinez, Todd J.
    JOURNAL OF PHYSICAL CHEMISTRY B, 2012, 116 (41): : 12501 - 12509
  • [9] Ab initio prediction of protein loop structures
    Lin, Xiwen
    Peng, Zhihong
    Chen, Jie
    MOLECULAR & CELLULAR PROTEOMICS, 2004, 3 (10) : S255 - S255
  • [10] Ab initio predictions for 3D structure and stability of single- and double-stranded DNAs in ion solutions
    Mu, Zi-Chun
    Tan, Ya-Lan
    Zhang, Ben-Gong
    Liu, Jie
    Shi, Ya-Zhou
    PLOS COMPUTATIONAL BIOLOGY, 2022, 18 (10)