Improving performance via computational replication on a large-scale computational grid

被引:15
|
作者
Li, YH [1 ]
Mascagni, M [1 ]
机构
[1] Florida State Univ, Dept Comp Sci, Tallahassee, FL 32306 USA
关键词
D O I
10.1109/CCGRID.2003.1199399
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
High performance computing on a large-scale computational grid is complicated by the heterogeneous computational capabilities of each node, node unavailability, and unreliable network connectivity. Replicating computation on multiple nodes can significantly improve performance by reducing task completion time on a grid's dynamic environment. We develop an analytical model to determine the number of task replicas to meet the performance goals in different computational grid configurations. Furthermore, taking advantage of the statistical nature of grid-based Monte Carlo applications, we extend the computational replication technique to an N-out-of-M scheduling strategy for grid-based Monte Carlo applications, which can potentially form a large category of grid-computing applications. In addition, we establish a corresponding model for the N-out-of-M scheduling mechanism. Simulations are used to validate the computational replication models. Our preliminary results show that the models we use are effective in predicting the required number of replicas to achieve short task completion time with a given high probability.
引用
收藏
页码:442 / 448
页数:7
相关论文
共 50 条
  • [21] Improving Material Property Prediction by Leveraging the Large-Scale Computational Database and Deep Learning
    Chen, Pin
    Chen, Jianwen
    Yan, Hui
    Mo, Qing
    Xu, Zexin
    Liu, Jinyu
    Zhang, Wenqing
    Yang, Yuedong
    Lu, Yutong
    [J]. JOURNAL OF PHYSICAL CHEMISTRY C, 2022, 126 (38): : 16297 - 16305
  • [22] Resource-aware distributed scheduling strategies for large-scale computational Cluster/Grid systems
    Viswanathan, Sivakumar
    Veeravalli, Bharadwaj
    Robertazzi, Thomas G.
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2007, 18 (10) : 1450 - 1461
  • [23] Application-aware deadline constraint job scheduling mechanism on large-scale computational grid
    Tang, Xiaoyong
    Liao, Xiaoyi
    [J]. PLOS ONE, 2018, 13 (11):
  • [24] Improving the computational efficiency of rotating sound source localization via compression computational grid method
    Wang, Jiayu
    Zhang, Ce
    Ma, Wei
    [J]. Hangkong Dongli Xuebao/Journal of Aerospace Power, 2021, 36 (01): : 176 - 184
  • [25] Large-Scale Computational Modeling of Genetic Regulatory Networks
    M. Stetter
    G. Deco
    M. Dejori
    [J]. Artificial Intelligence Review, 2003, 20 : 75 - 93
  • [26] Parallelization of a large-scale computational earthquake simulation program
    Tiampo, KF
    Rundle, JB
    Hopper, P
    Martins, JS
    Gross, S
    McGinnis, S
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2002, 14 (6-7): : 531 - 550
  • [27] Challenges in Large-Scale Computational Mass Spectrometry and Multiomics
    Kohlbacher, Oliver
    Vitek, Olga
    [J]. JOURNAL OF PROTEOME RESEARCH, 2016, 15 (03) : 681 - 682
  • [28] Challenges in the automatic parallelization of large-scale computational applications
    Armstrong, B
    Eigenmann, R
    [J]. COMMERCIAL APPLICATIONS FOR HIGH-PERFORMANCE COMPUTING, 2001, 4528 : 50 - 60
  • [29] A method of substructuring large-scale computational micromechanical problems
    Zohdi, TI
    Wriggers, P
    Huet, C
    [J]. COMPUTER METHODS IN APPLIED MECHANICS AND ENGINEERING, 2001, 190 (43-44) : 5639 - 5656
  • [30] Advances in computational methods for large-scale structural optimization
    Papadrakakis, M
    Lagaros, ND
    [J]. COMPUTATIONAL MECHANICS FOR THE TWENTY-FIRST CENTURY, 2000, : 431 - 449