A Framework to Analyze the Performance of Load Balancing Schemes for Ensembles of Stochastic Simulations

被引:0
|
作者
Tae-Hyuk Ahn
Adrian Sandu
Layne T. Watson
Clifford A. Shaffer
Yang Cao
William T. Baumann
机构
[1] Oak Ridge National Laboratory,Computer Science and Mathematics Division
[2] Virginia Polytechnic Institute and State University,Department of Computer Science
[3] Virginia Polytechnic Institute and State University,Departments of Computer Science, Mathematics, and Aerospace and Ocean Engineering
[4] Virginia Polytechnic Institute and State University,Department of Electrical and Computer Engineering
关键词
Dynamic load balancing (DLB); Probabilistic framework analysis; Ensemble simulations; Stochastic simulation algorithm (SSA); High-performance computing (HPC); Budding yeast cell cycle;
D O I
暂无
中图分类号
学科分类号
摘要
Ensembles of simulations are employed to estimate the statistics of possible future states of a system, and are widely used in important applications such as climate change and biological modeling. Ensembles of runs can naturally be executed in parallel. However, when the CPU times of individual simulations vary considerably, a simple strategy of assigning an equal number of tasks per processor can lead to serious work imbalances and low parallel efficiency. This paper presents a new probabilistic framework to analyze the performance of dynamic load balancing algorithms for ensembles of simulations where many tasks are mapped onto each processor, and where the individual compute times vary considerably among tasks. Four load balancing strategies are discussed: most-dividing, all-redistribution, random-polling, and neighbor-redistribution. Simulation results with a stochastic budding yeast cell cycle model are consistent with the theoretical analysis. It is especially significant that there is a provable global decrease in load imbalance for the local rebalancing algorithms due to scalability concerns for the global rebalancing algorithms. The overall simulation time is reduced by up to 25 %, and the total processor idle time by 85 %.
引用
收藏
页码:597 / 630
页数:33
相关论文
共 50 条
  • [41] ON TUNING THE DYNAMIC LOAD BALANCING FEM FRAMEWORK
    Bosansky, Michal
    Patzak, Borek
    NANO & MACRO MECHANICS (NMM 2019), 2020, 26 : 7 - 12
  • [42] A Load Balancing Framework for Clustered Storage Systems
    Kunkle, Daniel
    Schindler, Jiri
    HIGH PERFORMANCE COMPUTING - HIPC 2008, PROCEEDINGS, 2008, 5374 : 57 - 72
  • [43] A load balancing framework for adaptive and asynchronous applications
    Barker, K
    Chernikov, A
    Chrisochoides, N
    Pingali, K
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2004, 15 (02) : 183 - 192
  • [44] A framework for performance monitoring, load balancing, adaptive timeouts and quality of service in digital libraries
    Sarantos Kapidakis
    Sotirios Terzis
    Jakka Sairamesh
    International Journal on Digital Libraries, 2000, 3 (1) : 19 - 35
  • [45] Efficient load balancing schemes for large-scale real-time HLA/RTI based distributed simulations
    Boukerche, Azzedine
    Shadid, Ahmad
    Zhang, Ming
    DS-RT 2007: 11TH IEEE INTERNATIONAL SYMPOSIUM ON DISTRIBUTED SIMULATION AND REAL-TIME APPLICATIONS, PROCEEDINGS, 2007, : 103 - 112
  • [46] MCdevelop - a universal framework for Stochastic Simulations
    Slawinska, M.
    Jadach, S.
    COMPUTER PHYSICS COMMUNICATIONS, 2011, 182 (03) : 748 - 762
  • [47] A runtime adaptive load balancing algorithm for particle simulations
    Dixon, MF
    APPLIED PARALLEL COMPUTING: STATE OF THE ART IN SCIENTIFIC COMPUTING, 2006, 3732 : 709 - 718
  • [48] Dynamic Load Balancing Techniques for Particulate Flow Simulations
    Rettinger, Christoph
    Ruede, Ulrich
    COMPUTATION, 2019, 7 (01):
  • [49] Partition deactivation with load balancing for parallel flow simulations
    Bakosi, J.
    JOURNAL OF COMPUTATIONAL PHYSICS, 2024, 519
  • [50] Load Balancing for Molecular Dynamics Simulations on Heterogeneous Architectures
    Seckler, Steffen
    Tchipev, Nikola
    Bungartz, Hans-Joachim
    Neumann, Philipp
    PROCEEDINGS OF 2016 IEEE 23RD INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING (HIPC), 2016, : 101 - 110