Dynamic balancing of communication and computation load for HLA-based simulations on large-scale distributed systems

被引:25
|
作者
De Grande, Robson E. [1 ]
Boukerche, Azzedine [1 ]
机构
[1] Univ Ottawa, Sch Informat & Technol Engn, PARADISE Res Lab, Ottawa, ON, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Parallel simulations; High Level Architecture; Dynamic load balancing; Performance; MIGRATION;
D O I
10.1016/j.jpdc.2010.04.001
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Dynamic balancing of computation and communication load is vital for the execution stability and performance of distributed, parallel simulations deployed on the shared, unreliable resources of large-scale environments. High Level Architecture (HLA) based simulations can experience a decrease in performance due to imbalances that are produced initially and/or during run time. These imbalances are generated by the dynamic load changes of distributed simulations or by unknown, non-managed background processes resulting from the non-dedication of shared resources. Due to the dynamic execution characteristics of elements that compose distributed applications, the computational load and interaction dependencies of each simulation entity change during run time. These dynamic changes lead to an irregular load and communication distribution, which increases overhead of resources and latencies. A static partitioning of load is limited to deterministic applications and is incapable of predicting the dynamic changes caused by distributed applications or by external background processes. Therefore, a scheme for balancing the communication and computational load during the execution of distributed simulations is devised in a scalable hierarchical architecture. The proposed balancing system employs local and cluster monitoring mechanisms in order to observe the distributed load changes and identify imbalances, repartitioning policies to determine a distribution of load and minimize imbalances. A migration technique is also employed by this proposed balancing system to perform reliable and low-latency load transfers. Such a system successfully improves the use of shared resources and increases distributed simulations' performance by minimizing communication latencies and partitioning the load evenly. Experiments and comparative analyses were conducted in order to identify the gains that the proposed balancing scheme provides to large-scale distributed simulations. (C) 2010 Elsevier Inc. All rights reserved.
引用
收藏
页码:40 / 52
页数:13
相关论文
共 50 条
  • [1] A Dynamic, Distributed, Hierarchical Load Balancing for HLA-Based Simulations on Large-Scale Environments
    De Grande, Robson Eduardo
    Boukerche, Azzedine
    [J]. EURO-PAR 2010 PARALLEL PROCESSING, PT I, 2010, 6271 : 242 - 253
  • [2] Dynamic Load Balancing Using Grid Services for HLA-Based Simulations on Large-Scale Distributed Systems
    Boukerche, Azzedine
    de Grande, Robson Eduardo
    [J]. 13TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON DISTRIBUTED SIMULATION AND REAL-TIME APPLICATIONS, PROCEEDINGS, 2009, : 175 - 183
  • [3] Self-Adaptive Dynamic Load Balancing for Large-Scale HLA-based Simulations
    De Grande, Robson Eduardo
    Boukerche, Azzedine
    [J]. 14TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON DISTRIBUTED SIMULATION AND REAL-TIME APPLICATIONS (DS-RT 2010), 2010, : 14 - 21
  • [4] Autonomous Configuration Scheme in a Distributed Load Balancing System for HLA-based Simulations
    De Grande, Robson Eduardo
    Almulla, Mohammed
    Boukerche, Azzedine
    [J]. 17TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON DISTRIBUTED SIMULATION AND REAL TIME APPLICATIONS (DS-RT 2013), 2013, : 169 - 176
  • [5] Optimized Federate Migration for Large-Scale HLA-based Simulations
    Boukerche, Azzedine
    De Grande, Robson Eduardo
    [J]. DS-RT 2008: 12TH 2008 IEEE/ACM INTERNATIONAL SYMPOSIUM ON DISTRIBUTED SIMULATION AND REAL TIME APPLICATIONS, PROCEEDINGS, 2008, : 227 - 235
  • [6] Distributed re-arrangement scheme for balancing computational load and minimizing communication delays in HLA-based simulations
    De Grande, Robson Eduardo
    Boukerche, Azzedine
    Ramadan, Hussam
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2013, 25 (05): : 626 - 648
  • [7] Efficient load balancing schemes for large-scale real-time HLA/RTI based distributed simulations
    Boukerche, Azzedine
    Shadid, Ahmad
    Zhang, Ming
    [J]. DS-RT 2007: 11TH IEEE INTERNATIONAL SYMPOSIUM ON DISTRIBUTED SIMULATION AND REAL-TIME APPLICATIONS, PROCEEDINGS, 2007, : 103 - 112
  • [8] An HLA-based framework for simulation of large-scale critical systems
    Ficco, Massimo
    Avolio, Giovanni
    Palmieri, Francesco
    Castiglione, Aniello
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2016, 28 (02): : 400 - 419
  • [9] An Adaptive Dynamic Load Balancing For Large Scale Distributed and Virtual Simulations
    Boukerche, Azzedine
    Zhang, Ming
    Xie, Hengheng
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON VIRTUAL ENVIRONMENTS, HUMAN-COMPUTER INTERFACES AND MEASUREMENT SYSTEMS, 2009, : 96 - 100
  • [10] Enhancing Load Balancing Efficiency Based on Migration Delay for Large-Scale Distributed Simulations
    Alghamdi, Turki G.
    De Grande, Robson Eduardo
    Boukerche, Azzedine
    [J]. 2015 IEEE/ACM 19TH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED SIMULATION AND REAL TIME APPLICATIONS (DS-RT), 2015, : 33 - 40