Performance evaluation of iterative parallel algorithms

被引:6
|
作者
Hanuliak, Ivan [1 ]
Hanuliak, Peter [1 ]
机构
[1] Univ Zilina, Fac Control & Informat, Zilina, Slovakia
关键词
Cybernetic network of workstations; Iterative parallel algorithms; Decomposition strategy; Inter-process communication; Message passing interface; Performance evaluation; COMMUNICATION-SYSTEMS;
D O I
10.1108/03684921011021309
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Purpose - With the availability of powerful personal computers (PCs), workstations and networking devices, the recent trend in parallel computing is to connect a number of individual workstations (PC and PC symmetric multiprocessor systems (SNIP)) to solve computation-intensive tasks in parallel way on such clusters (networks of workstations (NOW), SNIP and Grid). In this sense, it is not more true to consider traditionally evolved parallel computing and distributed computing as two separate research disciplines. Current trends in high performance computing are to use NOW (and SMP) as a cheaper alternative to traditionally used massively parallel multiprocessors or supercomputers and to profit from unifying of both mentioned disciplines. The purpose of this paper is to consider the individual workstations could be so single PC as parallel computers based on modern SMP implemented within workstation. Design/methodology/approach - Such parallel systems (NOW and SMP), are connected through widely used communication standard networks and co-operate to solve one large problem. Each workstation is threatened similarly to a processing element as in a conventional multiprocessor system. But, personal processors or multiprocessors as workstations are far more powerful and flexible than the processing elements in conventional multiprocessors. To make the whole system appear to the applications as a single parallel computing engine (a virtual parallel system), run-time environments such as OpenMP, Java (SMP), message passing interface, Java (NOW) are used to provide an extra layer of abstraction. Findings - To exploit the parallel processing capability of such cluster, the application program must be paralleled. The effective way how to do it for (parallelisation strategy) belongs to a most important step in developing effective parallel algorithm (optimisation). To behaviour analysis, all overheads that have the influence to performance of parallel algorithms (architecture, computation, communication, etc.) have to be taken into account. In this paper, such complex performance evaluation of iterative parallel algorithms (IPA) and their practical implementations are discussed (Jacobi and Gauss-Seidel iteration). On real application example, the various influences in process of modelling and performance evaluation and the consequences of their distributed parallel implementations are demonstrated. Originality/value - The paper usefully shows that better load balancing can be achieved among used network nodes (performance optimisation of parallel algorithm). Generally, it claims that the parallel algorithms or their parts (processes) with more communication (similar to analyzed Gauss-Seidel parallel algorithm) will have better speed-up values using modern SMP parallel system as its parallel implementation in NOW. For the algorithms or processes with small communication overheads (similar to analysed Jacobi parallel algorithm) the other network nodes can be used based on single processors.
引用
收藏
页码:107 / 126
页数:20
相关论文
共 50 条
  • [1] Parallel SOR iterative algorithms and performance evaluation on a Linux cluster
    Zhang, CY
    Lan, H
    Ye, Y
    Estrade, BD
    PDPTA '05: Proceedings of the 2005 International Conference on Parallel and Distributed Processing Techniques and Applications, Vols 1-3, 2005, : 263 - 269
  • [2] Parallel SOR iterative algorithms and performance evaluation on a linux cluster
    Zhang, Chaoyang
    Lan, Hong
    Ye, Yang
    Estrade, Brett D.
    Proc. Int. Conf. Parallel Distrib. Process. Tech. Appl., PDPTA, (263-269):
  • [3] To performance evaluation of distributed parallel algorithms
    Hanuliak, J
    Hanuliak, I
    KYBERNETES, 2005, 34 (9-10) : 1633 - 1650
  • [4] TO COMPLEX PERFORMANCE EVALUATION OF PARALLEL ALGORITHMS
    Hanuliak, Ivan
    EIGHTH INTERNATIONAL CONFERENCE ON SOFT COMPUTING APPLIED IN COMPUTER AND ECONOMIC ENVIRONMENTS, 2010, : 21 - 27
  • [5] Performance evaluation of iterative geometric fitting algorithms
    Kanatani, Kenichi
    Sugaya, Yasuyuki
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2007, 52 (02) : 1208 - 1222
  • [6] Iterative algorithms for performance evaluation of closed network models
    Krougly, ZL
    Stanford, DA
    PERFORMANCE EVALUATION, 2005, 61 (01) : 41 - 64
  • [7] Performance evaluation of parallel iterative deepening A* on clusters of workstations
    Al-Ayyoub, AE
    PERFORMANCE EVALUATION, 2005, 60 (1-4) : 223 - 236
  • [8] Parallel chaotic GAOR iterative algorithms
    Li, Jiuzhong
    Wang, Nengchao
    Huazhong Ligong Daxue Xuebao/Journal Huazhong (Central China) University of Science and Technology, 1995, 23 (Sup):
  • [9] Parallel iterative algorithms of computer tomography
    Gubareni, N.M.
    Engineering Simulation, 1998, 15 (04): : 467 - 478
  • [10] Performance evaluation of block-iterative algorithms for SPECT reconstruction
    Liu, Chi
    Volokh, Lana
    Zhao, Xide
    Xu, Jingyan
    Lee, Taek-Soo
    Tsui, Benjamin M. W.
    2005 IEEE NUCLEAR SCIENCE SYMPOSIUM CONFERENCE RECORD, VOLS 1-5, 2005, : 1827 - 1831