Efficient collective communication on Heterogeneous Networks of Workstations

被引:40
|
作者
Banikazemi, M [1 ]
Moorthy, V [1 ]
Panda, DK [1 ]
机构
[1] Ohio State Univ, Dept Comp & Informat Sci, Columbus, OH 43210 USA
关键词
D O I
10.1109/ICPP.1998.708518
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Networks of Workstations (NOW) have become an attractive alternative platform for high performance computing. Due to the commodity nature of workstations and interconnects and due to the multiplicity of vendors and platforms, the NOW environments are being gradually redefined as Heterogeneous Networks of Workstations (HNOW) environments. This paper presents a new framework for implementing collective communication operations (as defined by the Message Passing Interface (MPI) standard) efficiently for the emerging HNOW environments. We first classify different types of heterogeneity in HNOW and then focus on one important characteristic: communication capabilities of workstations. Taking this characteristic into account, we propose two new approaches (Speed-Partitioned Ordered Chain (SPOC) and Fastest-Node First (FNF)) to implement collective communication operations with reduced latency. We also investigate methods for deriving optimal trees for broadcast and multicast operations. Generating such trees is shown to be computationally intensive. It is shown that the FNF approach, in spite of its simplicity, cart deliver performance within 1% of the performance of the optimal trees. Finally, these new approaches are compared with the approach used in the MPICH implementation on experimental as well as on simulated testbeds. On a 24-node existing HNOW environment with SGI workstations and ATM interconnection, our approaches reduce the latency of broadcast and multicast operations by a factor of up to 3.5 compared to the approach used in the existing MPICH implementation. On a 64-node simulated testbed, our approaches can reduce the latency of broadcast and multicast operations by a factor of up to 4.5. Thus, these results demonstrate that there is significant potential for our approaches. to be applied towards designing scalable collective communication libraries for current and future generation HNOW environments.
引用
收藏
页码:460 / 467
页数:8
相关论文
共 50 条
  • [1] Communication modeling of heterogeneous networks of workstations for performance characterization of collective operations
    Banikazemi, M
    Sampathkumar, J
    Prabhu, S
    Panda, DK
    Sadayappan, P
    [J]. (HCW '99) - EIGHTH HETEROGENEOUS COMPUTING WORKSHOP, PROCEEDINGS, 1999, : 125 - 133
  • [2] ECO: Efficient Collective Operations for communication on heterogeneous networks
    Lowekamp, BB
    Beguelin, A
    [J]. 10TH INTERNATIONAL PARALLEL PROCESSING SYMPOSIUM - PROCEEDINGS OF IPPS '96, 1996, : 399 - 405
  • [3] Efficient multicast in heterogeneous networks of workstations
    Libeskind-Hadas, R
    Hartline, J
    [J]. 2000 INTERNATIONAL WORKSHOPS ON PARALLEL PROCESSING, PROCEEDINGS, 2000, : 403 - 410
  • [4] Efficient broadcast algorithms for heterogeneous networks of workstations
    Tosun, AS
    Agarwal, A
    [J]. PARALLEL AND DISTRIBUTED COMPUTING SYSTEMS, 2000, : 256 - 261
  • [5] Efficient use of parallel libraries on heterogeneous Networks of Workstations
    Clematis, A
    Dodero, G
    Gianuzzi, V
    [J]. JOURNAL OF SYSTEMS ARCHITECTURE, 2000, 46 (08) : 641 - 653
  • [6] Heterogeneous networks of workstations
    Baek, S
    Lee, K
    Kim, J
    Morris, J
    [J]. ADVANCES IN COMPUTER SYSTEMS ARCHITECTURE, PROCEEDINGS, 2004, 3189 : 426 - 439
  • [7] Efficient collective communication in distributed heterogeneous systems
    Bhat, PB
    Raghavendra, CS
    Prasanna, VK
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2003, 63 (03) : 251 - 263
  • [8] Efficient collective communication in distributed heterogeneous systems
    Bhat, PB
    Raghavendra, CS
    Prasanna, VK
    [J]. 19TH IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, PROCEEDINGS, 1999, : 15 - 24
  • [9] Efficient collective communication in optical networks
    Bermond, JC
    Gargano, L
    Perennes, S
    Rescigno, AA
    Vaccaro, U
    [J]. THEORETICAL COMPUTER SCIENCE, 2000, 233 (1-2) : 165 - 189
  • [10] Efficient Broadcast in Heterogeneous Networks of Workstations Using Two Sub-Networks
    Chao Lin
    Jang-Ping Sheu
    [J]. International Journal of Parallel Programming, 2005, 33 : 351 - 391