Exploring the relationship between parallel application run-time variability and network performance in clusters

Authors
Evans, JJ [1]
Hood, CS [1]
Gropp, WD [1]
Affiliations
[1] Purdue Univ, Dept Elect & Comp Engn Technol, W Lafayette, IN 47907 USA
Keywords
cluster; Myrinet; application run-time sensitivity; network performance
DOI
Not available
Chinese Library Classification (CLC) number
TN [Electronic technology, communication technology]
Discipline classification code
0809
Abstract
Highly variable parallel application execution time is a persistent issue in cluster computing environments and can be particularly acute in systems composed of Networks of Workstations (NOWs). We examine this issue in terms of consistency, focusing in particular on network performance. As a step toward using techniques from fault management to attain consistency, this paper presents our preliminary analysis of run-time variability from logs and experiments, exposing important issues related to systemic inconsistency in NOW clusters. The characterization of application sensitivity can be used to set network performance goals, thereby defining operational requirements. Network performance depends on the virtual topology imposed by the scheduler's allocation of nodes and on the communication patterns of the set of running applications. It is therefore important to look at both the network and the cluster's centralized node mapper (scheduler) as critical subsystems.
Pages: 538-547
Page count: 10