Obtaining hardware performance metrics for the BlueGene/L supercomputer

被引:0
|
作者
Mindlin, P [1 ]
Brunheroto, JR [1 ]
DeRose, L [1 ]
Moreira, JE [1 ]
机构
[1] IBM Corp, Thomas J Watson Res Ctr, Yorktown Hts, NY 10598 USA
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Hardware performance monitoring is the basis of modem performance analysis tools for application optimization. We are interested in providing such performance analysis tools for the new BlueGene/L supercomputer as early as possible, so that applications can be tuned for that machine. We are faced with two challenges in achieving that goal. First, the machine is still going through its final design and assembly stages and, therefore, it is not yet available to system and application programmers. Second, and most important, key hardware performance metrics, such as instruction counters and Level 1 cache behavior counters, are missing from the BlueGene/L architecture. Our solution to those problems has been to implement a set of nonarchitected performance counters in an instruction-set simulator of BlueGene/L, and to provide a mechanism for executing code to retrieve the value of those counters. Using that mechanism, we have ported a version of the libHPM performance analysis library. We validate our implementation by comparing our results for BlueGene/L to analytical models and to results from a real machine.
引用
收藏
页码:109 / 118
页数:10
相关论文
共 50 条
  • [21] Performance effects of node mappings on the IBM BlueGene/L machine
    Smith, BE
    Bode, B
    [J]. EURO-PAR 2005 PARALLEL PROCESSING, PROCEEDINGS, 2005, 3648 : 1005 - 1013
  • [22] Performance Characteristics of Hybrid MPI/OpenMP Scientific Applications on a Large-scale Multithreaded BlueGene/Q Supercomputer
    Wu, Xingfu
    Taylor, Valerie
    [J]. 2013 14TH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNPD 2013), 2013, : 303 - 309
  • [23] Performance Analysis with Unified Hardware Counter Metrics
    Gravelle, Brian J.
    Nystrom, William David
    Norris, Boyana
    [J]. 2022 IEEE/ACM INTERNATIONAL WORKSHOP ON PERFORMANCE MODELING, BENCHMARKING AND SIMULATION OF HIGH PERFORMANCE COMPUTER SYSTEMS (PMBS), 2022, : 60 - 70
  • [24] Massively parallel full-wave modeling of advanced packaging structures on BlueGene supercomputer
    Morsey, J. D.
    Jiang, L.
    Rubin, B. J.
    Deutsch, A.
    Surovic, C. W.
    Becker, D.
    Haridass, A.
    [J]. 58TH ELECTRONIC COMPONENTS & TECHNOLOGY CONFERENCE, PROCEEDINGS, 2008, : 1218 - +
  • [25] Job scheduling for the BlueGene/L system
    Krevat, E
    Castaños, JG
    Moreira, JE
    [J]. JOB SCHEDULING STRATEGIES FOR PARALLEL PROCESSING, 2002, 2537 : 38 - 54
  • [26] BlueGene/L: A powerful platform for simulation
    Gupta, M
    [J]. Modelling and Simulation 2003, 2003, : 552 - 552
  • [27] Performance of Dense Eigensolvers on BlueGene/Q
    Gutheil, Inge
    Muenchhalfen, Jan Felix
    Grotendorst, Johannes
    [J]. PARALLEL PROCESSING AND APPLIED MATHEMATICS (PPAM 2013), PT I, 2014, 8384 : 26 - 35
  • [28] Job scheduling for the BlueGene/L system
    Krevat, E
    Castaños, JG
    Moreira, JE
    [J]. EURO-PAR 2002 PARALLEL PROCESSING, PROCEEDINGS, 2002, 2400 : 207 - 211
  • [29] Hardware technologies for supercomputer SX-4
    Yamada, M
    Sakamoto, F
    Katoh, T
    Oguri, T
    Ikeda, H
    Yamazaki, M
    [J]. NEC RESEARCH & DEVELOPMENT, 1996, 37 (04): : 493 - 507
  • [30] Optimizing Lattice QCD Simulations on BlueGene/L
    Krieg, Stefan
    [J]. PARALLEL COMPUTING: ARCHITECTURES, ALGORITHMS AND APPLICATIONS, 2008, 15 : 543 - +