Software-Managed Power Reduction in Infiniband Links

被引:9
|
作者
Dickov, Branimir [1 ,2 ]
Pericas, Miquel [3 ]
Carpenter, Paul M. [1 ]
Navarro, Nacho [1 ,2 ]
Ayguade, Eduard [1 ,2 ]
机构
[1] Barcelona Supercomp Ctr, Barcelona, Spain
[2] Univ Politecn Cataluna, BarcelonaTech, E-08028 Barcelona, Spain
[3] Tokyo Inst Technol, Tokyo, Japan
关键词
INTERCONNECTION NETWORKS;
D O I
10.1109/ICPP.2014.40
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The backbone of a large-scale supercomputer is the interconnection network. As compute nodes become more energy-efficient, the interconnect is accounting for an increasing proportion of the total system energy consumption. The interconnect's energy consumption is, however, only starting to receive serious attention. Some hardware-based schemes have been proposed that exploit idle periods or low utilisation, either by turning off the links or by lowering the frequency and voltage. Although these schemes are effective in certain cases, they do not have enough global information about the application's communication behaviour to efficiently manage the network power consumption. This paper proposes an alternative approach: moving the intelligence into the PMPI layer of the MPI library, and using prediction to discover repetitive patterns in the application's communication behaviour. The core of the prediction algorithm is an n-gram extraction technique, which can accurately predict not only when a link will become unused but also when it will become active again, allowing lanes to be switched off during the idle periods and switched back on again in time to avoid incurring a significant performance degradation. Many HPC applications benefit from prediction, since they have repetitive computation and communication phases. By implementing the energy-saving mechanism inside the MPI library, existing MPI programs do not need to be modified. Using an event-driven simulator, driven by representative HPC workloads, we demonstrate average energy savings in Infiniband switches up to around 33%, while the average execution time increase is only up to 1%.
引用
收藏
页码:311 / 320
页数:10
相关论文
共 50 条
  • [1] Self-Tuned Software-Managed Energy Reduction in InfiniBand Links
    Dickov, Branimir
    Carpenter, Paul M.
    Pericas, Miquel
    Ayguade, Eduard
    2015 IEEE 21ST INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2015, : 649 - 657
  • [2] Software-managed address translation
    Jacob, B
    Mudge, T
    THIRD INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE - PROCEEDINGS, 1997, : 156 - 167
  • [3] DESIGN TRADEOFFS FOR SOFTWARE-MANAGED TLBS
    UHLIG, R
    NAGLE, D
    STANLEY, T
    MUDGE, T
    SECHREST, S
    BROWN, R
    ACM TRANSACTIONS ON COMPUTER SYSTEMS, 1994, 12 (03): : 175 - 205
  • [4] Compiler-directed management of leakage power in software-managed memories
    Chen, G.
    Li, F.
    Kandemir, M.
    Ozturk, O.
    Demirkiran, I.
    IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI, PROCEEDINGS: EMERGING VLSI TECHNOLOGIES AND ARCHITECTURES, 2006, : 450 - +
  • [5] A SOFTWARE-CONTROLLED PREFETCHING MECHANISM FOR SOFTWARE-MANAGED TLBS
    PARK, JS
    AHN, GS
    MICROPROCESSING AND MICROPROGRAMMING, 1995, 41 (02): : 121 - 136
  • [6] A SURVEY: SOFTWARE-MANAGED ON-CHIP MEMORIES
    Alam, Shahid
    Horspool, Nigel
    COMPUTING AND INFORMATICS, 2015, 34 (05) : 1168 - 1200
  • [7] A fully associative software-managed cache design
    Hallnor, EG
    Reinhardt, SK
    PROCEEDING OF THE 27TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE, 2000, : 107 - 116
  • [8] Software-controlled prefetching mechanism for software-managed TLBs
    Electronics and Telecommunications, Research Inst, Taejon, Korea, Republic of
    Microprocess Microprogram, 2 (121-136):
  • [9] A Tuning Framework for Software-Managed Memory Hierarchies
    Ren, Manman
    Park, Ji Young
    Houston, Mike
    Aiken, Alex
    Daily, William J.
    PACT'08: PROCEEDINGS OF THE SEVENTEENTH INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, 2008, : 280 - 291
  • [10] In-line interrupt handling for software-managed TLBs
    Jaleel, A
    Jacob, B
    2001 INTERNATIONAL CONFERENCE ON COMPUTER DESIGN, ICCD 2001, PROCEEDINGS, 2001, : 62 - 67