Software-Managed Power Reduction in Infiniband Links

被引:9
|
作者
Dickov, Branimir [1 ,2 ]
Pericas, Miquel [3 ]
Carpenter, Paul M. [1 ]
Navarro, Nacho [1 ,2 ]
Ayguade, Eduard [1 ,2 ]
机构
[1] Barcelona Supercomp Ctr, Barcelona, Spain
[2] Univ Politecn Cataluna, BarcelonaTech, E-08028 Barcelona, Spain
[3] Tokyo Inst Technol, Tokyo, Japan
关键词
INTERCONNECTION NETWORKS;
D O I
10.1109/ICPP.2014.40
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The backbone of a large-scale supercomputer is the interconnection network. As compute nodes become more energy-efficient, the interconnect is accounting for an increasing proportion of the total system energy consumption. The interconnect's energy consumption is, however, only starting to receive serious attention. Some hardware-based schemes have been proposed that exploit idle periods or low utilisation, either by turning off the links or by lowering the frequency and voltage. Although these schemes are effective in certain cases, they do not have enough global information about the application's communication behaviour to efficiently manage the network power consumption. This paper proposes an alternative approach: moving the intelligence into the PMPI layer of the MPI library, and using prediction to discover repetitive patterns in the application's communication behaviour. The core of the prediction algorithm is an n-gram extraction technique, which can accurately predict not only when a link will become unused but also when it will become active again, allowing lanes to be switched off during the idle periods and switched back on again in time to avoid incurring a significant performance degradation. Many HPC applications benefit from prediction, since they have repetitive computation and communication phases. By implementing the energy-saving mechanism inside the MPI library, existing MPI programs do not need to be modified. Using an event-driven simulator, driven by representative HPC workloads, we demonstrate average energy savings in Infiniband switches up to around 33%, while the average execution time increase is only up to 1%.
引用
收藏
页码:311 / 320
页数:10
相关论文
共 50 条
  • [41] Power Reduction in Network On chip Links
    Behere, Chetan S.
    Gugulothu, Somulu
    2014 INTERNATIONAL CONFERENCE ON GREEN COMPUTING COMMUNICATION AND ELECTRICAL ENGINEERING (ICGCCEE), 2014,
  • [42] Power reduction of on-chip serial links
    Kedia, Amit
    Saleh, Resve
    2007 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-11, 2007, : 865 - 868
  • [43] InfiniBand performance review: It's the software stupid
    Benjegerdes, TR
    Bode, BM
    USENIX ASSOCIATION PROCEEDINGS OF THE FREENIX TRACK 2004 USENIX ANNUAL TECHNICAL CONFERENCE, 2004, : 219 - 224
  • [44] The Yin and Yang of Power and Performance for Asymmetric Hardware and Managed Software
    Cao, Ting
    Blackburn, Stephen M.
    Gao, Tiejun
    McKinley, Kathryn S.
    2012 39TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA), 2012, : 225 - 236
  • [45] ViSMI: Software distributed shared memory for InfiniBand clusters
    Osendorfer, C
    Tao, J
    Trinitis, C
    Mairandres, M
    THIRD IEEE INTERNATIONAL SYMPOSIUM ON NETWORK COMPUTING AND APPLICATIONS, PROCEEDINGS, 2004, : 185 - 191
  • [46] The Missing Links in Software Estimation Team Loading and Team Power
    Gencel, Cigdem
    Buglione, Luigi
    PROCEEDINGS OF 2016 JOINT CONFERENCE OF THE INTERNATIONAL WORKSHOP ON SOFTWARE MEASUREMENT AND THE INTERNATIONAL CONFERENCE ON SOFTWARE PROCESS AND PRODUCT MEASUREMENT (IWSM-MENSURA), 2016, : 212 - 212
  • [47] Software directed issue queue power reduction
    Jones, TM
    O'Boyle, MFP
    Abella, J
    González, A
    11TH INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE, PROCEEDINGS, 2005, : 144 - 153
  • [48] Implementation of the software distributed shared-memory system on the InfiniBand
    Park, I
    Choi, HW
    Han, Y
    Hwang, S
    Kim, SW
    Park, K
    PDPTA '04: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS 1-3, 2004, : 1273 - 1279
  • [49] Analysis of the memory registration process in the Mellanox InfiniBand software stack
    Mietke, Frank
    Rex, Robert
    Baumgartl, Robert
    Mehlan, Torsten
    Hoefler, Torsten
    Rehm, Wolfgang
    EURO-PAR 2006 PARALLEL PROCESSING, 2006, 4128 : 124 - 133
  • [50] Efficient exploitation of kernel access to Infiniband: A software DSM example
    Liss, L
    Birk, Y
    Schuster, A
    HOT INTERCONNECTS 11, 2003, : 130 - 135