Software-Managed Power Reduction in Infiniband Links

Cited by: 9
Authors
Dickov, Branimir [1 ,2 ]
Pericas, Miquel [3 ]
Carpenter, Paul M. [1 ]
Navarro, Nacho [1 ,2 ]
Ayguade, Eduard [1 ,2 ]
Affiliations
[1] Barcelona Supercomp Ctr, Barcelona, Spain
[2] Univ Politecn Cataluna, BarcelonaTech, E-08028 Barcelona, Spain
[3] Tokyo Inst Technol, Tokyo, Japan
Keywords
INTERCONNECTION NETWORKS;
DOI
10.1109/ICPP.2014.40
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202 ;
Abstract
The backbone of a large-scale supercomputer is the interconnection network. As compute nodes become more energy-efficient, the interconnect accounts for an increasing proportion of the total system energy consumption. The interconnect's energy consumption is, however, only starting to receive serious attention. Some hardware-based schemes have been proposed that exploit idle periods or low utilisation, either by turning off the links or by lowering the frequency and voltage. Although these schemes are effective in certain cases, they do not have enough global information about the application's communication behaviour to efficiently manage the network power consumption. This paper proposes an alternative approach: moving the intelligence into the PMPI layer of the MPI library, and using prediction to discover repetitive patterns in the application's communication behaviour. The core of the prediction algorithm is an n-gram extraction technique, which can accurately predict not only when a link will become unused but also when it will become active again, allowing lanes to be switched off during the idle periods and switched back on again in time to avoid incurring a significant performance degradation. Many HPC applications benefit from prediction, since they have repetitive computation and communication phases. Because the energy-saving mechanism is implemented inside the MPI library, existing MPI programs do not need to be modified. Using an event-driven simulator, driven by representative HPC workloads, we demonstrate average energy savings in Infiniband switches of up to around 33%, while the average execution time increases by at most 1%.
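The abstract names n-gram extraction as the core of the predictor but gives no implementation details. The following is a minimal Python sketch of the general idea only, not the authors' algorithm. It assumes a hypothetical trace of `(call_name, idle_gap_us)` pairs, where the gap is the link idle time observed after each MPI call, and a hypothetical rule that lanes are worth switching off only when the predicted gap exceeds twice the lane reactivation latency. All function names, the trace format, and the threshold are assumptions made for illustration.

```python
from collections import defaultdict
from statistics import mean


def build_gap_table(trace, n):
    """Map each n-gram of call names to the idle gaps that followed it.

    trace: list of (call_name, idle_gap_us) pairs (hypothetical format).
    """
    if n < 1:
        raise ValueError("n must be at least 1")
    table = defaultdict(list)
    for i in range(n - 1, len(trace)):
        # The n-gram ends at call i; the gap after call i is what we learn.
        context = tuple(name for name, _ in trace[i - n + 1:i + 1])
        table[context].append(trace[i][1])
    return table


def predict_gap(table, recent_calls, n):
    """Predict the next idle gap as the mean gap seen after the same n-gram."""
    gaps = table.get(tuple(recent_calls[-n:]))
    return mean(gaps) if gaps else None


def should_power_down(predicted_gap_us, reactivation_us):
    """Assumed rule: power down only if the gap is over twice the wake-up cost."""
    return predicted_gap_us is not None and predicted_gap_us > 2 * reactivation_us


# A repetitive communication phase, as in many iterative HPC codes.
trace = [("MPI_Isend", 5), ("MPI_Waitall", 900), ("MPI_Allreduce", 10)] * 3
table = build_gap_table(trace, n=2)
gap = predict_gap(table, ["MPI_Isend", "MPI_Waitall"], n=2)
print(gap, should_power_down(gap, reactivation_us=100))
```

In this sketch, a context that has never been observed yields no prediction, so the lanes stay on. That is a conservative default chosen for the example rather than something stated in the abstract.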
Pages: 311-320
Page count: 10