DeepHiR : Improving High-radix Router Throughput with Deep Hybrid Memory Buffer Microarchitecture

被引:4
|
作者
Li, Cunlu [1 ]
Dong, Dezun [1 ]
Liao, Xiang-Ke [1 ]
Kim, John [2 ]
Kim, Changhyun [2 ]
机构
[1] Natl Univ Def Technol, Coll Comp, Changsha, Peoples R China
[2] Korea Adv Inst Sci & Technol, Sch Elect Engn, Daejeon, South Korea
基金
新加坡国家研究基金会;
关键词
ARCHITECTURE; INTERCONNECT; CACHE;
D O I
10.1145/3330345.3330381
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Hierarchical high-radix router microarchitecture consisting of small SRAM-based intermediate buffers have been used in large-scale supercomputers interconnection networks. While hierarchical organization enables efficient scaling to higher switch port count, it requires intermediate buffers that can cause performance bottleneck. Shallow intermediate buffers can cause head-of-line blocking and result in backpressure towards the input buffers to reduce overall performance. Increasing intermediate buffer size overcomes this problem but is infeasible since the amount of intermediate buffer is proportional to O(p(2)) where p is the router radix. Adopting new memory technology with higher density can increase intermediate buffer size but is not practical in decentralized, small-size intermediate buffers. In this work, we propose to organise the decentralized intermediate buffers as centralized buffers and leverage alternate memory technology to increase the buffer capacity for high-radix routers. In particular, we exploit Spin-Torque Transfer Magnetic RAM (STT-MRAM) to provide high-density and increase intermediate buffer depths while providing near-zero leakage power. STT-MRAM does result in significant overhead with larger amount of write/read ports that is necessary to support speedup. To overcome this cost, we propose DeepHiR, a novel deep hybrid buffer organization (STT-MRAM and SRAM) combined with a centralized buffer organization to provide high performance with minimal cost.
引用
收藏
页码:403 / 413
页数:11
相关论文
共 27 条
  • [21] High-Throughput In-Memory Computing for Binary Deep Neural Networks With Monolithically Integrated RRAM and 90-nm CMOS
    Yin, Shihui
    Sun, Xiaoyu
    Yu, Shimeng
    Seo, Jae-Sun
    IEEE TRANSACTIONS ON ELECTRON DEVICES, 2020, 67 (10) : 4185 - 4192
  • [22] Improving throughput and image quality of high-resolution 3D X-ray microscopes using deep learning reconstruction techniques
    Villarraga-Gómez, Herminso
    Rad, Mansoureh Norouzi
    Andrew, Matthew
    Andreyev, Andriy
    Sanapala, Ravikumar
    Omlor, Lars
    Vom Hagen, Christoph Graf
    e-Journal of Nondestructive Testing, 2022, 27 (03):
  • [23] A Low-Power High-Throughput In-Memory CMOS-ReRAM Accelerator for Large-Scale Deep Residual Neural Networks
    Cheng, Yuan
    Wong, Ngai
    Liu, Xiong
    Ni, Leibin
    Chen, Hai-Bao
    Yu, Hao
    2019 IEEE 13TH INTERNATIONAL CONFERENCE ON ASIC (ASICON), 2019,
  • [24] Deep Reinforcement Learning-Based Optimal and Fast Hybrid Equalizer Design Method for High-Bandwidth Memory (HBM) Module
    Choi, Seonguk
    Son, Keeyoung
    Park, Hyunwook
    Kim, Seongguk
    Sim, Boogyo
    Kim, Jihun
    Park, Joonsang
    Kim, Minsu
    Kim, Haeyeon
    Song, Jinwook
    Kim, Youngwoo
    Kim, Joungho
    IEEE TRANSACTIONS ON COMPONENTS PACKAGING AND MANUFACTURING TECHNOLOGY, 2023, 13 (11): : 1804 - 1816
  • [25] High-Throughput, Area-Efficient, and Variation-Tolerant 3-D In-Memory Compute System for Deep Convolutional Neural Networks
    Veluri, Hasita
    Li, Yida
    Niu, Jessie Xuhua
    Zamburg, Evgeny
    Thean, Aaron Voon-Yew
    IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (11) : 9219 - 9232
  • [26] Improving the accuracy of daily solar radiation prediction by climatic data using an efficient hybrid deep learning model: Long short-term memory (LSTM) network coupled with wavelet transform
    Alizamir, Meysam
    Shiri, Jalal
    Fard, Ahmad Fakheri
    Kim, Sungwon
    Gorgij, AliReza Docheshmeh
    Heddam, Salim
    Singh, Vijay P.
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 123
  • [27] Design and Analysis of Hybrid-Excitation Variable Flux Memory Motor for Traction Applications: Improving Output Power in High-Speed Area During Six-Step Operation Mode
    Tsunata, Ren
    Yokomichi, Keito
    Takemoto, Masatsugu
    Imai, Jun
    IEEE ACCESS, 2023, 11 : 82024 - 82036