HRL: Efficient and Flexible Reconfigurable Logic for Near-Data Processing

被引:0
|
作者
Gao, Mingyu [1 ]
Kozyrakis, Christos [1 ,2 ]
机构
[1] Stanford Univ, Stanford, CA 94305 USA
[2] Ecole Polytech Fed Lausanne, Lausanne, Switzerland
基金
美国国家科学基金会;
关键词
MODEL;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The energy constraints due to the end of Dennard scaling, the popularity of in-memory analytics, and the advances in 3D integration technology have led to renewed interest in near-data processing (NDP) architectures that move processing closer to main memory. Due to the limited power and area budgets of the logic layer, the NDP compute units should be area and energy efficient while providing sufficient compute capability to match the high bandwidth of vertical memory channels. They should also be flexible to accommodate a wide range of applications. Towards this goal, NDP units based on fine-grained (FPGA) and coarse-grained (CGRA) reconfigurable logic have been proposed as a compromise between the efficiency of custom engines and the flexibility of programmable cores. Unfortunately, FPGAs incur significant area overheads for bit-level reconfiguration, while CGRAs consume significant power in the interconnect and are inefficient for irregular data layouts and control flows. This paper presents Heterogeneous Reconfigurable Logic (HRL), a reconfigurable array for NDP systems that improves on both FPGA and CGRA arrays. HRL combines both coarse-grained and fine-grained logic blocks, separates routing networks for data and control signals, and uses specialized units to effectively support branch operations and irregular data layouts in analytics workloads. HRL has the power efficiency of FPGA and the area efficiency of CGRA. It improves performance per Watt by 2.2x over FPGA and 1.7x over CGRA. For NDP systems running MapReduce, graph processing, and deep neural networks, HRL achieves 92% of the peak performance of an NDP system based on custom accelerators for each application.
引用
收藏
页码:126 / 137
页数:12
相关论文
共 50 条
  • [1] NEAR-DATA PROCESSING
    Balasubramonian, Rajeev
    Grot, Boris
    [J]. IEEE MICRO, 2016, 36 (01) : 4 - 5
  • [2] Machine Learning Migration for Efficient Near-Data Processing
    Cordeiro, Aline S.
    dos Santos, Sairo R.
    Moreira, Francis B.
    Santos, Paulo C.
    Carro, Luigi
    Alves, Marco A. Z.
    [J]. 2021 29TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING (PDP 2021), 2021, : 212 - 219
  • [3] Efficient Machine Learning execution with Near-Data Processing
    Cordeiro, Aline S.
    dos Santos, Sairo R.
    Moreira, Francis B.
    Santos, Paulo C.
    Carro, Luigi
    Alves, Marco A. Z.
    [J]. MICROPROCESSORS AND MICROSYSTEMS, 2022, 90
  • [4] Advancing Near-Data Processing with Precise Exceptions and Efficient Data Fetching
    Santos, Sairo
    Kepe, Tiago R.
    Moreira, Francis B.
    Santos, Paulo C.
    Alves, Marco A. Z.
    [J]. 2022 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE (ISPASS 2022), 2022, : 230 - 232
  • [5] Two Reconfigurable NDP Servers: Understanding the Impact of Near-Data Processing on Data Center Applications
    Song, Xiaojia
    Xie, Tao
    Fischer, Stephen
    [J]. ACM TRANSACTIONS ON STORAGE, 2021, 17 (04)
  • [6] Overcoming Challenges to Near-Data Processing
    Jayasena, Nuwan
    [J]. IEEE MICRO, 2016, 36 (01) : 8 - 9
  • [7] Near-Data Processing of Neural Networks
    Chen, Yunji
    Tao, Jinhua
    [J]. IEEE MICRO, 2016, 36 (01) : 9 - 10
  • [8] Optimizing Near-Data Processing for Spark
    Rachuri, Sri Pramodh
    Gantasala, Arun
    Emanuel, Prajeeth
    Gandhi, Anshul
    Foley, Robert
    Puhov, Peter
    Gkountouvas, Theodoros
    Lei, Hui
    [J]. 2022 IEEE 42ND INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2022), 2022, : 636 - 646
  • [9] An Architecture for Near-Data Processing Systems
    Vermij, Erik
    Hagleitner, Christoph
    Fiorin, Leandro
    Jongerius, Rik
    van Lunteren, Jan
    Bertels, Koen
    [J]. PROCEEDINGS OF THE ACM INTERNATIONAL CONFERENCE ON COMPUTING FRONTIERS (CF'16), 2016, : 357 - 360
  • [10] GraNDe: Efficient Near-Data Processing Architecture for Graph Neural Networks
    Yun, Sungmin
    Nam, Hwayong
    Park, Jaehyun
    Kim, Byeongho
    Ahn, Jung Ho
    Lee, Eojin
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2024, 73 (10) : 2391 - 2404