HRL: Efficient and Flexible Reconfigurable Logic for Near-Data Processing

被引：0

作者：

Gao, Mingyu ^{[1
]}

Kozyrakis, Christos ^{[1
,2
]}

机构：

[1] Stanford Univ, Stanford, CA 94305 USA

[2] Ecole Polytech Fed Lausanne, Lausanne, Switzerland

来源：

PROCEEDINGS OF THE 2016 IEEE INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE (HPCA-22) | 2016年

基金：

美国国家科学基金会;

关键词：

MODEL;

D O I：

暂无

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

The energy constraints due to the end of Dennard scaling, the popularity of in-memory analytics, and the advances in 3D integration technology have led to renewed interest in near-data processing (NDP) architectures that move processing closer to main memory. Due to the limited power and area budgets of the logic layer, the NDP compute units should be area and energy efficient while providing sufficient compute capability to match the high bandwidth of vertical memory channels. They should also be flexible to accommodate a wide range of applications. Towards this goal, NDP units based on fine-grained (FPGA) and coarse-grained (CGRA) reconfigurable logic have been proposed as a compromise between the efficiency of custom engines and the flexibility of programmable cores. Unfortunately, FPGAs incur significant area overheads for bit-level reconfiguration, while CGRAs consume significant power in the interconnect and are inefficient for irregular data layouts and control flows. This paper presents Heterogeneous Reconfigurable Logic (HRL), a reconfigurable array for NDP systems that improves on both FPGA and CGRA arrays. HRL combines both coarse-grained and fine-grained logic blocks, separates routing networks for data and control signals, and uses specialized units to effectively support branch operations and irregular data layouts in analytics workloads. HRL has the power efficiency of FPGA and the area efficiency of CGRA. It improves performance per Watt by 2.2x over FPGA and 1.7x over CGRA. For NDP systems running MapReduce, graph processing, and deep neural networks, HRL achieves 92% of the peak performance of an NDP system based on custom accelerators for each application.

引用

页码：126 / 137

页数：12

共 50 条

[1] NEAR-DATA PROCESSING
Balasubramonian, Rajeev
Grot, Boris
[J]. IEEE MICRO, 2016, 36 (01) : 4 - 5
[2] Machine Learning Migration for Efficient Near-Data Processing
Cordeiro, Aline S.
dos Santos, Sairo R.
Moreira, Francis B.
Santos, Paulo C.
Carro, Luigi
Alves, Marco A. Z.
[J]. 2021 29TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING (PDP 2021), 2021, : 212 - 219
[3] Efficient Machine Learning execution with Near-Data Processing
Cordeiro, Aline S.
dos Santos, Sairo R.
Moreira, Francis B.
Santos, Paulo C.
Carro, Luigi
Alves, Marco A. Z.
[J]. MICROPROCESSORS AND MICROSYSTEMS, 2022, 90
[4] Advancing Near-Data Processing with Precise Exceptions and Efficient Data Fetching
Santos, Sairo
Kepe, Tiago R.
Moreira, Francis B.
Santos, Paulo C.
Alves, Marco A. Z.
[J]. 2022 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE (ISPASS 2022), 2022, : 230 - 232
[5] Two Reconfigurable NDP Servers: Understanding the Impact of Near-Data Processing on Data Center Applications
Song, Xiaojia
Xie, Tao
Fischer, Stephen
[J]. ACM TRANSACTIONS ON STORAGE, 2021, 17 (04)
[6] Overcoming Challenges to Near-Data Processing
Jayasena, Nuwan
[J]. IEEE MICRO, 2016, 36 (01) : 8 - 9
[7] Near-Data Processing of Neural Networks
Chen, Yunji
Tao, Jinhua
[J]. IEEE MICRO, 2016, 36 (01) : 9 - 10
[8] Optimizing Near-Data Processing for Spark
Rachuri, Sri Pramodh
Gantasala, Arun
Emanuel, Prajeeth
Gandhi, Anshul
Foley, Robert
Puhov, Peter
Gkountouvas, Theodoros
Lei, Hui
[J]. 2022 IEEE 42ND INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2022), 2022, : 636 - 646
[9] An Architecture for Near-Data Processing Systems
Vermij, Erik
Hagleitner, Christoph
Fiorin, Leandro
Jongerius, Rik
van Lunteren, Jan
Bertels, Koen
[J]. PROCEEDINGS OF THE ACM INTERNATIONAL CONFERENCE ON COMPUTING FRONTIERS (CF'16), 2016, : 357 - 360
[10] GraNDe: Efficient Near-Data Processing Architecture for Graph Neural Networks
Yun, Sungmin
Nam, Hwayong
Park, Jaehyun
Kim, Byeongho
Ahn, Jung Ho
Lee, Eojin
[J]. IEEE TRANSACTIONS ON COMPUTERS, 2024, 73 (10) : 2391 - 2404

← 1 2 3 4 5 →