Active-Routing: Compute on the Way for Near-Data Processing

被引:12
|
作者
Huang, Jiayi [1 ]
Puli, Ramprakash Reddy [2 ]
Majumder, Pritam [1 ]
Kim, Sungkeun [1 ]
Boyapati, Rahul [3 ]
Yum, Ki Hwan [1 ]
Kim, Eun Jung [1 ]
机构
[1] Texas A&M Univ, College Stn, TX 77843 USA
[2] NVIDIA, Santa Clara, CA USA
[3] Intel Corp, Santa Clara, CA 95051 USA
关键词
memory network; data-flow; in-network computing; near-data processing; processing-in-memory;
D O I
10.1109/HPCA.2019.00018
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The explosion of data availability and the demand for faster data analysis have led to the emergence of applications exhibiting large memory footprint and low data reuse rate. These workloads, ranging from neural networks to graph processing, expose compute kernels that operate over myriads of data. Significant data movement requirements of these kernels impose heavy stress on modern memory subsystems and communication fabrics. To mitigate the worsening gap between high CPU computation density and deficient memory bandwidth, solutions like memory networks and near-data processing designs are being architected to improve system performance substantially. In this work, we examine the idea of mapping compute kernels to the memory network so as to leverage in-network computing in data-flow style, by means of near-data processing. We propose Active-Routing, an in-network compute architecture that enables computation on the way for near-data processing by exploiting patterns of aggregation over intermediate results of arithmetic operators. The proposed architecture leverages the massive memory-level parallelism and network concurrency to optimize the aggregation operations along a dynamically built Active-Routing Tree. Our evaluations show that Active-Routing can achieve upto 7x speedup with an average of 60% performance improvement, and reduce the energy-delay product by 80% across various benchmarks compared to the state-of-the-art processing-in-memory architecture.
引用
收藏
页码:674 / 686
页数:13
相关论文
共 50 条
  • [1] NEAR-DATA PROCESSING
    Balasubramonian, Rajeev
    Grot, Boris
    [J]. IEEE MICRO, 2016, 36 (01) : 4 - 5
  • [2] Overcoming Challenges to Near-Data Processing
    Jayasena, Nuwan
    [J]. IEEE MICRO, 2016, 36 (01) : 8 - 9
  • [3] Near-Data Processing of Neural Networks
    Chen, Yunji
    Tao, Jinhua
    [J]. IEEE MICRO, 2016, 36 (01) : 9 - 10
  • [4] Optimizing Near-Data Processing for Spark
    Rachuri, Sri Pramodh
    Gantasala, Arun
    Emanuel, Prajeeth
    Gandhi, Anshul
    Foley, Robert
    Puhov, Peter
    Gkountouvas, Theodoros
    Lei, Hui
    [J]. 2022 IEEE 42ND INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2022), 2022, : 636 - 646
  • [5] An Architecture for Near-Data Processing Systems
    Vermij, Erik
    Hagleitner, Christoph
    Fiorin, Leandro
    Jongerius, Rik
    van Lunteren, Jan
    Bertels, Koen
    [J]. PROCEEDINGS OF THE ACM INTERNATIONAL CONFERENCE ON COMPUTING FRONTIERS (CF'16), 2016, : 357 - 360
  • [6] JAFAR: Near-Data Processing for Databases
    Babarinsa, Oreoluwa
    Idreos, Stratos
    [J]. SIGMOD'15: PROCEEDINGS OF THE 2015 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2015, : 2069 - 2070
  • [7] Streaming Analytics with Adaptive Near-data Processing
    Sandur, Atul
    Park, ChanHo
    Volos, Stavros
    Agha, Gul
    Jeon, Myeongjae
    [J]. COMPANION PROCEEDINGS OF THE WEB CONFERENCE 2022, WWW 2022 COMPANION, 2022, : 563 - 566
  • [8] An Active-Routing Authentication Scheme in MANET
    Tu, Jinbin
    Tian, Dahai
    Wang, Yun
    [J]. IEEE ACCESS, 2021, 9 : 34276 - 34286
  • [9] Computing En-Route for Near-Data Processing
    Huang, Jiayi
    Majumder, Pritam
    Kim, Sungkeun
    Fulton, Troy
    Puli, Ramprakash Reddy
    Yum, Ki Hwan
    Kim, Eun Jung
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2021, 70 (06) : 906 - 921
  • [10] Advancing Database System Operators with Near-Data Processing
    dos Santos, Sairo R.
    Moreira, Francis B.
    Kepe, Tiago R.
    Alves, Marco A. Z.
    [J]. 30TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING (PDP 2022), 2022, : 127 - 134