Emerging High-Performance Computing (HPC) workloads, such as graph analytics, machine learning, and big data science, are data-intensive. These workloads typically exhibit irregular memory footprints with limited data locality, and thus incur frequent cache misses and an ever-growing demand for memory bandwidth. Driven by this need, 3D-stacked memory devices such as the Hybrid Memory Cube (HMC) and High Bandwidth Memory (HBM) have been introduced to deliver significantly higher throughput. However, the traditional interfaces and optimization methods designed for JEDEC DDR devices cannot fully exploit the potential performance of 3D-stacked memory when handling the massive irregular memory accesses that accompany data-intensive applications.

3D-stacked memory devices (shown in Figure 1), such as High Bandwidth Memory (HBM) [1] and the Hybrid Memory Cube (HMC) [2], provide significantly higher bandwidth than conventional Double Data Rate synchronous Dynamic Random Access Memory (DDR DRAM), and thus offer an opportunity to better address the requirements of data-intensive applications. In these devices, the DRAM dies are stacked on top of a logic die via 3D packaging, and the logic layer implements the memory controller that manages the stacked DRAM. Well-known commercial products using this technology include the latest generations of NVIDIA's Graphics Processing Units (GPUs), Intel's Xeon Phi processors, and the Fujitsu PrimeHPC FX100.

One issue for data-intensive applications is the frequent formation of memory hotspots, due to the fine-grained nature of their data accesses. Memory hotspots are frequently accessed memory locations that can significantly hinder the performance of DRAM devices because of their banked design: repeated accesses to the same memory banks increase bank conflicts among memory operations, thus lengthening their latency [3]. Given the nondeterministic memory footprints of irregular applications, bank interleaving may not avoid hotspot formation as effectively as expected.
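To make the hotspot effect concrete, the following minimal C sketch (not taken from the paper; the bank count, line size, interleaving scheme, and access distribution are illustrative assumptions rather than parameters of any specific HBM/HMC device) counts how a skewed access stream maps onto interleaved banks. Because the hot lines fall into only a few banks, those banks absorb most of the traffic despite interleaving, which is exactly the conflict pattern described above.

/*
 * Illustrative sketch: low-order bank interleaving under a skewed
 * (irregular) access stream. Bank count, line size, and the 80/20
 * access split are assumptions chosen only for demonstration.
 */
#include <stdio.h>
#include <stdlib.h>

#define NUM_BANKS    16        /* assumed banks per channel            */
#define LINE_BYTES   64        /* assumed access (cache-line) granularity */
#define NUM_ACCESSES 1000000

/* Low-order interleaving: bank = (address / line size) mod #banks. */
static unsigned bank_of(unsigned long addr)
{
    return (unsigned)((addr / LINE_BYTES) % NUM_BANKS);
}

int main(void)
{
    unsigned long counts[NUM_BANKS] = {0};
    srand(42);

    for (int i = 0; i < NUM_ACCESSES; i++) {
        unsigned long addr;
        /* Skewed stream: 80% of accesses touch a small hot region,
         * the remaining 20% are spread uniformly. */
        if (rand() % 100 < 80)
            addr = (unsigned long)(rand() % 4) * LINE_BYTES;  /* hot lines  */
        else
            addr = (unsigned long)rand() * LINE_BYTES;        /* cold lines */
        counts[bank_of(addr)]++;
    }

    /* Report per-bank access counts; a few banks dominate the total. */
    for (int b = 0; b < NUM_BANKS; b++)
        printf("bank %2d: %lu accesses\n", b, counts[b]);
    return 0;
}

Running this sketch shows a handful of banks receiving the large majority of accesses, while the rest stay nearly idle; in real DRAM such imbalance serializes requests behind the busy banks and lengthens their latency, as noted in [3].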