SparseAdapt: Runtime Control for Sparse Linear Algebra on a Reconfigurable Accelerator

Cited by: 9
Authors
Pal, Subhankar [1]
Amarnath, Aporva [1]
Feng, Siying [1]
O'Boyle, Michael [2]
Dreslinski, Ronald [1]
Dubach, Christophe [3]
Affiliations
[1] Univ Michigan, Ann Arbor, MI 48109 USA
[2] Univ Edinburgh, Edinburgh, Midlothian, Scotland
[3] McGill Univ, Montreal, PQ, Canada
Funding
Natural Sciences and Engineering Research Council of Canada; UK Engineering and Physical Sciences Research Council;
Keywords
reconfigurable accelerators; sparse linear algebra; energy-efficient computing; machine learning; predictive models;
DOI
10.1145/3466752.3480134
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Dynamic adaptation is a post-silicon optimization technique that adapts the hardware to workload phases. However, current adaptive approaches are oblivious to implicit phases that arise from operating on irregular data, such as sparse linear algebra operations. Implicit phases are short-lived and do not exhibit consistent behavior throughout execution. This calls for a high-accuracy, low-overhead runtime mechanism for adaptation at a fine granularity. Moreover, adopting such techniques for reconfigurable manycore hardware, such as coarse-grained reconfigurable architectures (CGRAs), adds complexity due to synchronization and resource contention. We propose a lightweight machine learning-based adaptive framework called SparseAdapt. It enables low-overhead control of configuration parameters to tailor the hardware to both implicit (data-driven) and explicit (code-driven) phase changes. SparseAdapt is implemented within the runtime of a recently proposed CGRA called Transmuter, which has been shown to deliver high performance for irregular sparse operations. SparseAdapt can adapt configuration parameters such as resource sharing, cache capacities, prefetcher aggressiveness, and dynamic voltage-frequency scaling (DVFS). Moreover, it can operate under the constraints of either (i) high energy-efficiency (maximal GFLOPS/W), or (ii) high power-performance (maximal GFLOPS^3/W). We evaluate SparseAdapt with sparse matrix-matrix and matrix-vector multiplication (SpMSpM and SpMSpV) routines across a suite of uniform random, power-law and real-world matrices, in addition to end-to-end evaluation on two graph algorithms. SparseAdapt achieves similar performance on SpMSpM as the largest static configuration, with 5.3x better energy-efficiency. Furthermore, on both performance and efficiency, SparseAdapt is within 13% of an Oracle that adapts the configuration of each phase with global knowledge of the entire program execution. Finally, SparseAdapt is able to outperform the state-of-the-art approach for runtime reconfiguration by up to 2.9x in terms of energy-efficiency.
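The two operating modes named in the abstract, maximal GFLOPS/W and maximal GFLOPS^3/W, can rank the same hardware configurations differently. The sketch below is only an illustration of that point and is not taken from the paper: the `Sample` type, the configuration names, and the throughput and power numbers are all hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, Iterable


@dataclass(frozen=True)
class Sample:
    """One hypothetical (configuration, throughput, power) measurement."""
    name: str
    gflops: float
    watts: float


def energy_efficiency(s: Sample) -> float:
    # Mode (i) in the abstract: GFLOPS per watt.
    return s.gflops / s.watts


def power_performance(s: Sample) -> float:
    # Mode (ii) in the abstract: GFLOPS cubed per watt, which weights
    # throughput far more heavily than power.
    return s.gflops ** 3 / s.watts


def best(samples: Iterable[Sample], objective: Callable[[Sample], float]) -> Sample:
    # Pick the sample that maximizes the chosen objective.
    return max(samples, key=objective)


# Made-up numbers, chosen only so that the two objectives disagree.
samples = [
    Sample("small-low-freq", gflops=1.0, watts=0.5),
    Sample("large-high-freq", gflops=4.0, watts=8.0),
]

print(best(samples, energy_efficiency).name)
print(best(samples, power_performance).name)
```

With these invented values the small configuration scores higher on GFLOPS/W, while the large one scores higher on GFLOPS^3/W, which is why a runtime controller needs to know which objective it is targeting.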
Pages: 1005-1021
Page count: 17