Neural Acceleration for GPU Throughput Processors

被引:56
|
作者
Yazdanbakhsh, Amir [1 ]
Park, Jongse [1 ]
Sharma, Hardik [1 ]
Lotfi-Kamran, Pejman [2 ]
Esmaeilzadeh, Hadi [1 ]
机构
[1] Georgia Inst Technol, Alternat Comp Technol ACT Lab, Atlanta, GA 30332 USA
[2] Inst Res Fundamental Sci IPM, Sch Comp Sci, Tehran, Iran
基金
美国国家科学基金会;
关键词
Approximate computing; GPU; neural processing unit;
D O I
10.1145/2830772.2830810
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Graphics Processing Units (GPUs) can accelerate diverse classes of applications, such as recognition, gaming, data analytics, weather prediction, and multimedia. Many of these applications are amenable to approximate execution. This application characteristic provides an opportunity to improve GPU performance and efficiency. Among approximation techniques, neural accelerators have been shown to provide significant performance and efficiency gains when augmenting CPU processors. However, the integration of neural accelerators within a GPU processor has remained unexplored. GPUs are, in a sense, many-core accelerators that exploit large degrees of data-level parallelism in the applications through the SIMT execution model. This paper aims to harmoniously bring neural and GPU accelerators together without hindering SIMT execution or adding excessive hardware overhead. We introduce a low overhead neurally accelerated architecture for GPUs, called NGPU, that enables scalable integration of neural accelerators for large number of GPU cores. This work also devises a mechanism that controls the tradeoff between the quality of results and the benefits from neural acceleration. Compared to the baseline GPU architecture, cycle-accurate simulation results for NGPU show a 2.4x average speedup and a 2.8x average energy reduction within 10% quality loss margin across a diverse set of benchmarks. The proposed quality control mechanism retains a 1.9x average speedup and a 2.1x energy reduction while reducing the degradation in the quality of results to 2.5%. These benefits are achieved by less than 1% area overhead.
引用
收藏
页码:482 / 493
页数:12
相关论文
共 50 条
  • [41] GPU Based Acceleration of Telegraph equation
    Simek, Vaclav
    Kraus, Michal
    Kunovsky, Jiri
    Petrek, Jiri
    2008 UKSIM TENTH INTERNATIONAL CONFERENCE ON COMPUTER MODELING AND SIMULATION, 2008, : 629 - 630
  • [42] Jaya optimization algorithm with GPU acceleration
    A. Jimeno-Morenilla
    J. L. Sánchez-Romero
    H. Migallón
    H. Mora-Mora
    The Journal of Supercomputing, 2019, 75 : 1094 - 1106
  • [43] GPU Acceleration for Statistical Gene Classification
    Benso, Alfredo
    Di Carlo, Stefano
    Politano, Gianfranco
    Savino, Alessandro
    PROCEEDINGS OF 2010 IEEE INTERNATIONAL CONFERENCE ON AUTOMATION, QUALITY AND TESTING, ROBOTICS (AQTR 2010), VOLS. 1-3, 2010,
  • [44] Acceleration of Turbomachinery Steady Simulations on GPU
    Aissa, Mohamed Hassanine
    Mueller, Lasse
    Verstraete, Tom
    Vuik, Cornelis
    EURO-PAR 2016: PARALLEL PROCESSING WORKSHOPS, 2017, 10104 : 814 - 825
  • [45] GPU Acceleration of Transmural Electrophysiological Imaging
    Corraine, M.
    Lopez, S.
    Wang, L.
    2012 COMPUTING IN CARDIOLOGY (CINC), VOL 39, 2012, 39 : 849 - 852
  • [46] GPU Acceleration of Pyrosequencing Noise Removal
    Gao, Yang
    Bakos, Jason D.
    2012 SYMPOSIUM ON APPLICATION ACCELERATORS IN HIGH PERFORMANCE COMPUTING (SAAHPC), 2012, : 94 - 101
  • [47] GPU acceleration for Kernel Samepage Merging
    Lin, Wei-Cheng
    Tu, Chia-Heng
    Yeh, Chih-Wei
    Hung, Shih-Hao
    2017 IEEE 23RD INTERNATIONAL CONFERENCE ON EMBEDDED AND REAL-TIME COMPUTING SYSTEMS AND APPLICATIONS (RTCSA), 2017,
  • [48] SAGA: SystemC Acceleration on GPU Architectures
    Vinco, Sara
    Chatterjee, Debapriya
    Bertacco, Valeria
    Fummi, Franco
    2012 49TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2012, : 115 - 120
  • [49] GPU Acceleration of CFD Simulations in OpenFOAM
    Piscaglia, Federico
    Ghioldi, Federico
    AEROSPACE, 2023, 10 (09)
  • [50] GPU Acceleration of Robust Point Matching
    Mourning, Chad
    Nykl, Scott
    Xu, Huihui
    Chelberg, David
    Liu, Jundong
    ADVANCES IN VISUAL COMPUTING, PT III, 2010, 6455 : 417 - 426