Exploring the Tradeoffs of Application-Specific Processing

Cited by: 1
Authors
Schabel, Joshua C. [1]
Franzon, Paul D. [1]
Affiliations
[1] North Carolina State Univ, Dept Elect & Comp Engn, Raleigh, NC 27695 USA
Keywords
ASIP; SIMD; CGRA; processing-in-memory; processing-near-memory; HTM; sparsey; artificial neural networks; ARCHITECTURE; SPECIALIZATION; DESIGN;
DOI
10.1109/JETCAS.2018.2849939
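Chinese Library Classification
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];
Discipline classification codes
0808; 0809;
Abstract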
Non-traditional processing schemes continue to grow in popularity as a means to achieve high performance with greater energy efficiency. Data-centric processing is one such scheme that targets functional specialization and memory bandwidth limitations, opening up small processors to wide memory IO. These function-specific accelerators prove to be an essential component in achieving energy efficiency and performance, but purely application-specific integrated circuit accelerators have expensive design overheads with limited reusability. We propose an architecture that combines existing processing schemes, utilizing CGRAs for dynamic data path configuration, as a means to add flexibility and reusability to data-centric acceleration. While flexibility adds a large energy overhead, performance can be regained through intelligent mappings to the CGRA for the functions of interest, while reusability can be gained through incrementally adding general-purpose functionality to the processing elements. Building upon previous work accelerating sparse encoded neural networks, we present a CGRA architecture for mapping functional accelerators operating at 500 MHz in 32 nm. This architecture achieves a latency-per-function within 2x of its function-specific counterparts with energy-per-operation increases between 21-188x, and energy-per-area increases between 1.8-3.6x.
Pages: 531 - 542
Number of pages: 12