Exploring the Tradeoffs of Application-Specific Processing

被引:1
|
作者
Schabel, Joshua C. [1 ]
Franzon, Paul D. [1 ]
机构
[1] North Carolina State Univ, Dept Elect & Comp Engn, Raleigh, NC 27695 USA
关键词
ASIP; SIMD; CGRA; processing-in-memory; processing-near-memory; HTM; sparsey; artificial neural networks; ARCHITECTURE; SPECIALIZATION; DESIGN;
D O I
10.1109/JETCAS.2018.2849939
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Non-traditional processing schemes continue to grow in popularity as a means to achieve high performance with greater energy-efficiency. Data-centric processing is one such scheme that targets functional-specialization and memory bandwidth limitations, opening up small processors to wide memory IO. These functional-specific accelerators prove to be an essential component to achieve energy-efficiency and performance, but purely application-specific integrated circuit accelerators have expensive design overheads with limited reusability. We propose an architecture that combines existing processing schemes utilizing CGRAs for dynamic data path configuration as a means to add flexibility and reusability to data-centric acceleration. While flexibility adds a large energy overhead, performance can be regained through intelligent mappings to the CGRA for the functions of interest, while reusability can he gained through incrementally adding general purpose functionality to the processing elements. Building upon previous work accelerating sparse encoded neural networks, we present a CGRA architecture for mapping functional accelerators operating at 500 MHz in 32 nm. This architecture achieves a latency-per-function within 2x of its function-specific counterparts with energy-per-operation increases between 21-188 x, and energy-per-area increases between 1.8-3.6x.
引用
收藏
页码:531 / 542
页数:12
相关论文
共 50 条
  • [1] APPLICATION-SPECIFIC DIGITAL SIGNAL-PROCESSING
    BROWN, A
    ELECTRONICS WORLD & WIRELESS WORLD, 1993, (1688): : 581 - 583
  • [2] Method to derive application-specific embedded processing cores
    Hebert, Olivier
    Kraljic, Ivan C.
    Savaria, Yvon
    Hardware/Software Codesign - Proceedings of the International Workshop, 2000, : 88 - 92
  • [3] Signal processing for application-specific ad hoc networks
    Sung, Youngchul
    Misra, Saswat
    Tong, Lang
    Ephremides, Anthony
    IEEE SIGNAL PROCESSING MAGAZINE, 2006, 23 (05) : 74 - 83
  • [4] Application-Specific Quantum Circuits (ASQCs) for Image Processing
    Thapliyal, Himanshu
    2021 IEEE INTERNATIONAL SYMPOSIUM ON SMART ELECTRONIC SYSTEMS (ISES 2021), 2021, : 37 - 38
  • [5] Performance Tradeoffs of General-Purpose Digital Hardware and Application-Specific Analog Hardware
    Natalino, Carlos
    Li, Dan
    Ozolins, Oskars
    Pang, Xiaodan
    Da Ros, Francesco
    MACHINE LEARNING IN PHOTONICS, 2024, 13017
  • [6] Application-specific XML processing a parallel approach for optimum performance
    Trujillo, R
    PDPTA '05: Proceedings of the 2005 International Conference on Parallel and Distributed Processing Techniques and Applications, Vols 1-3, 2005, : 959 - 963
  • [7] A Generative Approach to the Construction of Application-Specific XML Processing Components
    Sarasa-Cabezuelo, Antonio
    Martinez-Aviles, Alberto
    Sierra, Jose-Luis
    Fernandez-Valmayor, Alfredo
    2009 35TH EUROMICRO CONFERENCE ON SOFTWARE ENGINEERING AND ADVANCED APPLICATIONS, PROCEEDINGS, 2009, : 345 - 352
  • [8] Supporting Application-specific In-network Processing in Data Centres
    Mai, Luo
    Rupprecht, Lukas
    Costa, Paolo
    Migliavacca, Matteo
    Pietzuch, Peter
    Wolf, Alexander L.
    ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2013, 43 (04) : 519 - 520
  • [9] ENGINEERS ARE NOT APPLICATION-SPECIFIC
    MANDEL, P
    EDN, 1986, 31 (23) : 33 - 33
  • [10] Application-specific processors
    Veidenbaum, A
    IEEE MICRO, 2004, 24 (03) : 8 - 9