Early DSE and Automatic Generation of Coarse-grained Merged Accelerators

被引:2
|
作者
Brumar, Iulian [1 ]
Zacharopoulos, Georgios [1 ]
Yao, Yuan [1 ]
Rama, Saketh [1 ]
Brooks, David [1 ]
Wei, Gu-Yeon [1 ]
机构
[1] Harvard Univ, POB 1212, Cambridge, MA 02138 USA
关键词
Hardware-software codesign; datapath optimization; neural networks;
D O I
10.1145/3546070
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Post-Moore's law area-constrained systems rely on accelerators to deliver performance enhancements. Coarse-grained accelerators can offer substantial domain acceleration, but manual, ad hoc identification of code to accelerate is prohibitively expensive. Because cycle-accurate simulators and high-level synthesis (HLS) flows are so time-consuming, the manual creation of high-utilization accelerators that exploit control and data flow patterns at optimal granularities is rarely successful. To address these challenges, we present AccelMerger, the first automated methodology to create coarse-grained, control- and data-flow-rich merged accelerators. AccelMerger uses sequence alignment matching to recognize similar function call-graphs and loops, and neural networks to quickly evaluate their post-HLS characteristics. It accurately identifies which functions to accelerate, and it merges accelerators to respect an area budget and to accommodate system communication characteristics like latency and bandwidth. Merging two accelerators can save as much as 99% of the area of one. The space saved is used by a globally optimal integer linear program to allocate more accelerators for increased performance. We demonstrate AccelMerger's effectiveness using HLS flows without any manual effort to fine-tune the resulting designs. On FPGA-based systems, AccelMerger yields application performance improvements of up to 16.7x over software implementations, and 1.91x on average with respect to state-of-the-art early-stage design space exploration tools.
引用
收藏
页数:29
相关论文
共 50 条
  • [1] CGPA: Coarse-Grained Pipelined Accelerators
    Liu, Feng
    Ghosh, Soumyadeep
    Johnson, Nick P.
    August, David I.
    2014 51ST ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2014,
  • [2] Automatic coarse-grained parallelization techniques
    Cosnard, M
    Jeannot, E
    ADVANCES IN HIGH PERFORMANCE COMPUTING, 1997, 30 : 253 - 270
  • [3] DSE and Profiling of Multi-Context Coarse-Grained Reconfigurable Systems
    Palumbo, Francesca
    Sau, Carlo
    Raffo, Luigi
    2013 8TH INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS (ISPA), 2013, : 744 - 749
  • [4] AURORA: Automated Refinement of Coarse-Grained Reconfigurable Accelerators
    Tan, Cheng
    Xie, Chenhao
    Li, Ang
    Barker, Kevin J.
    Tumeo, Antonino
    PROCEEDINGS OF THE 2021 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2021), 2021, : 1388 - 1393
  • [5] An Automatic Parallelizer for Coarse-Grained Reconfigurable Processor
    Mi, Ping
    Zhao, Zhongyuan
    Sheng, Weiguang
    He, Weifeng
    2016 13TH IEEE INTERNATIONAL CONFERENCE ON SOLID-STATE AND INTEGRATED CIRCUIT TECHNOLOGY (ICSICT), 2016, : 215 - 217
  • [6] TANGRAM: Optimized Coarse-Grained Dataflow for Scalable NN Accelerators
    Gao, Mingyu
    Yang, Xuan
    Pu, Jing
    Horowitz, Mark
    Kozyrakis, Christos
    TWENTY-FOURTH INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS (ASPLOS XXIV), 2019, : 807 - 820
  • [7] Automatic discovery of coarse-grained parallelism in media applications
    Ryoo, Shane
    Ueng, Sain-Zee
    Rodrigues, Christopher I.
    Kidd, Robert E.
    Frank, Matthew I.
    Hwu, Wen-mei W.
    TRANSACTIONS ON HIGH-PERFORMANCE EMBEDDED ARCHITECTURES AND COMPILERS I, 2007, 4050 : 194 - +
  • [8] AHA: An Agile Approach to the Design of Coarse-Grained Reconfigurable Accelerators and Compilers
    Koul, Kalhan
    Melchert, Jackson
    Sreedhar, Kavya
    Truong, Leonard
    Nyengele, Gedeon
    Zhang, Keyi
    Liu, Qiaoyi
    Setter, Jeff
    Chen, Po-Han
    Mei, Yuchen
    Strange, Maxwell
    Daly, Ross
    Donovick, Caleb
    Carsello, Alex
    Kong, Taeyoung
    Feng, Kathleen
    Huff, Dillon
    Nayak, Ankita
    Setaluri, Rajsekhar
    Thomas, James
    Bhagdikar, Nikhil
    Durst, David
    Myers, Zachary
    Tsiskaridze, Nestan
    Richardson, Stephen
    Bahr, Rick
    Fatahalian, Kayvon
    Hanrahan, Pat
    Barrett, Clark
    Horowitz, Mark
    Torng, Christopher
    Kjolstad, Fredrik
    Raina, Priyanka
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2023, 22 (02)
  • [9] A polarizable coarse-grained water model for coarse-grained proteins simulations
    Ha-Duong, Tap
    Basdevant, Nathalie
    Borgis, Daniel
    CHEMICAL PHYSICS LETTERS, 2009, 468 (1-3) : 79 - 82
  • [10] A Proposal of Automatic Selection of Coarse-grained Semantic Classes for WSD
    Izquierdo, Ruben
    Suarez, Armando
    Rigau, German
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2007, (39): : 189 - 196