Scalable Light-Weight Integration of FPGA Based Accelerators with Chip Multi-Processors

被引:5
|
作者
Lin, Zhe [1 ]
Sinha, Sharad [2 ]
Liang, Hao [1 ]
Feng, Liang [1 ]
Zhang, Wei [1 ]
机构
[1] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Hong Kong, Hong Kong, Peoples R China
[2] Nanyang Technol Univ, Sch Comp Engn, Singapore 639798, Singapore
来源
IEEE TRANSACTIONS ON MULTI-SCALE COMPUTING SYSTEMS | 2018年 / 4卷 / 02期
关键词
FPGA; hardware accelerator; heterogeneous system; network-on-chip; chip-multiprocessor;
D O I
10.1109/TMSCS.2017.2754378
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Modern multicore systems are migrating from homogeneous systems to heterogeneous systems with accelerator-based computing in order to overcome the barriers of performance and power walls. In this trend, FPGA-based accelerators are becoming increasingly attractive, due to their excellent flexibility and low design cost. In this paper, we propose the architectural support for efficient interfacing between FPGA-based multi-accelerators and chip-multiprocessors (CMPs) connected through the network-on-chip (NoC). Distributed packet receivers and hierarchical packet senders are designed to maintain scalability and reduce the critical path delay under a heavy task load. A dedicated accelerator chaining mechanism is also proposed to facilitate intra-FPGA data reuse among accelerators to circumvent prohibitive communication overhead between the FPGA and processors. In order to evaluate the proposed architecture, a complete system emulation with programmability support is performed using FPGA prototyping. Experimental results demonstrate that the proposed architecture has high-performance, and is light-weight and scalable in characteristics.
引用
收藏
页码:152 / 162
页数:11
相关论文
共 50 条
  • [11] A Low-Power Network-on-Chip Architecture for Tile-based Chip Multi-Processors
    Psarras, Anastasios
    Lee, Junghee
    Mattheakis, Pavlos
    Nicopoulos, Chrysostomos
    Dimitrakopoulos, Giorgos
    2016 INTERNATIONAL GREAT LAKES SYMPOSIUM ON VLSI (GLSVLSI), 2016, : 335 - 340
  • [12] Cache Partitioning on Chip Multi-processors for Balanced Parallel Scientific Applications
    Suo, Guang
    2009 INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES (PDCAT 2009), 2009, : 124 - 129
  • [13] ThermOS: System Support for Dynamic Thermal Management of Chip Multi-Processors
    Sironi, Filippo
    Maggio, Martina
    Cattaneo, Riccardo
    Del Nero, Giovanni F.
    Sciuto, Donatella
    Santambrogio, Marco D.
    2013 22ND INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES (PACT), 2013, : 41 - 50
  • [14] Utility Aware Snoozy Caches for Energy Efficient Chip Multi-Processors
    Kulkarn, Ashwini A.
    Chakraborty, Shounak
    Mahajan, Shrinivas P.
    Kapoor, Hemangee K.
    PROCEEDINGS OF THE 2018 GREAT LAKES SYMPOSIUM ON VLSI (GLSVLSI'18), 2018, : 249 - 254
  • [15] Dynamic energy management for chip multi-processors under performance constraints
    Moghaddam, Milad Ghorbani
    Ababei, Cristinel
    MICROPROCESSORS AND MICROSYSTEMS, 2017, 54 : 1 - 13
  • [16] A survey of memory architecture for 3D chip multi-processors
    Zhang, Yuang
    Li, Li
    Lu, Zhonghai
    Jantsch, Axel
    Gao, Minglun
    Pan, Hongbing
    Han, Feng
    MICROPROCESSORS AND MICROSYSTEMS, 2014, 38 (05) : 415 - 430
  • [17] HaDeS: Architectural Synthesis for Heterogeneous Dark Silicon Chip Multi-processors
    Turakhia, Yatish
    Raghunathan, Bharathwaj
    Garg, Siddharth
    Marculescu, Diana
    2013 50TH ACM / EDAC / IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2013,
  • [18] Towards Analysing the Effect of Hybrid Caches on the Temperature of Tiled Chip Multi-Processors
    Kulkarni, Ashwini
    Rani, Khushboo
    Agarwal, Sukarn
    Mahajan, S. P.
    Kapoor, Hemangee K.
    2018 IEEE 4TH INTERNATIONAL SYMPOSIUM ON SMART ELECTRONIC SYSTEMS (ISES 2018), 2018, : 52 - 57
  • [19] A Light-Weight and Scalable Network Profiling System
    Kind, Andreas
    Hurley, Paul
    Massar, Jeroen
    ERCIM NEWS, 2005, (60): : 67 - 68
  • [20] Exploiting On-Chip Routers to Store Dirty Cache Blocks in Tiled Chip Multi-Processors
    Das, Abhijit
    Kumar, Abhishek
    Jose, John
    Palesi, Maurizio
    2020 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI 2020), 2020, : 147 - 152