On Scaling Speedup with Coarse-Grain Coprocessor Accelerators on Reconfigurable Platforms

被引:0
|
作者
Kornaros, George [1 ,2 ]
Motakis, Antonios [1 ]
机构
[1] Inst Educ Technol, Appl Informat & Multimedia Dept, Iraklion, Crete, Greece
[2] Tech Univ Crete, Elect & Comp Engn Dept, Khania, Crete, Greece
关键词
Hardware accelerator; Multiprocessor on FPGA; Coprocessor; SIMD; Embedded Processing; Speedup; PROCESSOR;
D O I
10.1109/DSD.2010.79
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Instruction set accelerator architectures have emerged recently as light-weight hardware coprocessors, so as to transparently improve applications performance. This paper investigates the effectiveness of adding hardware accelerators as refers to scaling, based on applications that show data level parallelism such as image edge detection and fractal applications. The implementation results using reconfigurable technology show that, by utilizing a number of hardware coprocessor units, applications such as Sobel edge detection can achieve speedup more than 37X. Finally, architectural directions based on the developed case studies show that even better performance can be achieved when the overheads of communication, of serialized data accesses, shared memory and of bus protocols are reduced.
引用
收藏
页码:355 / 362
页数:8
相关论文
共 50 条
  • [31] A coarse-grain reconfigurable architecture for multimedia applications featuring subword computation capabilities
    Brunelli, Claudio
    Garzia, Fabio
    Nurmi, Jari
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2008, 3 (1-2) : 21 - 32
  • [32] Coarse-grain membrane models
    Sperotto, Maria Maddalena
    CHEMISTRY AND PHYSICS OF LIPIDS, 2007, 149 : S43 - S44
  • [33] Scalable algorithm for Montgomery Multiplication and its implementation on the coarse-grain reconfigurable chip
    Trichina, E
    Tiountchik, A
    TOPICS IN CRYPTOLOGY - CT-RAS 2001, PROCEEDINGS, 2001, 2020 : 235 - 249
  • [34] Implementing non power-of-two FFTs on coarse-grain reconfigurable architectures
    Rivaton, Arnaud
    Quevremont, Jerome
    Zhang, Qiwei
    Wolkotte, Pascal
    Smit, Gerard
    2005 INTERNATIONAL SYMPOSIUM ON SYSTEM-ON-CHIP, PROCEEDINGS, 2005, : 74 - 77
  • [35] A dynamically reconfigurable video compression scheme using FPGAs with coarse-grain parallelism
    Ramachandran, S
    Srinivasan, S
    VLSI DESIGN, 2002, 15 (02) : 521 - 528
  • [36] An Area-Efficient Interconnection Network for Coarse-Grain Reconfigurable Cryptographic Array
    Qu, Tongzhou
    Dai, Zibin
    Nan, Longmei
    Li, Wei
    Yin, Anqi
    2017 IEEE 12TH INTERNATIONAL CONFERENCE ON ASIC (ASICON), 2017, : 710 - 713
  • [37] Network topology exploration of mesh-based coarse-grain reconfigurable architectures
    Bansal, N
    Gupta, S
    Dutt, N
    Nicolau, A
    Gupta, R
    DESIGN, AUTOMATION AND TEST IN EUROPE CONFERENCE AND EXHIBITION, VOLS 1 AND 2, PROCEEDINGS, 2004, : 474 - 479
  • [38] A coarse-grain model of DNA
    Knotts, Thomas A.
    Rathore, Nitin
    de Pablo, Juan J.
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2006, 232 : 417 - 417
  • [39] MORA: A new coarse-grain reconfigurable array for high throughput multimedia processing
    Lanuzza, Marco
    Perri, Stefania
    Corsonello, Pasquale
    EMBEDDED COMPUTER SYSTEMS: ARCHITECTURES, MODELING, AND SIMULATION - PROCEEDINGS, 2007, 4599 : 159 - +
  • [40] Energy Efficient Coarse-Grain Reconfigurable Array for Accelerating Digital Signal Processing
    Lanuzza, Marco
    Perri, Stefania
    Corsonello, Pasquale
    Margala, Martin
    INTEGRATED CIRCUIT AND SYSTEMS DESIGN: POWER AND TIMING MODELING, OPTIMIZATION AND SIMULATION, 2009, 5349 : 297 - +