H-SIMD machine: Configurable parallel computing for matrix multiplication

被引:0
|
作者
Xu, XZ [1 ]
Ziavras, SG [1 ]
机构
[1] New Jersey Inst Technol, Dept Elect & Comp Engn, Newark, NJ 07102 USA
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
FPGAs (Field-Programmable Gate Arrays) are often used as coprocessors to boost the performance of dataintensive applications [1, 2]. However, mapping algorithms onto multimillion-gate FPGAs is time consuming and remains a challenge in configurable system design. The communication overhead between the host workstation and the FPGAs is also significant. To address these problems, we propose in this paper the FPGA-based Hierarchical-SIMD (H-SIMD) machine with its codesign of the Hierarchical Instruction Set Architecture (HISA). At each level, HISA instructions are classified into communication instructions or computation instructions. The former are executed by the local controller while the latter are issued to the lower level for execution. Additionally, by using a memory switching scheme and the high-level HISA set to partition the application into coarse-grain tasks, the host-FPGA communication overhead can be hidden. We enlist matrix multiplication (MM) to test the effectiveness of HSIMD. The test results show sustained high performance.
引用
收藏
页码:671 / 676
页数:6
相关论文
共 50 条
  • [1] Configurable Systolic Matrix Multiplication
    Kamranfar, Parastoo
    Shahabi, S. Ali
    Vazhbakht, Ghazaleh
    Nayabi, Zainalabedin
    2014 27TH INTERNATIONAL CONFERENCE ON VLSI DESIGN AND 2014 13TH INTERNATIONAL CONFERENCE ON EMBEDDED SYSTEMS (VLSID 2014), 2014, : 336 - 341
  • [2] Structured sparse matrix-vector multiplication on massively parallel SIMD architectures
    Dehn, T
    Eiermann, M
    Giebermann, K
    Sperling, V
    PARALLEL COMPUTING, 1995, 21 (12) : 1867 - 1894
  • [3] EFFICIENT MATRIX MULTIPLICATION ON SIMD COMPUTERS
    BJORSTAD, P
    MANNE, F
    SOREVIK, T
    VAJTERSIC, M
    SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 1992, 13 (01) : 386 - 401
  • [4] SIMD Parallel Sparse Matrix-Vector and Transposed-Matrix-Vector Multiplication in DD Precision
    Hishinuma, Toshiaki
    Hasegawa, Hidehiko
    Tanaka, Teruo
    HIGH PERFORMANCE COMPUTING FOR COMPUTATIONAL SCIENCE - VECPAR 2016, 2017, 10150 : 21 - 34
  • [5] Design of a Massively Parallel Computing Architecture for Dense Matrix Multiplication
    Jose, Wilson
    Silva, Ana Rita
    Vestias, Mario
    Neto, Horacio
    2013 IEEE 4TH LATIN AMERICAN SYMPOSIUM ON CIRCUITS AND SYSTEMS (LASCAS), 2013,
  • [6] PARALLEL COMPUTING OF MATRIX MULTIPLICATION IN OPEN MP SUPPORTED CODEBLOCKS
    Singh, Hari
    Chander, Dinesh
    Bhatt, Ravindara
    ADVANCES AND APPLICATIONS IN MATHEMATICAL SCIENCES, 2019, 18 (08): : 775 - 787
  • [7] Parallel Matrix Multiplication
    Tomikj, Nikola
    Gusev, Marjan
    2018 41ST INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2018, : 204 - 209
  • [8] General matrix-matrix multiplication using SIMD features of the PIII
    Aberdeen, D
    Baxter, J
    EURO-PAR 2000 PARALLEL PROCESSING, PROCEEDINGS, 2000, 1900 : 980 - 983
  • [9] CMRF: a Configurable Matrix Register File for accelerating matrix operations on SIMD processors
    Zhang, Kai
    Chen, Shuming
    Chen, Hu
    Wang, Yaohua
    Chen, Xiaowen
    Liu, Sheng
    Liu, Wei
    IEICE ELECTRONICS EXPRESS, 2012, 9 (04): : 283 - 289
  • [10] Matrix multiplication by neuromorphic computing
    Lawrence, Sheril
    Yandapalli, Aishwarya
    Rao, Shrisha
    NEUROCOMPUTING, 2021, 431 : 179 - 187