H-SIMD machine: Configurable parallel computing for matrix multiplication

Cited by: 0

Authors:

Xu, XZ [1]

Ziavras, SG [1]

Affiliation:

[1] New Jersey Inst Technol, Dept Elect & Comp Engn, Newark, NJ 07102 USA

Source:

2005 IEEE International Conference on Computer Design: VLSI in Computers & Processors, Proceedings | 2005

Keywords:

DOI:

Not available

Chinese Library Classification:

TP3 [Computing technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

FPGAs (Field-Programmable Gate Arrays) are often used as coprocessors to boost the performance of data-intensive applications [1, 2]. However, mapping algorithms onto multimillion-gate FPGAs is time-consuming and remains a challenge in configurable system design. The communication overhead between the host workstation and the FPGAs is also significant. To address these problems, we propose the FPGA-based Hierarchical-SIMD (H-SIMD) machine, codesigned with the Hierarchical Instruction Set Architecture (HISA). At each level, HISA instructions are classified as either communication instructions or computation instructions. The former are executed by the local controller, while the latter are issued to the lower level for execution. Additionally, by using a memory switching scheme and the high-level HISA set to partition the application into coarse-grain tasks, the host-FPGA communication overhead can be hidden. We use matrix multiplication (MM) to test the effectiveness of H-SIMD. The test results show sustained high performance.
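The instruction routing described in the abstract can be illustrated with a small software sketch. It is not the authors' design: the `Controller` class, the `"comm"`/`"comp"` instruction kinds, the three-level host/FPGA/PE chain, and the block size are all assumptions made for illustration, and the memory switching scheme is not modeled. Each level handles communication instructions locally and passes computation instructions down, while a blocked matrix multiplication supplies the coarse-grain tasks.

```python
from dataclasses import dataclass, field
from typing import List, Optional

Matrix = List[List[float]]


def matmul(a: Matrix, b: Matrix) -> Matrix:
    """Plain triple-loop product; stands in for the lowest-level compute unit."""
    inner, cols = len(b), len(b[0])
    return [[sum(row[p] * b[p][j] for p in range(inner)) for j in range(cols)]
            for row in a]


def block(mat: Matrix, r: int, c: int, size: int) -> Matrix:
    """Extract the size-by-size block whose top-left corner is (r, c)."""
    return [row[c:c + size] for row in mat[r:r + size]]


@dataclass
class Controller:
    """Hypothetical controller for one level of the hierarchy."""
    name: str
    lower: Optional["Controller"] = None
    log: List[str] = field(default_factory=list)

    def execute(self, kind: str, payload):
        if kind == "comm":
            # Communication instructions are handled by the local controller.
            self.log.append(payload)
            return None
        if self.lower is not None:
            # Computation instructions are issued to the lower level.
            return self.lower.execute(kind, payload)
        # Lowest level: actually perform the block product.
        a, b = payload
        return matmul(a, b)


def blocked_matmul(top: Controller, a: Matrix, b: Matrix, size: int) -> Matrix:
    """Partition C = A * B into coarse-grain block tasks (square matrices assumed)."""
    n = len(a)
    c = [[0.0] * n for _ in range(n)]
    for i in range(0, n, size):
        for j in range(0, n, size):
            for k in range(0, n, size):
                top.execute("comm", f"load A[{i},{k}] B[{k},{j}]")
                part = top.execute("comp", (block(a, i, k, size), block(b, k, j, size)))
                # Accumulate the partial block product into C.
                for r, row in enumerate(part):
                    for s, value in enumerate(row):
                        c[i + r][j + s] += value
    return c


if __name__ == "__main__":
    host = Controller("host", Controller("fpga", Controller("pe")))
    a = [[1, 2, 3, 4], [5, 6, 7, 8], [9, 10, 11, 12], [13, 14, 15, 16]]
    b = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 1, 0], [0, 0, 0, 1]]
    print(blocked_matmul(host, a, b, size=2) == matmul(a, b))
```

In this sketch only the top-level controller logs communication instructions, because the block tasks are issued at the host level; a fuller model would let each level generate its own communication and computation instructions for the level beneath it.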


Pages: 671-676

Page count: 6

50 items in total

[1] Configurable Systolic Matrix Multiplication
Kamranfar, Parastoo
Shahabi, S. Ali
Vazhbakht, Ghazaleh
Nayabi, Zainalabedin
2014 27TH INTERNATIONAL CONFERENCE ON VLSI DESIGN AND 2014 13TH INTERNATIONAL CONFERENCE ON EMBEDDED SYSTEMS (VLSID 2014), 2014 : 336 - 341
[2] Structured sparse matrix-vector multiplication on massively parallel SIMD architectures
Dehn, T
Eiermann, M
Giebermann, K
Sperling, V
PARALLEL COMPUTING, 1995, 21 (12) : 1867 - 1894
[3] EFFICIENT MATRIX MULTIPLICATION ON SIMD COMPUTERS
BJORSTAD, P
MANNE, F
SOREVIK, T
VAJTERSIC, M
SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 1992, 13 (01) : 386 - 401
[4] SIMD Parallel Sparse Matrix-Vector and Transposed-Matrix-Vector Multiplication in DD Precision
Hishinuma, Toshiaki
Hasegawa, Hidehiko
Tanaka, Teruo
HIGH PERFORMANCE COMPUTING FOR COMPUTATIONAL SCIENCE - VECPAR 2016, 2017, 10150 : 21 - 34
[5] Design of a Massively Parallel Computing Architecture for Dense Matrix Multiplication
Jose, Wilson
Silva, Ana Rita
Vestias, Mario
Neto, Horacio
2013 IEEE 4TH LATIN AMERICAN SYMPOSIUM ON CIRCUITS AND SYSTEMS (LASCAS), 2013
[6] PARALLEL COMPUTING OF MATRIX MULTIPLICATION IN OPEN MP SUPPORTED CODEBLOCKS
Singh, Hari
Chander, Dinesh
Bhatt, Ravindara
ADVANCES AND APPLICATIONS IN MATHEMATICAL SCIENCES, 2019, 18 (08) : 775 - 787
[7] Parallel Matrix Multiplication
Tomikj, Nikola
Gusev, Marjan
2018 41ST INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2018 : 204 - 209
[8] General matrix-matrix multiplication using SIMD features of the PIII
Aberdeen, D
Baxter, J
EURO-PAR 2000 PARALLEL PROCESSING, PROCEEDINGS, 2000, 1900 : 980 - 983
[9] CMRF: a Configurable Matrix Register File for accelerating matrix operations on SIMD processors
Zhang, Kai
Chen, Shuming
Chen, Hu
Wang, Yaohua
Chen, Xiaowen
Liu, Sheng
Liu, Wei
IEICE ELECTRONICS EXPRESS, 2012, 9 (04) : 283 - 289
[10] Matrix multiplication by neuromorphic computing
Lawrence, Sheril
Yandapalli, Aishwarya
Rao, Shrisha
NEUROCOMPUTING, 2021, 431 : 179 - 187
