Optimizing Soft Vector Processing in FPGA-Based Embedded Systems

被引:4
|
作者
Kapre, Nachiket [1 ]
机构
[1] Nanyang Technol Univ, Sch Comp Engn, 50 Nanyang Ave, Singapore 639798, Singapore
关键词
Vector Processors; Embedded Systems; Compilers; Vector processors; soft processors; streaming computations;
D O I
10.1145/2912884
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Soft vector processors can augment and extend the capability of FPGA-based embedded systems-on-chip such as the Xilinx Zynq. However, configuring and optimizing the soft processor for best performance is hard. We must consider architectural parameters such as precision, vector lane count, vector length, chunk size, and DMA scheduling to ensure efficient execution of code on the soft vector processing platform. To simplify the design process, we develop a compiler framework and an autotuning runtime that splits the optimization into a combination of static and dynamic passes that map data-parallel computations to the soft processor. We compare and contrast implementations running on the scalar ARM processor, the embedded NEON hard vector engine, and low-level streaming Verilog designs with the VectorBlox MXP soft vector processor. Across a range of data-parallel benchmarks, we show that the MXP soft vector processor can outperform other organizations by up to 4x while saving approximate to 10% dynamic power. Our compilation and runtime framework is also able to outperform the gccNEON vectorizer under certain conditions by explicit generation of NEON intrinsics and performance tuning of the autogenerated data-parallel code. When constrained by IO bandwidth, soft vector processors are even competitive with spatial Verilog implementations of computation.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] FPGA-based platform for image and video processing embedded systems
    Toledo, F. Javier
    Martinez, J. Javier
    Ferrandez, J. Manuel
    [J]. 2007 3RD SOUTHERN CONFERENCE ON PROGRAMMABLE LOGIC, PROCEEDINGS, 2007, : 171 - +
  • [2] FPGA-based vector processing for matrix operations
    Yang, Hongyan
    Ziavras, Sotirios G.
    Hu, Jie
    [J]. INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY, PROCEEDINGS, 2007, : 989 - +
  • [3] Sorting units for FPGA-based embedded systems
    Marcelino, Rui
    Neto, Horacio
    Cardoso, Joao M. P.
    [J]. DISTRIBUTED EMBEDDED SYSTEMS: DESIGN, MIDDLEWARE AND RESOURCES, 2008, : 11 - 22
  • [4] Configuration compression for FPGA-based embedded systems
    Dandalis, A
    Prasanna, VK
    [J]. IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2005, 13 (12) : 1394 - 1398
  • [5] Managing Security in FPGA-Based Embedded Systems
    Huffmire, Ted
    Brotherton, Brett
    Sherwood, Timothy
    Kastner, Ryan
    Levin, Timothy
    Nguyen, Thuy D.
    Irvine, Cynthia
    [J]. IEEE DESIGN & TEST OF COMPUTERS, 2008, 25 (06): : 590 - 598
  • [6] A distributed platform for integration of FPGA-based embedded systems
    Morais, Daniel C.
    Nascimento, Tiago P.
    Brito, Alisson V.
    Silva, Thiago Werlley B.
    Melcher, Elmar U. K.
    [J]. 2016 VI BRAZILIAN SYMPOSIUM ON COMPUTING SYSTEMS ENGINEERING (SBESC 2016), 2016, : 86 - 92
  • [7] Investigation of transient effects on FPGA-based embedded systems
    Bakhoda, A
    Miremadi, SG
    Zarandi, HR
    [J]. ICESS 2005: SECOND INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS, 2005, : 231 - 236
  • [8] Hardware Decompression Techniques for FPGA-Based Embedded Systems
    Koch, Dirk
    Beckhoff, Christian
    Teich, Juergen
    [J]. ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2009, 2 (02)
  • [9] A comparison of profiling tools for FPGA-Based embedded systems
    Tong, Jason G.
    Khalid, Mohammed A. S.
    [J]. 2007 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1-3, 2007, : 1687 - 1690
  • [10] A PUF Design for Secure FPGA-Based Embedded Systems
    Anderson, Jason H.
    [J]. 2010 15TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC 2010), 2010, : 1 - 6