Design of a Reconfigurable Coprocessor for Double Precision Floating Point Matrix Algorithms

被引:0
|
作者
Li, Shenglong [1 ]
Li, Zhaolin [2 ]
Zheng, Qingwei [1 ]
机构
[1] Tsinghua Univ, Inst Microelect, Beijing 100084, Peoples R China
[2] Tsinghua Univ, Res Inst Informat Technol, Tsinghua Natl Lab Informat Sci & Technol, Beijing 100084, Peoples R China
关键词
Reconfigurable Coprocessor; Multi-Processor System on Chip; Quaternion Algorithm; Double Precision Floating Point Matrix Algorithms;
D O I
10.4028/www.scientific.net/AMM.58-60.1037
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Double precision floating point matrix operations are wildly used in a variety of engineering and scientific computing applications. However, it's inefficient to achieve these operations using software approaches on general purpose processors. In order to reduce the processing time and satisfy the real-time demand, a reconfigurable coprocessor for double precision floating point matrix algorithms is proposed in this paper. The coprocessor is embedded in a Multi-Processor System on Chip (MPSoC), cooperates with an ARM core and a DSP core for high-performance control and calculation. One algorithm in GPS applications is taken for example to illustrate the efficiency of the coprocessor proposed in this paper. The experiment result shows that the coprocessor can achieve speedup a factor of 50 for the quaternion algorithm of attitude solution in inertial navigation application compare with software execution time of a TI C6713 DSP. The coprocessor is implemented in SMIC 0.13 mu m CMOS technology, the synthesis time delay is 9.75ns, and the power consumption is 63.69 mW when it works at 100MHz.
引用
收藏
页码:1037 / +
页数:2
相关论文
共 50 条
  • [41] Design of a Dynamically Reconfigurable Arithmetic Unit for Matrix Algorithms
    Wang, Weijiang
    Ding, Yingtao
    Cao, Shan
    Zhao, Xianli
    PROCEEDINGS OF 2015 IEEE 11TH INTERNATIONAL CONFERENCE ON ASIC (ASICON), 2015,
  • [42] Run-Time-Reconfigurable Multi-Precision Floating-Point Matrix Multiplier Intellectual Property Core on FPGA
    Arish, S.
    Sharma, R. K.
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2017, 36 (03) : 998 - 1026
  • [43] An FPGA based coprocessor for matrix algorithms implementation
    Amira, A
    Bensaali, F
    OPTP-IRELAND 2002: OPTICAL METROLOGY, IMAGING, AND MACHINE VISION, 2003, 4877 : 271 - 277
  • [44] Run-Time-Reconfigurable Multi-Precision Floating-Point Matrix Multiplier Intellectual Property Core on FPGA
    S. Arish
    R. K. Sharma
    Circuits, Systems, and Signal Processing, 2017, 36 : 998 - 1026
  • [45] Performance Evaluation of Strassen Matrix Multiplication Supporting Triple-Double Precision Floating-Point Arithmetic
    Kouya, Tomonori
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2020, PT V, 2020, 12253 : 163 - 176
  • [46] Design and Implementation of Differential Evolution Algorithm on FPGA for Double-Precision Floating-Point Representation
    Cortes-Antonio, Prometeo
    Rangel-Gonzalez, Josue
    Villa-Vargas, Luis A.
    Antonio Ramirez-Salinas, Marco
    Molina-Lozano, Heron
    Batyrshin, Ildar
    ACTA POLYTECHNICA HUNGARICA, 2014, 11 (04) : 139 - 153
  • [47] Design of Single Precision Floating Point Arithmetic Logic Unit
    Singh, Nisha
    Dhanabal, R.
    2018 4TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENERGY SYSTEMS (ICEES), 2018, : 133 - 137
  • [48] VFloat: A Variable Precision Fixed- and Floating-Point Library for Reconfigurable Hardware
    Wang, Xiaojun
    Leeser, Miriam
    ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2010, 3 (03)
  • [49] A Reconfigurable Processing Element for Multiple-Precision Floating/Fixed-Point HPC
    Li, Boyu
    Li, Kai
    Zhou, Jiajun
    Ren, Yuan
    Mao, Wei
    Yu, Hao
    Wong, Ngai
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2024, 71 (03) : 1401 - 1405
  • [50] Implementation of a floating-point matrix-vector multiplication on a reconfigurable architecture
    Garzia, Fabio
    Brunelli, Claudio
    Rossi, Davide
    Nurmi, Jari
    2008 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL & DISTRIBUTED PROCESSING, VOLS 1-8, 2008, : 3496 - +