Implementation of FFT and CRBLASTER on the Maestro Processor

被引:0
|
作者
Suh, Jinwoo [1 ]
Mighell, Kenneth J. [2 ]
Kang, Dong-In [1 ]
Crago, Stephen P. [1 ]
机构
[1] Univ Southern Calif, Inst Informat Sci, 3811 N Fairfax Dr,Suite 200, Arlington, VA 22003 USA
[2] Natl Opt Astron Observat, Tucson, AZ 85719 USA
关键词
D O I
暂无
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
Currently, most microprocessors use multiple cores to increase performance while limiting power usage. Some processors use not just a few cores, but tens of cores or even 100 cores. One such many-core microprocessor is the Maestro processor, which is based on Tilera's TILE64 processor. The Maestro chip is a 49-core, general-purpose, radiation-hardened processor designed for space applications. The Maestro processor, unlike the TILE64, has a floating-point coprocessor in each core for improved floating point performance. The Maestro processor runs at up to 350 MHz clock frequency. On the Maestro processor, we implemented an FFT and an image processing application called CRBLASTER and evaluated the performance. The FFT is a well-known and commonly used signal processing kernel. CRBLASTER is a parallel-processing image-analysis application that does cosmic-ray rejection on CCD (charge-coupled device) images using the embarrassingly-parallel L. A. COSMIC algorithm. Both applications were written in C. CRBLASTER uses the high-performance computing industry standard Message Passing Interface (MPI) library. The achieved performance of the FFT was up to 3,813 MFLOPS, and the speedup compared to single tile was 46.4 using 49 tiles. The speedup for CRBLASTER, which was memory-bound, was up to 12.5 using 36 tiles.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Implementation of kernels on the Maestro Processor
    Suh, Jinwoo
    Kang, Dong In D.
    Crago, Stephen P.
    [J]. 2013 IEEE AEROSPACE CONFERENCE, 2013,
  • [2] Implementation of an asynchronous FFT processor
    Seshasayanan, R
    Srivatsa, SK
    Sugavaneswaran, V
    [J]. INDICON 2005 Proceedings, 2005, : 327 - 331
  • [3] Real time FFT processor implementation
    Minallah, Mian Sijjad
    Raja, Gulistan
    [J]. SECOND INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES 2006, PROCEEDINGS, 2006, : 192 - +
  • [4] Design and implementation of an FFT processor for VDSL
    Rudberg, MK
    Sandberg, M
    Ekholm, K
    [J]. APCCAS '98 - IEEE ASIA-PACIFIC CONFERENCE ON CIRCUITS AND SYSTEMS: MICROELECTRONICS AND INTEGRATING SYSTEMS, 1998, : 611 - 614
  • [5] VHDL Implementation of a Flexible and Synthesizable FFT Processor
    Correa, I. S.
    Freitas, L. C.
    Klautau, A.
    Costa, J. C. W. A.
    [J]. IEEE LATIN AMERICA TRANSACTIONS, 2012, 10 (01) : 1180 - 1183
  • [6] A parallel architecture for VLSI implementation of FFT processor
    Peng, YJ
    [J]. 2003 5TH INTERNATIONAL CONFERENCE ON ASIC, VOLS 1 AND 2, PROCEEDINGS, 2003, : 748 - 751
  • [7] Design, optimization, and implementation of a universal FFT processor
    Kumhom, P
    Johnson, JR
    Nagvajara, P
    [J]. 13TH ANNUAL IEEE INTERNATIONAL ASIC/SOC CONFERENCE, PROCEEDINGS, 2000, : 182 - 186
  • [8] An improved VLSI implementation method of FFT processor
    Liu, Guihua
    Feng, Quanyuan
    [J]. 2006 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-4, 2006, : 131 - +
  • [9] An Efficient Implementation Approach to FFT Processor for Spectral Analysis
    Hazarika, Jinti
    Khan, Mohd. Tasleem
    Ahamed, Shaik Rafi
    Nemade, Harshal B. B.
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [10] Locality-improved FFT implementation on a graphics processor
    Romero, Sergio
    Trenas, Maria A.
    Gutierrez, Eladio
    Zapata, Emilio L.
    [J]. PROCEEDINGS OF THE 7TH WSEAS INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMPUTATIONAL GEOMETRY AND ARTIFICIAL VISION (ISCGAV'-07), 2007, : 54 - +