Hardware efficient fast DCT based on novel cyclic convolution structures

被引:36
|
作者
Cheng, Chao [1 ]
Parhi, Keshab K. [1 ]
机构
[1] Univ Minnesota, Dept Elect & Comp Engn, Minneapolis, MN 55455 USA
基金
美国国家科学基金会;
关键词
cyclic convolution; discrete cosine transforms; linear convolution; very large-scale integration;
D O I
10.1109/TSP.2006.881269
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Cyclic convolution is a widely used operation in signal processing. In very large-scale integration (VLSI) design, it is usually implemented with systolic array and distributed arithmetic; however, these implementation designs may not be fast enough or use too much hardware cost when the convolution length is large. This paper presents a new fast cyclic convolution algorithm, which is hardware efficient and suitable for high-speed VLSI implementation, especially when the convolution length is large. For example, when the proposed fast cyclic convolution algorithm is applied to the implementation of prime length discrete cosine transform (DCT), the proposed high-throughput implementation of 1297-length DCT design saves 1216 (94%) multiplications, 282 (22%) additions, and 4792 (74%) delay elements compared with those of recently proposed systolic array based algorithms. Furthermore, the proposed algorithm can run at a speed that is 1.5 times that of previous designs and requires less I/O cost as long as the wordlength L is less than 20 bits.
引用
收藏
页码:4419 / 4434
页数:16
相关论文
共 50 条
  • [1] Hardware efficient fast parallel FIR filter structures based on Iterated Short Convolution
    Cheng, C
    Parhi, KK
    [J]. 2004 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL 3, PROCEEDINGS, 2004, : 361 - 364
  • [2] Hardware efficient fast parallel FIR filter structures based on iterated short convolution
    Cheng, C
    Parhi, KK
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2004, 51 (08) : 1492 - 1500
  • [3] An efficient algorithm for cyclic convolution based on fast-polynomial and fast-W transforms
    Lizhi C.
    Zengrong J.
    [J]. Circuits, Systems and Signal Processing, 2001, Birkhauser Boston (20) : 77 - 88
  • [4] An efficient algorithm for cyclic convolution based on fast-polynomial and fast-W transforms
    Cheng, LH
    Jiang, ZR
    [J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2001, 20 (01) : 77 - 88
  • [5] FAST DCT-BASED ALGORITHMS FOR SIGNAL CONVOLUTION AND TRANSLATION
    Bilevich, Leonid
    Yaroslavsky, Leonid
    [J]. 2009 16TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING, VOLS 1 AND 2, 2009, : 898 - 903
  • [6] An efficient method for hardware based DCT/IDCT implementation
    Sun, XT
    Wu, CK
    [J]. NEURAL NETWORK AND DISTRIBUTED PROCESSING, 2001, 4555 : 6 - 10
  • [7] Hardware-efficient DFT designs with cyclic convolution and subexpression sharing
    Chang, TS
    Guo, JI
    Jen, CW
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-ANALOG AND DIGITAL SIGNAL PROCESSING, 2000, 47 (09): : 886 - 892
  • [8] An efficient 2-D DCT/IDCT core design using cyclic convolution and adder-based realization
    Guo, JI
    Ju, RC
    Chen, JW
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2004, 14 (04) : 416 - 428
  • [9] NOTE ON FAST CYCLIC CONVOLUTION
    ZALCSTEI.Y
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 1971, C 20 (06) : 665 - &
  • [10] Design and Analysis of A Novel High Speed Adder Based Hardware Efficient Discrete Cosine Transform (DCT)
    Kiran, K. Ravi
    Kumar, C. Ashok
    Kumar, M. Suresh
    [J]. 2015 Fifth International Conference on Advances in Computing and Communications (ICACC), 2015, : 169 - 173