An efficient unified framework for implementation of a prime-length DCT/IDCT with high throughput

被引:13
|
作者
Chiper, Doru-Morin [1 ]
Swamy, M. N. S. [1 ]
Ahmad, M. Omair [1 ]
机构
[1] Concordia Univ, Dept Elect & Comp Engn, Ctr Commun & Signal Proc, Montreal, PQ H3G 1M8, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
discrete cosine transform (DCT); high throughput implementation; systolic arrays; unified DCT/IDCT structure; VLSI algorithms;
D O I
10.1109/TSP.2007.893746
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, a unified design framework for primelength forward and inverse discrete cosine transforms with a high, throughput is presented. The proposed design facilitates trade-off between the throughput and hardware cost or power consumption, and is well suited for low-power applications. The VLSI structure is highly regular and modular with a topology well suited for the VLSI implementation. The proposed approach is based on the derivation of new efficient systolic algorithms. The algorithms have the same core structure for both the transforms, and the core structure consists of two circular correlations, which unlike other similar computational structures, have the same length and form. Thus, they can be computed in parallel and mapped, on the same linear systolic array with channels having a low I/O bandwidth requirement and their number being independent of the transform length N. Further, it is shown that the two transforms, can be efficiently implemented on the same VLSI chip, where only the pre- and post-processing stages are different. The proposed systolic algorithms retain the benefits provided by VLSI implementations based on circular or cyclic convolution structures, and at the same time has a simpler control structure, high speed and low complexity.
引用
收藏
页码:2925 / 2936
页数:12
相关论文
共 50 条
  • [1] Hardware-Efficient Realization of Prime-Length DCT Based on Distributed Arithmetic
    Xie, Jiafeng
    Meher, Pramod Kumar
    He, Jianjun
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2013, 62 (06) : 1170 - 1178
  • [2] An efficient systolic array algorithm for the VLSI implementation of a prime-length DHT
    Chiper, DF
    Swamy, MNS
    Ahmad, MO
    [J]. ISSCS 2005: INTERNATIONAL SYMPOSIUM ON SIGNALS, CIRCUITS AND SYSTEMS, VOLS 1 AND 2, PROCEEDINGS, 2005, : 167 - 169
  • [3] Fast algorithm for computing prime-length DCT using cyclic convolutions
    [J]. 2001, Nanjing University of Aeronautics an Astronautics (16):
  • [4] A new fast algorithm for computing prime-length DCT through cyclic convolutions
    Yin, RX
    Siu, WC
    [J]. SIGNAL PROCESSING, 2001, 81 (05) : 895 - 906
  • [5] DATA MAPPING SCHEME AND IMPLEMENTATION FOR HIGH-THROUGHPUT DCT/IDCT TRANSPOSE MEMORY
    Xie, Zheng
    Lu, Yanheng
    Fan, Yibo
    Zeng, Xiaoyang
    [J]. 2014 12TH IEEE INTERNATIONAL CONFERENCE ON SOLID-STATE AND INTEGRATED CIRCUIT TECHNOLOGY (ICSICT), 2014,
  • [6] Techniques for efficient DCT/IDCT implementation on generic GPU
    Fang, B
    Shen, GB
    Li, SP
    Chen, HF
    [J]. 2005 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), VOLS 1-6, CONFERENCE PROCEEDINGS, 2005, : 1126 - 1129
  • [7] An efficient method for hardware based DCT/IDCT implementation
    Sun, XT
    Wu, CK
    [J]. NEURAL NETWORK AND DISTRIBUTED PROCESSING, 2001, 4555 : 6 - 10
  • [8] High throughput 2D DCT/IDCT processor for video coding
    Ruiz, GA
    Michell, JA
    Burón, AM
    [J]. 2005 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), VOLS 1-5, 2005, : 3521 - 3524
  • [9] An efficient prime-length DFT algorithm over finite fields GF(2 m)
    Gappmair, Wilfried
    [J]. European Transactions on Telecommunications, 2003, 14 (02): : 171 - 176
  • [10] Finite word-length effects of an unified systolic array for 2-D DCT/IDCT
    Lim, H
    Yim, C
    Swartzlander, EE
    [J]. INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS 1996, PROCEEDINGS, 1996, : 35 - 44