A systolic design methodology with application to full-search block-matching architectures

Cited by: 8
Authors
Chen, YK [1]
Kung, SY [1]
Affiliations
[1] Princeton Univ, Dept Elect Engn, Princeton, NJ 08544 USA
Keywords
DOI
10.1023/A:1008012332212
Chinese Library Classification number
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
We present a systematic methodology to support the design tradeoffs of array processors with respect to several emerging issues: (1) high performance and high flexibility, (2) low cost and low power, (3) efficient memory usage, and (4) system-on-a-chip design, or ease of system integration. The methodology is algebra-based, so it can cope with high-dimensional data dependence. It consists of transformation rules on data dependency graphs (DGs) that facilitate flexible array designs. For example, two common partitioning approaches, LPGS and LSGP, can be unified under the methodology. It supports the design of high-speed, massively parallel processor arrays with efficient memory usage. More specifically, it leads to a novel systolic cache architecture comprising shift registers only (a cache without tags). To demonstrate how the methodology works, we present several systolic design examples based on the block-matching motion estimation algorithm (BMA). By multiprojecting a 4D DG of the BMA onto a 2D mesh, we can reconstruct several existing array processors. By multiprojecting a 6D DG of the BMA, we can derive a novel 2D systolic array that features significantly improved data reusability (96%) and processor utilization (99%).
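For readers unfamiliar with the BMA, the following is a minimal reference sketch of full-search block matching, not the authors' architecture. It assumes the sum of absolute differences (SAD) as the matching criterion, which the abstract does not specify, and the names `full_search_bma`, `cur`, `ref`, `n`, and `p` are hypothetical. One plausible reading of the DG dimensions is that the four inner loops (displacement `v`, `u` and pixel `i`, `j`) span a 4D index space for a single block, and the two outer block loops extend it to 6D.

```python
def full_search_bma(cur, ref, n, p):
    """Full-search block matching using SAD (assumed criterion).

    cur, ref: 2D lists of ints with the same H x W shape,
              where H and W are multiples of n.
    n: block size; p: search range, displacements lie in [-p, p].
    Returns a dict mapping block index (bx, by) to (u, v, sad).
    """
    h, w = len(cur), len(cur[0])
    result = {}
    for by in range(h // n):                  # block row
        for bx in range(w // n):              # block column
            best = None
            for v in range(-p, p + 1):        # vertical displacement
                for u in range(-p, p + 1):    # horizontal displacement
                    y0, x0 = by * n + v, bx * n + u
                    # Skip candidates that fall outside the reference frame.
                    if y0 < 0 or x0 < 0 or y0 + n > h or x0 + n > w:
                        continue
                    sad = 0
                    for i in range(n):        # pixel row inside the block
                        for j in range(n):    # pixel column inside the block
                            sad += abs(cur[by * n + i][bx * n + j]
                                       - ref[y0 + i][x0 + j])
                    # Keep the first candidate with the smallest SAD.
                    if best is None or sad < best[2]:
                        best = (u, v, sad)
            result[(bx, by)] = best
    return result
```

Every candidate block is compared exhaustively, so neighbouring candidates reread mostly the same reference pixels; that redundancy is what makes data reuse a natural design target for array processors.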
Pages: 51-77
Number of pages: 27
Related papers
50 in total
  • [1] A Systolic Design Methodology with Application to Full-Search Block-Matching Architectures
    Yen-Kuang Chen
    S.Y. Kung
    [J]. Journal of VLSI signal processing systems for signal, image and video technology, 1998, 19 : 51 - 77
  • [2] PARAMETERIZABLE VLSI ARCHITECTURES FOR THE FULL-SEARCH BLOCK-MATCHING ALGORITHM
    DEVOS, L
    STEGHERR, M
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS, 1989, 36 (10) : 1309 - 1316
  • [3] Design and implementation of efficient VLSI architectures for full-search block-matching motion estimation
    Mao, ZG
    He, WF
    Zhou, WB
    [J]. PROCEEDINGS OF THE THIRD INTERNATIONAL SYMPOSIUM ON INSTRUMENTATION SCIENCE AND TECHNOLOGY, VOL 1, 2004 : 616 - 620
  • [4] Systolic array architectures for full-search block matching motion estimation
    Elgamel, MA
    Nallamilli, BR
    Bayoumi, MA
    Mashaly, S
    [J]. THIRD INTERNATIONAL WORKSHOP ON DIGITAL AND COMPUTATIONAL VIDEO, PROCEEDINGS, 2002 : 108 - 115
  • [5] Cost Effective VLSI Architectures for Full-Search Block-Matching Motion Estimation Algorithm
    Zhong L. He
    Ming L. Liou
    [J]. Journal of VLSI signal processing systems for signal, image and video technology, 1997, 17 : 225 - 240
  • [6] Efficient and configurable full-search block-matching processors
    Roma, N
    Sousa, L
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2002, 12 (12) : 1160 - 1167
  • [7] Cost effective VLSI architectures for full-search block-matching motion estimation algorithm
    He, ZL
    Liou, ML
    [J]. JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 1997, 17 (2-3) : 225 - 240
  • [8] A hierarchical design methodology for full-search block matching motion estimation
    Rehan, Mohamed
    El-Kharashi, M. Watheq
    Gebali, Fayez
    [J]. MULTIDIMENSIONAL SYSTEMS AND SIGNAL PROCESSING, 2006, 17 (04) : 327 - 341
  • [9] A hierarchical design methodology for full-search block matching motion estimation
    Mohamed Rehan
    M. Watheq El-Kharashi
    Fayez Gebali
    [J]. Multidimensional Systems and Signal Processing, 2006, 17 : 327 - 341
  • [10] Hierarchical systolic array design for full-search block matching motion estimation
    Gebali, F
    [J]. PROCEEDINGS OF THE FOURTH IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY, 2004 : 76 - 80