Hydra: An Accelerator for Real-Time Edge-Aware Permeability Filtering in 65nm CMOS

被引:0
|
作者
Eggimann, M. [1 ]
Gloor, C. [1 ]
Scheidegger, F. [1 ]
Cavigelli, L. [1 ]
Schaffner, M. [1 ]
Smolic, A. [2 ]
Benini, L. [1 ]
机构
[1] Swiss Fed Inst Technol, Integrated Syst Lab IIS, Zurich, Switzerland
[2] Trinity Coll Dublin, Dublin, Ireland
关键词
IMAGE;
D O I
10.1109/ISCAS.2018.8351051
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Many modern video processing pipelines rely on edge-aware (EA) filtering methods. However, recent high-quality methods are challenging to run in real-time on embedded hardware due to their computational load. To this end, we propose an area-efficient and real-time capable hardware implementation of a high quality EA method. In particular, we focus on the recently proposed permeability filter (PF) that delivers promising quality and performance in the domains of high dynamic range (HDR) tone mapping, disparity and optical flow estimation. We present an efficient hardware accelerator that implements a tiled variant of the PF with low on-chip memory requirements and a significantly reduced external memory bandwidth (6.4 x w.r.t. the non-tiled PF). The design has been taped out in 65nm CMOS technology, is able to filter 720p grayscale video at 24:8 Hz and achieves a high compute density of 6:7GFLOPS=mm(2) (12 x higher than embedded GPUs when scaled to the same technology node). The low area and bandwidth requirements make the accelerator highly suitable for integration into systems-on-chip (SoCs) where silicon area budget is constrained and external memory is typically a heavily contended resource.
引用
收藏
页数:5
相关论文
共 50 条
  • [21] A 0.1pJ Freeze Vernier Time-to-Digital Converter in 65nm CMOS
    Blutman, Kristof
    Angevare, Jan
    Zjajo, Amir
    van der Meijs, Nick
    [J]. 2014 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2014, : 85 - 88
  • [22] A 550ps access-time compilable SRAM in 65nm CMOS technology
    Wissel, Larry
    Pilo, Harold
    LeBlanc, Chris
    Wang, Xiaopeng
    Lamphier, Steve
    Fragano, Michael
    [J]. PROCEEDINGS OF THE IEEE 2007 CUSTOM INTEGRATED CIRCUITS CONFERENCE, 2007, : 21 - 24
  • [23] Altitude and Underground Real-Time SER Testing of SRAMs Manufactured in CMOS Bulk 130, 65 and 40 nm
    Autran, Jean-Luc
    Munteanu, Daniela
    Sauze, S.
    Gasiot, G.
    Roche, P.
    [J]. 2014 IEEE RADIATION EFFECTS DATA WORKSHOP (REDW), 2014,
  • [24] CMOS 65 nm 'on chip' broadband real time substrate noise measurement
    Noulis, T.
    Lourandakis, E.
    Stefanou, S.
    Merakos, P.
    [J]. ELECTRONICS LETTERS, 2015, 51 (21) : 1710 - U107
  • [25] A Cyclic Vernier Time-to-Digital Converter Synthesized from a 65nm CMOS Standard Library
    Park, Youngmin
    Wentzloff, David D.
    [J]. 2010 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, 2010, : 3561 - 3564
  • [26] A Low-Power Coarse-Fine Time-to-Digital Converter in 65nm CMOS
    Zhang, Xue-Jiao
    Cui, Ke-Ji
    Zou, Zhuo
    Zheng, Li-Rong
    [J]. 2015 INTERNATIONAL SYMPOSIUM ON SIGNALS, CIRCUITS AND SYSTEMS (ISSCS), 2015,
  • [27] Edge-preserving texture filtering for real-time rendering
    Lee, YC
    Jen, CW
    [J]. VISUAL COMPUTER, 2003, 19 (01): : 10 - 22
  • [28] Edge-preserving texture filtering for real-time rendering
    Yuan-Chung Lee
    Chein-Wei Jen
    [J]. The Visual Computer, 2003, 19 : 10 - 22
  • [29] A Low-Power 1GHz Razor FIR Accelerator with Time-Borrow Tracking Pipeline and Approximate Error Correction in 65nm CMOS
    Whatmough, Paul N.
    Das, Shidhartha
    Bull, David M.
    [J]. 2013 IEEE INTERNATIONAL SOLID-STATE CIRCUITS CONFERENCE DIGEST OF TECHNICAL PAPERS (ISSCC), 2013, 56 : 428 - U613
  • [30] Analog NVM Synapse for Hardware-Aware Neural Network Training Optimization on 65nm CMOS TaOx ReRAM Devices
    Liehr, Maximilian
    Abedin, Minhaz
    Beckmann, Karsten
    Cady, Nathaniel
    [J]. 2023 IEEE 32ND MICROELECTRONICS DESIGN & TEST SYMPOSIUM, MDTS, 2023,