TileNET: Scalable Architecture for High-throughput Ternary Convolution Neural Networks using FPGAs

被引:4
|
作者
Vikram, Sahu Sai [1 ]
Pant, Vibha [2 ]
Mody, Mihir [3 ]
Purnaprajna, Madhura [2 ]
机构
[1] Amrita Univ, Dept Elect & Commun Engn, Bengaluru, India
[2] Amrita Univ, Dept Comp Sci Engn, Bengaluru, India
[3] Texas Instruments Inc, Bengaluru, India
关键词
D O I
10.1109/VLSID.2018.113
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Convolution Neural Networks (CNNs) are becoming increasing popular in Advanced driver assistance systems (ADAS) and Autonomated driving (AD) for camera perception enabling multiple applications like object detection, lane detection and semantic segmentation. Ever increasing need for high resolution multiple cameras around car necessitates a huge-throughput in the order of about few 10's of TeraMACs per second (TMACS) along with high accuracy of detection. Existing implementations do not scale, with performance ranging only in the order of a few Giga operations per second. This paper, proposes a novel tiled architecture for CNNs that uses only ternarized weights, while input and output features are kept full precision resulting in minimal loss of accuracy. The proposed solution is implemented on Virtex-7 FPGA resulting in throughput of 13.76 TOPS. The post-implementation power simulation for AlexNet consumes 16 W, orders of magnitude lower than exist in GPUs.
引用
收藏
页码:461 / 462
页数:2
相关论文
共 50 条
  • [11] A Many-core Architecture for an Ensemble Ternary Neural Network Toward High-Throughput Inference
    Kayanoma, Ryota
    Jinguji, Akira
    Nakahara, Hiroki
    2023 IEEE 16TH INTERNATIONAL SYMPOSIUM ON EMBEDDED MULTICORE/MANY-CORE SYSTEMS-ON-CHIP, MCSOC, 2023, : 446 - 453
  • [12] High-Throughput Classification of Radiographs Using Deep Convolutional Neural Networks
    Rajkomar, Alvin
    Lingam, Sneha
    Taylor, Andrew G.
    Blum, Michael
    Mongan, John
    JOURNAL OF DIGITAL IMAGING, 2017, 30 (01) : 95 - 101
  • [13] Using Neural Networks to Identify More Proteins in High-Throughput Proteomics
    McHugh, Leo
    Arthur, Jonathan
    2009 WORLD CONGRESS ON NATURE & BIOLOGICALLY INSPIRED COMPUTING (NABIC 2009), 2009, : 1676 - 1679
  • [14] High-throughput screening of DeNOx catalyst using artificial neural networks
    Chae, Song Hwa
    Kim, Sang Hun
    Park, Sunwon
    2006 SICE-ICASE INTERNATIONAL JOINT CONFERENCE, VOLS 1-13, 2006, : 1896 - +
  • [15] High-Throughput Classification of Radiographs Using Deep Convolutional Neural Networks
    Alvin Rajkomar
    Sneha Lingam
    Andrew G. Taylor
    Michael Blum
    John Mongan
    Journal of Digital Imaging, 2017, 30 : 95 - 101
  • [16] A Scalable High-Precision and High-Throughput Architecture for Emulation of Quantum Algorithms
    Mahmud, Naveed
    El-Araby, Esam
    2018 31ST IEEE INTERNATIONAL SYSTEM-ON-CHIP CONFERENCE (SOCC), 2018, : 49 - 54
  • [17] DENVIS: Scalable and High-Throughput Virtual Screening Using Graph Neural Networks with Atomic and Surface Protein Pocket Features
    Krasoulis, Agamemnon
    Antonopoulos, Nick
    Pitsikalis, Vassilis
    Theodorakis, Stavros
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2022, : 4642 - 4659
  • [18] Scalable high-throughput variable block size motion estimation architecture
    Warrington, Stephen
    Chan, Wai-Yip
    Sudharsanan, Subramania
    MICROPROCESSORS AND MICROSYSTEMS, 2009, 33 (04) : 319 - 325
  • [19] A scalable architecture for high-throughput regular-expression pattern matching
    Brodie, Benjamin C.
    Cytron, Ron K.
    Taylor, David E.
    33RD INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHTIECTURE, PROCEEDINGS, 2006, : 191 - 202
  • [20] Towards High-throughput Neural Network Inference with Computational BRAM on Nonvolatile FPGAs
    Zhang, Hao
    Zhao, Mengying
    Zheng, Huichuan
    Xiong, Yuqing
    Zhang, Yuhao
    Shen, Zhaoyan
    2024 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, DATE, 2024,