Quark: An Integer RISC-V Vector Processor for Sub-Byte Quantized DNN Inference

被引:3
|
作者
AskariHemmat, MohammadHossein [1 ]
Dupuis, Theo [1 ]
Fournier, Yoan [1 ]
El Zarif, Nizar [1 ]
Cavalcante, Matheus [2 ]
Perotti, Matteo [2 ]
Gurkaynak, Frank [2 ]
Benini, Luca [2 ]
Leduc-Primeau, Francois [1 ]
Savaria, Yvon [1 ]
David, Jean-Pierre [1 ]
机构
[1] Ecole Polytech Montreal, Dept Elect Engn, Montreal, PQ, Canada
[2] Swiss Fed Inst Technol, Integrated Syst Lab, Zurich, Switzerland
关键词
RISC-V; Vector ISA; Quantization; Machine Learning; Efficiency; ENERGY;
D O I
10.1109/ISCAS46773.2023.10181985
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present Quark, an integer RISC-V vector processor specifically tailored for sub-byte DNN inference. Quark is implemented in GlobalFoundries' 22FDX FD-SOI technology. It is designed on top of Ara, an open-source 64-bit RISC-V vector processor. To accommodate sub-byte DNN inference, Quark extends Ara by adding specialized vector instructions to perform sub-byte quantized operations. We also remove the floating-point unit from Quarks' lanes and use the CVA6 RISC-V scalar core for the re-scaling operations that are required in quantized neural network inference. This makes each lane of Quark 2 times smaller and 1.9 times more power efficient compared to the ones of Ara. In this paper we show that Quark can run quantized models at sub-byte precision. Notably we show that for 1-bit and 2-bit quantized models, Quark can accelerate computation of Conv2d over various ranges of inputs and kernel sizes.
引用
收藏
页数:5
相关论文
共 42 条
  • [41] Yun: An Open-Source, 64-Bit RISC-V-Based Vector Processor With Multi-Precision Integer and Floating-Point Support in 65-nm CMOS
    Perotti, Matteo
    Cavalcante, Matheus
    Ottaviano, Alessandro
    Liu, Jiantao
    Benini, Luca
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2023, 70 (10) : 3732 - 3736
  • [42] MANIC: A 19μW @ 4MHz, 256 MOPS/mW, RISC-V microcontroller with embedded MRAM main memory and vector-dataflow co-processor in 22nm bulk finFET CMOS
    Gobieski, Graham
    Atli, Oguz
    Erbagci, Cagri
    Mai, Ken
    Beckmann, Nathan
    Lucia, Brandon
    2023 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS, 2023,