HW-FlowQ: A Multi-Abstraction Level HW-CNN Co-design Quantization Methodology

被引:7
|
作者
Fasfous, Nael [1 ]
Vemparala, Manoj Rohit [2 ]
Frickenstein, Alexander [2 ]
Valpreda, Emanuele [3 ]
Salihu, Driton [1 ]
Nguyen Anh Vu Doan [1 ]
Unger, Christian [2 ]
Nagaraja, Naveen Shankar [2 ]
Martina, Maurizio [3 ]
Stechele, Walter [1 ]
机构
[1] Tech Univ Munich, Munich, Germany
[2] BMW Autonomous Driving, Munich, Germany
[3] Politecn Torino, Turin, Italy
关键词
Convolutional neural networks; multi-objective optimization; hardware modeling; genetic algorithms; quantization; CONVOLUTION;
D O I
10.1145/3476997
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Model compression through quantization is commonly applied to convolutional neural networks (CNNs) deployed on compute and memory-constrained embedded platforms. Different layers of the CNN can have varying degrees of numerical precision for both weights and activations, resulting in a large search space. Together with the hardware (HW) design space, the challenge of finding the globally optimal HW-CNN combination for a given application becomes daunting. To this end, we propose HW-FlowQ, a systematic approach that enables the co-design of the target hardware platform and the compressed CNN model through quantization. The search space is viewed at three levels of abstraction, allowing for an iterative approach for narrowing down the solution space before reaching a high-fidelity CNN hardware modeling tool, capable of capturing the effects of mixed-precision quantization strategies on different hardware architectures (processing unit counts, memory levels, cost models, dataflows) and two types of computation engines (bit-parallel vectorized, bit-serial). To combine both worlds, a multi-objective non-dominated sorting genetic algorithm (NSGA-II) is leveraged to establish a Pareto-optimal set of quantization strategies for the target HW-metrics at each abstraction level. HW-FlowQ detects optima in a discrete search space and maximizes the task-related accuracy of the underlying CNN while minimizing hardware-related costs. The Pareto-front approach keeps the design space open to a range of non-dominated solutions before refining the design to a more detailed level of abstraction. With equivalent prediction accuracy, we improve the energy and latency by 20% and 45% respectively for ResNet56 compared to existing mixed-precision search methods.
引用
收藏
页数:25
相关论文
共 26 条
  • [1] AnaCoNGA: Analytical HW-CNN Co-Design Using Nested Genetic Algorithms
    Fasfous, Nael
    Vemparala, Manoj Rohit
    Frickenstein, Alexander
    Valpreda, Emanuele
    Salihu, Driton
    Hoefer, Julian
    Singh, Anmol
    Nagaraja, Naveen-Shankar
    Voegel, Hans-Joerg
    Nguyen Anh Vu Doan
    Martina, Maurizio
    Becker, Juergen
    Stechele, Walter
    PROCEEDINGS OF THE 2022 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2022), 2022, : 238 - 243
  • [2] Toward Efficient Co-Design of CNN Quantization and HW Architecture on FPGA Hybrid-Accelerator
    Zhang, Yiran
    Li, Guiying
    Yuan, Bo
    2024 INTERNATIONAL SYMPOSIUM OF ELECTRONICS DESIGN AUTOMATION, ISEDA 2024, 2024, : 678 - 683
  • [3] Virtual component HW/SW co-design - From system level design exploration to HW/SW implementation
    Schirrmeister, F
    Krolikoski, S
    SYSTEM-ON-CHIP METHODOLOGIES & DESIGN LANGUAGES, 2001, : 333 - 342
  • [4] Application of a system-level HW/SW co-design methodology to an industrial embedded system
    Galassi, Alberto
    Pomante, Luigi
    Nardocci, Vitaliano
    2024 13TH MEDITERRANEAN CONFERENCE ON EMBEDDED COMPUTING, MECO 2024, 2024, : 64 - 70
  • [5] A new HW/SW co-design methodology to generate a system level platform based on LISA
    Yang, S
    Qian, Y
    Zhang, TJ
    Rui, S
    Hou, CH
    2005 6th International Conference on ASIC Proceedings, Books 1 and 2, 2005, : 215 - 218
  • [6] iTuCoMe: HCDFG-based incremental tuning HW/SW co-design methodology for multi-level exploration
    Wang, HL
    Bian, JN
    Wu, Q
    Wang, YF
    PROCEEDINGS OF THE NINTH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, VOLS 1 AND 2, 2005, : 978 - 983
  • [7] SoC Design with HW/SW Co-Design Methodology for Wireless Communication System
    Surantha, Nico
    Sutisna, Nana
    Nagao, Yuhei
    Ochi, Hiroshi
    2017 17TH INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES (ISCIT), 2017,
  • [8] Hybrid Prototyping Methodology for Rapid System Validation in HW/SW Co-Design
    Wicaksana, Arief
    Charif, Amir
    Andriamisaina, Caaliph
    Ventroux, Nicolas
    2019 CONFERENCE ON DESIGN AND ARCHITECTURES FOR SIGNAL AND IMAGE PROCESSING (DASIP), 2019, : 35 - 40
  • [9] HW/SW Co-Design of Cost-Efficient CNN Inference for Cognitive IoT
    Lee, Kwangho
    Kong, Joonho
    Munir, Arslan
    2020 FOURTH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING IN DATA SCIENCES (ICDS), 2020,
  • [10] HW/SW co-design and implementation of multi-standard video decoding
    Liu Feng
    Guo Rui
    Shi Shu
    Cheng Xu
    PROCEEDINGS OF THE 2006 IEEE/ACM/IFIP WORKSHOP ON EMBEDDED SYSTEMS FOR REAL TIME MULTIMEDIA, 2006, : 87 - +