Optimizing OpenCL-Based CNN Design on FPGA with Comprehensive Design Space Exploration and Collaborative Performance Modeling

被引:9
|
作者
Mu, Jiandong [1 ]
Zhang, Wei [1 ]
Liang, Hao [2 ]
Sinha, Sharad [3 ]
机构
[1] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
[2] Alibaba Grp, Hangzhou, Peoples R China
[3] Indian Inst Technol IIT, Veling, Goa, India
关键词
CNN; modeling; hardware design; design space exploration;
D O I
10.1145/3397514
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Recent success in applying convolutional neural networks (CNNs) to object detection and classification has sparked great interest in accelerating CNNs using hardware-like field-programmable gate arrays (FPGAs). However, finding an efficient FPGA design for a given CNN model and FPGA board is not trivial since a strong background in hardware design and detailed knowledge of the target board are required. In this work, we try to solve this problem by design space exploration with a collaborative framework. Our framework consists of three main parts: FPGA design generation, coarse-grained modeling, and fine-grained modeling. In the FPGA design generation, we propose a novel data structure, LoopTree, to capture the details of the FPGA design for CNN applications without writing down the source code. Different LoopTrees, which indicate different FPGA designs, are automatically generated in this process. A coarse-grained model will evaluate LoopTrees at the operation level, e.g., add, mult, and so on, so that the most efficient LoopTrees can be selected. A fine-grained model, which is based on the source code, will then refine the selected design in a cycle-accurate manner. A set of comprehensive OpenCL-based designs have been implemented on board to verify our framework. An average estimation error of 8.87% and 4.8% has been observed for our coarse-grained model and fine-grained model, respectively. This is much lower than the prevalent operation-statistics-based estimation, which is obtained according to a predefined formula for specific loop schedules.
引用
收藏
页数:28
相关论文
共 50 条
  • [31] Design Space Exploration for FPGA-based Hybrid Multicore Architecture
    Yan, Jian
    Yuan, Junqi
    Wang, Ying
    Leong, Philip
    Wang, Lingli
    PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (FPT), 2014, : 280 - 281
  • [32] Automated design space exploration for FPGA-based heterogeneous interconnects
    Cilardo, A.
    Fusella, E.
    Gallo, L.
    Mazzeo, A.
    Mazzocca, N.
    DESIGN AUTOMATION FOR EMBEDDED SYSTEMS, 2014, 18 (3-4) : 157 - 170
  • [33] OpenCL HLS Based Design of FPGA Accelerators for Cryptographic Primitives
    Barenghi, Alessandro
    Madaschi, Michele
    Mainardi, Nicholas
    Pelosi, Gerardo
    PROCEEDINGS 2018 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING & SIMULATION (HPCS), 2018, : 634 - 641
  • [34] Design space exploration for the integrated digital CNN camera
    Malki, Suleyman
    Spaanenburg, Lambert
    PROCEEDINGS OF THE 2008 1ST INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY, 2008, : 107 - 110
  • [35] Design Space Exploration for CNN Offloading to FPGAs at the Edge
    Korol, Guilherme
    Jordan, Michael Guilherme
    Rutzig, Mateus Beck
    Castrillon, Jeronimo
    Schneider Beck, Antonio Carlos
    2023 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI, ISVLSI, 2023, : 276 - 281
  • [36] Dynamical structural modeling A collaborative design exploration
    Burry, Jane
    Felicetti, Peter
    Tang, Jiwu
    Burry, Mark
    Xie, Mike
    INTERNATIONAL JOURNAL OF ARCHITECTURAL COMPUTING, 2005, 3 (01) : 27 - 42
  • [37] Design-Space Exploration between FPGA and ASIF
    Qureshi, Muhammad Amin
    Parvez, Husain
    2014 9TH INTERNATIONAL SYMPOSIUM ON RECONFIGURABLE AND COMMUNICATION-CENTRIC SYSTEMS-ON-CHIP (RECOSOC), 2014,
  • [38] FPGA Redundancy Configurations: An Automated Design Space Exploration
    Anwer, Jahanzeb
    Platzner, Marco
    Meisner, Sebastian
    PROCEEDINGS OF 2014 IEEE INTERNATIONAL PARALLEL & DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2014, : 275 - 280
  • [39] A comprehensive method for optimizing the design of a regular architectural space to improve building performance
    Zou, Yukai
    Zhan, Qiaosheng
    Xiang, Ke
    ENERGY REPORTS, 2021, 7 : 981 - 996
  • [40] Model-Based Design Space Exploration for Approximate Image Processing on FPGA
    Manuel, Manu
    Kreddig, Arne
    Conrady, Simon
    Doan, Nguyen Anh Vu
    Stechele, Walter
    2020 IEEE NORDIC CIRCUITS AND SYSTEMS CONFERENCE (NORCAS), 2020,