CHEF: A Framework for Deploying Heterogeneous Models on Clusters With Heterogeneous FPGAs

Times cited: 0
Authors
Tang, Yue [1]
Song, Yukai [1]
Elango, Naveena [2]
Priya, Sheena Ratnam [2]
Jones, Alex K. [3, 4]
Xiong, Jinjun [2]
Zhou, Peipei [5]
Hu, Jingtong [1]
Affiliations
[1] Univ Pittsburgh, Dept Elect & Comp Engn, Pittsburgh, PA 15261 USA
[2] Univ Buffalo, Dept Comp Sci & Engn, Buffalo, NY 14260 USA
[3] Syracuse Univ, Dept Elect Engn, Syracuse, NY 13244 USA
[4] Syracuse Univ, Comp Sci Dept, Syracuse, NY 13244 USA
[5] Brown Univ, Sch Engn, Providence, RI 02912 USA
Keywords
Design automation; Computational modeling; Clustering algorithms; Machine learning; Bandwidth; Benchmark testing; Market research; Hardware; Integrated circuit modeling; Field programmable gate arrays; Heterogeneous FPGA clusters; multimodality multitask (MMMT);
DOI
10.1109/TCAD.2024.3438994
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Deep neural networks (DNNs) are rapidly evolving from streamlined single-modality single-task (SMST) to multimodality multitask (MMMT) with large variations for different layers and complex data dependencies among layers. To support such models, hardware systems also evolved to be heterogeneous. The heterogeneous system comes from the prevailing trend to integrate diverse accelerators into the system for lower latency. FPGAs have high-computation density and communication bandwidth and are configurable to be deployed with different designs of accelerators, which are widely used for various machine-learning applications. However, scaling from SMST to MMMT on heterogeneous FPGAs is challenging since MMMT has much larger layer variations, a massive number of layers, and complex data dependency among different backbones. Previous mapping algorithms are either inefficient or over-simplified which makes them impractical in general scenarios. In this work, we propose CHEF to enable efficient implementation of MMMT models in realistic heterogeneous FPGA clusters, i.e., deploying heterogeneous accelerators on heterogeneous FPGAs (A2F) and mapping the heterogeneous DNNs on the deployed heterogeneous accelerators (M2A). We propose CHEF-A2F, a two-stage accelerators-to-FPGAs deployment approach to co-optimize hardware deployment and accelerator mapping. In addition, we propose CHEF-M2A, which can support general and practical cases compared to previous mapping algorithms. To the best of our knowledge, this is the first attempt to implement MMMT models in real heterogeneous FPGA clusters. Experimental results show that the latency obtained with CHEF is near-optimal while the search time is $10\,000\times$ less than exhaustively searching the optimal solution.
Pages: 3937-3948
Page count: 12
Related papers
50 in total
  • [21] Floorplanning for High Utilization of Heterogeneous FPGAs
    Liu, Nan
    Chen, Song
    Yoshimura, Takeshi
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2012, E95A (09) : 1529 - 1537
  • [22] A Flexible Scheduling Framework for Heterogeneous CPU-GPU Clusters
    Sajjapongse, Kittisak
    Agarwal, Tejaswi
    Becchi, Michela
    2014 21ST INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING (HIPC), 2014,
  • [23] Fangorn: Adaptive Execution Framework for Heterogeneous Workloads on Shared Clusters
    Chen, Yingda
    Wang, Jiamang
    Lu, Yifeng
    Han, Ying
    Lv, Zhiqiang
    Min, Xuebin
    Cai, Hua
    Zhang, Wei
    Fan, Haochuan
    Li, Chao
    Guan, Tao
    Lin, Wei
    Jia, Yangqing
    Zhou, Jingren
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2021, 14 (12): : 2972 - 2985
  • [24] Acceleration of Scientific Deep Learning Models on Heterogeneous Computing Platform with Intel® FPGAs
    Jiang, Chao
    Ojika, Dave
    Kurth, Thorsten
    Prabhat
    Vallecorsa, Sofia
    Patel, Bhavesh
    Lam, Herman
    HIGH PERFORMANCE COMPUTING: ISC HIGH PERFORMANCE 2019 INTERNATIONAL WORKSHOPS, 2020, 11887 : 587 - 600
  • [25] On Exploring the Optimum Configuration of Apache Spark Framework in Heterogeneous Clusters
    Ballas, Ioannis
    Tsakanikas, Vassilios
    Pefanis, Evaggelos
    Tampakas, Vassilios
    25TH PAN-HELLENIC CONFERENCE ON INFORMATICS WITH INTERNATIONAL PARTICIPATION (PCI2021), 2021, : 250 - 253
  • [26] A scalable framework for Adaptive Computational General Relativity on Heterogeneous Clusters
    Fernando, Milinda
    Neilsen, David
    Hirschmann, Eric W.
    Sundar, Hari
    INTERNATIONAL CONFERENCE ON SUPERCOMPUTING (ICS 2019), 2019, : 1 - 12
  • [27] Research of Hybrid Resource Scheduling Framework of Heterogeneous Clusters for Dataflow
    Tang X.-C.
    Zhao Q.
    Fu Y.
    Zhu Z.-Y.
    Ding Z.
    Hu X.-X.
    Li Z.-H.
    Ruan Jian Xue Bao/Journal of Software, 2022, 33 (12): : 4704 - 4726
  • [28] When Application-Specific ISA Meets FPGAs: A Multi-layer Virtualization Framework for Heterogeneous Cloud FPGAs
    Zha, Yue
    Li, Jing
    ASPLOS XXVI: TWENTY-SIXTH INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS, 2021, : 123 - 134
  • [29] CLUSTERS IN HETEROGENEOUS CATALYSIS
    ERMAKOV, YI
    ARZAMASKOVA, LN
    KUZNETSOV, VL
    KOORDINATSIONNAYA KHIMIYA, 1984, 10 (07): : 877 - 886
  • [30] On the use of performance models for adaptive algorithm selection on heterogeneous clusters
    Achour, Sami
    Nasri, Wahid
    Steffenel, Luiz Angelo
    PROCEEDINGS OF THE PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING, 2009, : 85 - +