Cloud-DNN: An Open Framework for Mapping DNN Models to Cloud FPGAs

被引:80
|
作者
Chen, Yao [1 ]
He, Jiong [1 ]
Zhang, Xiaofan [2 ]
Hao, Cong [2 ]
Chen, Deming [1 ,2 ]
机构
[1] Adv Digital Sci Ctr, Singapore, Singapore
[2] Univ Illinois, Champaign, IL USA
基金
新加坡国家研究基金会;
关键词
DNN Accelerator; FPGA; High-Level Synthesis; Cloud Computing;
D O I
10.1145/3289602.3293915
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The efficacy and effectiveness of Convolutional Neural Networks (CNNs) have been proven in a wide range of machine learning applications. However, the high computational complexity of CNNs presents a critical challenge towards their broader adoption in real-time and power-efficient scenarios. FPGAs are poised to take a significant role for high-performance and energy-efficient computation of CNNs for both mobile (e.g., UAVs, self-driving cars, and IoT devices) and cloud computing domains. However, implementing an effective CNN system onto FPGAs efficiently remains problematic. The current cloud-based FPGAs with unique design constraints and architectural characteristics further increase the challenges. To address these challenges, we propose a novel open-source automated tool chain called Cloud-DNN. Our tool chain takes trained CNN models specified in Caffe as input, performs a set of transformations, and maps the model to a cloud-based FPGA. Cloud-DNN can significantly improve the overall design productivity of CNNs on FPGAs while satisfying the emergent computational requirements. Our design provides an alternative solution compared to other cloud-based options (e.g., GPUs or TPUs) while offering flexible, and high performance DNN inferences. The unique features of Cloud-DNN include the optimizations with cloud-platform characteristics and the support of easier and streamlined implementation. Experimental results demonstrate up to 104.55x performance improvement when compared to CPU implementation and comparable usability, flexibility, and strong quality compared to other state-of-the-art DNN inference implementations on standalone FPGAs.
引用
收藏
页码:73 / 82
页数:10
相关论文
共 50 条
  • [1] Characterizing DNN Models for Edge-Cloud Computing
    Xia, Chunwei
    Zhao, Jiacheng
    Cui, Huimin
    Feng, Xiaobing
    [J]. 2018 IEEE INTERNATIONAL SYMPOSIUM ON WORKLOAD CHARACTERIZATION (IISWC), 2018, : 82 - 83
  • [2] ApproxDNN: Incentivizing DNN Approximation in Cloud
    Nabavinejad, Seyed Morteza
    Mashayekhy, Lena
    Reda, Sherief
    [J]. 2020 20TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING (CCGRID 2020), 2020, : 639 - 648
  • [3] DNNTune: Automatic Benchmarking DNN Models for Mobile-cloud Computing
    Xia, Chunwei
    Zhao, Jiacheng
    Cui, Huimin
    Feng, Xiaobing
    Xue, Jingling
    [J]. ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2019, 16 (04)
  • [4] Scale-Train: A Scalable DNN Training Framework for a Heterogeneous GPU Cloud
    Kim, Kyeonglok
    Lee, Hyeonsu
    Oh, Seungmin
    Seo, Euiseong
    [J]. IEEE ACCESS, 2022, 10 : 68468 - 68481
  • [5] A Compliance Checking Framework for DNN Models
    Verma, Sunny
    Wang, Chen
    Zhu, Liming
    Liu, Wei
    [J]. PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 6470 - 6471
  • [6] An adaptive DNN inference acceleration framework with end-edge-cloud collaborative computing
    Liu, Guozhi
    Dai, Fei
    Xu, Xiaolong
    Fu, Xiaodong
    Dou, Wanchun
    Kumar, Neeraj
    Bilal, Muhammad
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2023, 140 : 422 - 435
  • [7] Accelerating DNN Inference by Edge-Cloud Collaboration
    Chen, Jianan
    Qi, Qi
    Wang, Jingyu
    Sun, Haifeng
    Liao, Jianxin
    [J]. 2021 IEEE INTERNATIONAL PERFORMANCE, COMPUTING, AND COMMUNICATIONS CONFERENCE (IPCCC), 2021,
  • [8] An open source framework based on Kafka-ML for Distributed DNN inference over the Cloud-to-Things continuum
    Torres, Daniel R.
    Martin, Cristian
    Rubio, Bartolome
    Diaz, Manuel
    [J]. JOURNAL OF SYSTEMS ARCHITECTURE, 2021, 118
  • [9] A DNN partitioning framework with controlled lossy mechanisms for edge-cloud collaborative intelligence
    Kim, Hyochan
    Choi, Ji Sub
    Kim, Jungrae
    Ko, Jong Hwan
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 154 : 426 - 439
  • [10] Scheduling DNN Inferencing on Edge and Cloud for Personalized UAV Fleets
    Raj, Suman
    Gupta, Harshil
    Simmhan, Yogesh
    [J]. 2023 IEEE/ACM 23RD INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING, CCGRID, 2023, : 615 - 626