Cloud-DNN: An Open Framework for Mapping DNN Models to Cloud FPGAs

被引：80

作者：

Chen, Yao ^{[1
]}

He, Jiong ^{[1
]}

Zhang, Xiaofan ^{[2
]}

Hao, Cong ^{[2
]}

Chen, Deming ^{[1
,2
]}

机构：

[1] Adv Digital Sci Ctr, Singapore, Singapore

[2] Univ Illinois, Champaign, IL USA

来源：

PROCEEDINGS OF THE 2019 ACM/SIGDA INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE GATE ARRAYS (FPGA'19) | 2019年

基金：

新加坡国家研究基金会;

关键词：

DNN Accelerator; FPGA; High-Level Synthesis; Cloud Computing;

D O I：

10.1145/3289602.3293915

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

The efficacy and effectiveness of Convolutional Neural Networks (CNNs) have been proven in a wide range of machine learning applications. However, the high computational complexity of CNNs presents a critical challenge towards their broader adoption in real-time and power-efficient scenarios. FPGAs are poised to take a significant role for high-performance and energy-efficient computation of CNNs for both mobile (e.g., UAVs, self-driving cars, and IoT devices) and cloud computing domains. However, implementing an effective CNN system onto FPGAs efficiently remains problematic. The current cloud-based FPGAs with unique design constraints and architectural characteristics further increase the challenges. To address these challenges, we propose a novel open-source automated tool chain called Cloud-DNN. Our tool chain takes trained CNN models specified in Caffe as input, performs a set of transformations, and maps the model to a cloud-based FPGA. Cloud-DNN can significantly improve the overall design productivity of CNNs on FPGAs while satisfying the emergent computational requirements. Our design provides an alternative solution compared to other cloud-based options (e.g., GPUs or TPUs) while offering flexible, and high performance DNN inferences. The unique features of Cloud-DNN include the optimizations with cloud-platform characteristics and the support of easier and streamlined implementation. Experimental results demonstrate up to 104.55x performance improvement when compared to CPU implementation and comparable usability, flexibility, and strong quality compared to other state-of-the-art DNN inference implementations on standalone FPGAs.

引用

页码：73 / 82

页数：10

共 50 条

[1] Characterizing DNN Models for Edge-Cloud Computing
Xia, Chunwei
Zhao, Jiacheng
Cui, Huimin
Feng, Xiaobing
[J]. 2018 IEEE INTERNATIONAL SYMPOSIUM ON WORKLOAD CHARACTERIZATION (IISWC), 2018, : 82 - 83
[2] ApproxDNN: Incentivizing DNN Approximation in Cloud
Nabavinejad, Seyed Morteza
Mashayekhy, Lena
Reda, Sherief
[J]. 2020 20TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING (CCGRID 2020), 2020, : 639 - 648
[3] DNNTune: Automatic Benchmarking DNN Models for Mobile-cloud Computing
Xia, Chunwei
Zhao, Jiacheng
Cui, Huimin
Feng, Xiaobing
Xue, Jingling
[J]. ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2019, 16 (04)
[4] Scale-Train: A Scalable DNN Training Framework for a Heterogeneous GPU Cloud
Kim, Kyeonglok
Lee, Hyeonsu
Oh, Seungmin
Seo, Euiseong
[J]. IEEE ACCESS, 2022, 10 : 68468 - 68481
[5] A Compliance Checking Framework for DNN Models
Verma, Sunny
Wang, Chen
Zhu, Liming
Liu, Wei
[J]. PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 6470 - 6471
[6] An adaptive DNN inference acceleration framework with end-edge-cloud collaborative computing
Liu, Guozhi
Dai, Fei
Xu, Xiaolong
Fu, Xiaodong
Dou, Wanchun
Kumar, Neeraj
Bilal, Muhammad
[J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2023, 140 : 422 - 435
[7] Accelerating DNN Inference by Edge-Cloud Collaboration
Chen, Jianan
Qi, Qi
Wang, Jingyu
Sun, Haifeng
Liao, Jianxin
[J]. 2021 IEEE INTERNATIONAL PERFORMANCE, COMPUTING, AND COMMUNICATIONS CONFERENCE (IPCCC), 2021,
[8] An open source framework based on Kafka-ML for Distributed DNN inference over the Cloud-to-Things continuum
Torres, Daniel R.
Martin, Cristian
Rubio, Bartolome
Diaz, Manuel
[J]. JOURNAL OF SYSTEMS ARCHITECTURE, 2021, 118
[9] A DNN partitioning framework with controlled lossy mechanisms for edge-cloud collaborative intelligence
Kim, Hyochan
Choi, Ji Sub
Kim, Jungrae
Ko, Jong Hwan
[J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 154 : 426 - 439
[10] Scheduling DNN Inferencing on Edge and Cloud for Personalized UAV Fleets
Raj, Suman
Gupta, Harshil
Simmhan, Yogesh
[J]. 2023 IEEE/ACM 23RD INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING, CCGRID, 2023, : 615 - 626

← 1 2 3 4 5 →