Cappuccino: Efficient CNN Inference Software Synthesis for Mobile System-on-Chips

Cited by: 14
Authors
Motamedi, Mohammad [1]
Fong, Daniel [1]
Ghiasi, Soheil [1]
Affiliations
[1] Univ Calif Davis, Elect & Comp Engn Dept, Davis, CA 95616 USA
Keywords
Convolutional neural networks (CNNs); mobile system-on-chips (SoCs);
DOI
10.1109/LES.2018.2815954
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Convolutional neural networks (CNNs) exhibit remarkable performance in various machine learning tasks. As sensor-equipped Internet of Things devices permeate every aspect of modern life, the ability to execute CNN inference, a computationally intensive application, on resource-constrained devices has become increasingly important. In this context, we present Cappuccino, a framework for synthesis of efficient inference software targeting mobile system-on-chips (SoCs). We propose techniques for efficient parallelization of CNN inference targeting mobile SoCs, and explore the underlying tradeoffs. Experiments with different CNNs on three mobile devices demonstrate the effectiveness of our approach.
Pages: 9-12
Page count: 4
Related papers
17 in total
  • [1] A software test program generator for verifying system-on-chips
    Cheng, A
    Lim, CC
    Parashkevov, A
    HLDVT'05: TENTH ANNUAL IEEE INTERNATIONAL HIGH-LEVEL DESIGN VALIDATION AND TEST WORKSHOP, PROCEEDINGS, 2005, : 79 - 86
  • [2] Development of Embedded STT-MRAM for Mobile System-on-Chips
    Lee, Kangho
    Kang, Seung H.
    IEEE TRANSACTIONS ON MAGNETICS, 2011, 47 (01) : 131 - 136
  • [3] Rapid Exploration of Multimedia System-on-Chips with Automatically Generated Software Performance Models
    Kirchsteiger, C. M.
    Schweitzer, H.
    Trummer, C.
    Steger, C.
    Weiss, R.
    Pistauer, M.
    PROCEEDINGS OF THE 2008 IEEE/ACM/IFIP WORKSHOP ON EMBEDDED SYSTEMS FOR REAL-TIME MULTIMEDIA, 2008, : 19 - +
  • [4] Area-Efficient PUF-Based Key Generation on System-on-Chips with FPGAs
    Gehrer, Stefan
    Sigl, Georg
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2016, 25 (01)
  • [5] Configurable platforms with dynamic platform management: an efficient alternative to application-specific system-on-chips
    Sekar, K
    Lahiri, K
    Dey, SJ
    17TH INTERNATIONAL CONFERENCE ON VLSI DESIGN, PROCEEDINGS: DESIGN METHODOLOGIES FOR THE GIGASCALE ERA, 2004, : 307 - 315
  • [6] Efficient Inference for Pruned CNN Models on Mobile Devices With Holistic Sparsity Alignment
    Jin, Yuyang
    Zhong, Runxin
    Long, Saiqin
    Zhai, Jidong
    IEEE Transactions on Parallel and Distributed Systems, 2024, 35 (11) : 2208 - 2223
  • [7] Buffer Sizes Reduction for Memory-efficient CNN Inference on Mobile and Embedded Devices
    Minakova, Svetlana
    Stefanov, Todor
    2020 23RD EUROMICRO CONFERENCE ON DIGITAL SYSTEM DESIGN (DSD 2020), 2020, : 133 - 140
  • [8] Systolic Tensor Array: An Efficient Structured-Sparse GEMM Accelerator for Mobile CNN Inference
    Liu, Zhi-Gang
    Whatmough, Paul N.
    Mattina, Matthew
    IEEE COMPUTER ARCHITECTURE LETTERS, 2020, 19 (01) : 34 - 37
  • [9] Synthesis of communication mechanisms for multi-tile systems based on Heterogeneous Multi-processor System-On-Chips
    Chagoya-Garzon, Alexandre
    Guerin, Xavier
    Rousseau, Frederic
    Petrot, Frederic
    Rossetti, Davide
    Lonardo, Alessandro
    Vicini, Piero
    Paolucci, Pier Stanislao
    RSP 2009: TWENTIETH IEEE/IFIP INTERNATIONAL SYMPOSIUM ON RAPID SYSTEM PROTOTYPING, PROCEEDINGS: SHORTENING THE PATH FROM SPECIFICATION TO PROTOTYPE, 2009, : 48 - +
  • [10] A configurable hardware-efficient ECG classification inference engine based on CNN for mobile healthcare applications
    Zhang, Chen
    Li, Jian
    Guo, Pengfei
    Li, Qiuping
    Zhang, Xing
    Wang, Xinan
    MICROELECTRONICS JOURNAL, 2023, 141