Understanding Reuse, Performance, and Hardware Cost of DNN Dataflows: A Data-Centric Approach

Cited by: 173
Authors
Kwon, Hyoukjun [1]
Chatarasi, Prasanth [1]
Pellauer, Michael [2]
Parashar, Angshuman [2]
Sarkar, Vivek [1]
Krishna, Tushar [1]
Affiliations
[1] Georgia Inst Technol, Atlanta, GA 30332 USA
[2] NVIDIA, Westford, MA USA
Funding
U.S. National Science Foundation
Keywords
Neural networks; Dataflow; Cost modeling; Transformations
DOI
10.1145/3352460.3358252
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
The data partitioning and scheduling strategies used by DNN accelerators to leverage reuse and perform staging are known as dataflow, which directly impacts the performance and energy efficiency of DNN accelerators. An accelerator microarchitecture dictates the dataflow(s) that can be employed to execute layers in a DNN. Selecting a dataflow for a layer can have a large impact on utilization and energy efficiency, but there is a lack of understanding on the choices and consequences of dataflows, and of tools and methodologies to help architects explore the co-optimization design space. In this work, we first introduce a set of data-centric directives to concisely specify the DNN dataflow space in a compiler-friendly form. We then show how these directives can be analyzed to infer various forms of reuse and to exploit them using hardware capabilities. We codify this analysis into an analytical cost model, MAESTRO (Modeling Accelerator Efficiency via Spatio-Temporal Reuse and Occupancy), that estimates various cost-benefit tradeoffs of a dataflow including execution time and energy efficiency for a DNN model and hardware configuration. We demonstrate the use of MAESTRO to drive a hardware design space exploration experiment, which searches across 480M designs to identify 2.5M valid designs at an average rate of 0.17M designs per second, including Pareto-optimal throughput- and energy-optimized design points.
Pages: 754-768
Number of pages: 15