Understanding Reuse, Performance, and Hardware Cost of DNN Dataflows: A Data-Centric Approach

被引:173
|
作者
Kwon, Hyoukjun [1 ]
Chatarasi, Prasanth [1 ]
Pellauer, Michael [2 ]
Parashar, Angshuman [2 ]
Sarkar, Vivek [1 ]
Krishna, Tushar [1 ]
机构
[1] Georgia Inst Technol, Atlanta, GA 30332 USA
[2] NVIDIA, Westford, MA USA
基金
美国国家科学基金会;
关键词
Neural networks; Dataflow; Cost modeling; TRANSFORMATIONS;
D O I
10.1145/3352460.3358252
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The data partitioning and scheduling strategies used by DNN accelerators to leverage reuse and perform staging are known as dataflow, which directly impacts the performance and energy efficiency of DNN accelerators. An accelerator microarchitecture dictates the dataflow(s) that can be employed to execute layers in a DNN. Selecting a dataflow for a layer can have a large impact on utilization and energy efficiency, but there is a lack of understanding on the choices and consequences of dataflows, and of tools and methodologies to help architects explore the co-optimization design space. In this work, we first introduce a set of data-centric directives to concisely specify the DNN dataflow space in a compiler-friendly form. We then show how these directives can be analyzed to infer various forms of reuse and to exploit them using hardware capabilities. We codify this analysis into an analytical cost model, MAESTRO (Modeling Accelerator Efficiency via Spatio-Temporal Reuse and Occupancy), that estimates various cost-benefit tradeoffs of a dataflow including execution time and energy efficiency for a DNN model and hardware configuration. We demonstrate the use of MAESTRO to drive a hardware design space exploration experiment, which searches across 480M designs to identify 2.5M valid designs at an average rate of 0.17M designs per second, including Pareto-optimal throughput- and energy-optimized design points.
引用
收藏
页码:754 / 768
页数:15
相关论文
共 50 条
  • [31] Data-centric AI approach for automated wildflower monitoring
    Schouten, Gerard
    Michielsen, Bas S. H. T.
    Gravendeel, Barbara
    [J]. PLOS ONE, 2024, 19 (09):
  • [32] Distributed scheduler for high performance data-centric systems
    Goel, S
    Sharda, H
    Taniar, D
    [J]. IEEE TENCON 2003: CONFERENCE ON CONVERGENT TECHNOLOGIES FOR THE ASIA-PACIFIC REGION, VOLS 1-4, 2003, : 1157 - 1161
  • [33] A participatory data-centric approach to AI Ethics by Design
    Gerdes, Anne
    [J]. APPLIED ARTIFICIAL INTELLIGENCE, 2022, 36 (01)
  • [34] A data-centric approach to high-level synthesis
    Tarafdar, S
    Leeser, M
    [J]. IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2000, 19 (11) : 1251 - 1267
  • [35] Data-Centric Optimization Approach for Small, Imbalanced Datasets
    Tanov, Vladislav
    [J]. JOURNAL OF INFORMATION AND ORGANIZATIONAL SCIENCES, 2023, 47 (01) : 167 - 177
  • [36] A data-centric approach for scalable state machine replication
    Chockler, G
    Malkhi, D
    Dolev, D
    [J]. FUTURE DIRECTIONS IN DISTRIBUTED COMPUTING: RESEARCH AND POSITION PAPERS, 2003, 2584 : 159 - 163
  • [37] Reliability evaluation of individual predictions: a data-centric approach
    Shahbazi, Nima
    Asudeh, Abolfazl
    [J]. VLDB JOURNAL, 2024, 33 (04): : 1203 - 1230
  • [38] Performance Evaluation of Data-Centric Workloads in Serverless Environments
    Nestorov, Anna Maria
    Polo, Jorda
    Misale, Claudia
    Carrera, David
    Youssef, Alaa S.
    [J]. 2021 IEEE 14TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD 2021), 2021, : 491 - 496
  • [39] Dynamic Load Balancing in Cloud A Data-Centric Approach
    Dasoriya, Rayan
    Kotadiya, Purvi
    Arya, Garima
    Nayak, Priyanshu
    Mistry, Kamal
    [J]. 2017 INTERNATIONAL CONFERENCE ON NETWORKS & ADVANCES IN COMPUTATIONAL TECHNOLOGIES (NETACT), 2017, : 162 - 166
  • [40] Identification of the Barriers to Data-Centric Approach in the Construction Industry
    Karji, Ali
    Messner, John
    Leicht, Robert
    McComb, Christopher
    [J]. CONSTRUCTION RESEARCH CONGRESS 2022: PROJECT MANAGEMENT AND DELIVERY, CONTRACTS, AND DESIGN AND MATERIALS, 2022, : 1002 - 1011