A Data-Centric Optimization Framework for Machine Learning

被引:8
|
作者
Rausch, Oliver [1 ]
Ben-Nun, Tal [1 ]
Dryden, Nikoli [1 ]
Ivanov, Andrei [1 ]
Li, Shigang [1 ]
Hoefler, Torsten [1 ]
机构
[1] Swiss Fed Inst Technol, Dept Comp Sci, Zurich, Switzerland
基金
欧洲研究理事会; 瑞士国家科学基金会; 欧盟地平线“2020”;
关键词
deep neural network compilers; machine learning; training optimization;
D O I
10.1145/3524059.3532364
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Rapid progress in deep learning is leading to a diverse set of quickly changing models, with a dramatically growing demand for compute. However, as frameworks specialize performance optimization to patterns in popular networks, they implicitly constrain novel and diverse models that drive progress in research. We empower deep learning researchers by defining a flexible and user-customizable pipeline for optimizing training of arbitrary deep neural networks, based on data movement minimization. The pipeline begins with standard networks in PyTorch or ONNX and transforms computation through progressive lowering. We define four levels of general-purpose transformations, from local intra-operator optimizations to global data movement reduction. These operate on a data-centric graph intermediate representation that expresses computation and data movement at all levels of abstraction, including expanding basic operators such as convolutions to their underlying computations. Central to the design is the interactive and introspectable nature of the pipeline. Every part is extensible through a Python API, and can be tuned interactively using a GUI. We demonstrate competitive performance or speedups on ten different networks, with interactive optimizations discovering new opportunities in EfficientNet.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Machine learning for data-centric epidemic forecasting
    Rodriguez, Alexander
    Kamarthi, Harshavardhan
    Agarwal, Pulak
    Ho, Javen
    Patel, Mira
    Sapre, Suchet
    Prakash, B. Aditya
    [J]. NATURE MACHINE INTELLIGENCE, 2024,
  • [2] Data-centric machine learning in quantum information science
    Lohani, Sanjaya
    Lukens, Joseph M.
    Glasser, Ryan T.
    Searles, Thomas A.
    Kirby, Brian T.
    [J]. MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2022, 3 (04):
  • [3] Data-Centric Approaches to Radio Frequency Machine Learning
    Kuzdeba, Scott
    Robinson, Josh
    [J]. 2022 IEEE MILITARY COMMUNICATIONS CONFERENCE (MILCOM), 2022,
  • [4] Data-Centric Machine Learning Pipeline for Hardware Verification
    Shin, Hongsup
    [J]. 2022 IEEE 35TH INTERNATIONAL SYSTEM-ON-CHIP CONFERENCE (IEEE SOCC 2022), 2022, : 11 - 12
  • [5] Data-centric framework for crystal structure identification in atomistic simulations using machine learning
    Chung, Heejung W.
    Freitas, Rodrigo
    Cheon, Gowoon
    Reed, Evan J.
    [J]. PHYSICAL REVIEW MATERIALS, 2022, 6 (04)
  • [6] Taxonomy of machine learning paradigms: A data-centric perspective
    Emmert-Streib, Frank
    Dehmer, Matthias
    [J]. WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2022, 12 (05)
  • [7] Gaspar Data-Centric Framework
    Silva, Rui
    Sobral, J. L.
    [J]. HIGH PERFORMANCE COMPUTING FOR COMPUTATIONAL SCIENCE - VECPAR 2016, 2017, 10150 : 234 - 247
  • [8] A Framework for Verifying Data-Centric Protocols
    Deng, Yuxin
    Grumbach, Stephane
    Monin, Jean-Francois
    [J]. FORMAL TECHNIQUES FOR DISTRIBUTED SYSTEMS, 2011, 6722 : 106 - 120
  • [9] Bridging Control-Centric and Data-Centric Optimization
    Ben-Nun, Tal
    Ates, Berke
    Calotoiu, Alexandru
    Hoefler, Torsten
    [J]. PROCEEDINGS OF THE 21ST ACM/IEEE INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION, CGO 2023, 2023, : 173 - 185
  • [10] Data-centric approach to improve machine learning models for inorganic materials
    Bartel, Christopher J.
    [J]. PATTERNS, 2021, 2 (11):