Analysis of dawnbench, a time-to-accuracy machine learning performance benchmark

被引:54
|
作者
Coleman C. [1 ]
Kang D. [1 ]
Narayanan D. [1 ]
Nardi L. [1 ]
Zhao T. [1 ]
Zhang J. [1 ]
Bailis P. [1 ]
Olukotun K. [1 ]
Ré C. [1 ]
Zaharia M. [1 ]
机构
[1] Stanford DAWN
来源
Operating Systems Review (ACM) | 2019年 / 53卷 / 01期
基金
美国国家科学基金会;
关键词
Competition - Benchmarking - Deep learning - Economic and social effects;
D O I
10.1145/3352020.3352024
中图分类号
学科分类号
摘要
Researchers have proposed hardware, software, and algorithmic optimizations to improve the computational performance of deep learning. While some of these optimizations perform the same operations faster (e.g., increasing GPU clock speed), many others modify the semantics of the training procedure (e.g., reduced precision), and can impact the final model's accuracy on unseen data. Due to a lack of standard evaluation criteria that considers these trade-offs, it is difficult to directly compare these optimizations. To address this problem, we recently introduced DAWNBENCH, a benchmark competition focused on end-to-end training time to achieve near-state-of-the-art accuracy on an unseen dataset-a combined metric called time-to-accuracy (TTA). In this work, we analyze the entries from DAWNBENCH, which received optimized submissions from multiple industrial groups, to investigate the behavior of TTA as a metric as well as trends in the best-performing entries. We show that TTA has a low coefficient of variation and that models optimized for TTA generalize nearly as well as those trained using standard methods. Additionally, even though DAWNBENCH entries were able to train ImageNet models in under 3 minutes, we find they still underutilize hardware capabilities such as Tensor Cores. Furthermore, we find that distributed entries can spend more than half of their time on communication. We show similar findings with entries to the MLPERF v0.5 benchmark. © Copyright held by the owner/author(s). Publication rights licensed to ACM.
引用
收藏
页码:14 / 25
页数:11
相关论文
共 50 条
  • [1] Accuracy Analysis of Machine Learning-Based Performance Modeling for Microprocessors
    Tanaka, Yoshihiro
    Oka, Keitaro
    Ono, Takatsugu
    Inoue, Koji
    [J]. 2016 FOURTH INTERNATIONAL JAPAN-EGYPT CONFERENCE ON ELECTRONICS, COMMUNICATIONS AND COMPUTERS (JEC-ECC), 2016, : 83 - 86
  • [2] MLPerf: An Industry Standard Benchmark Suite for Machine Learning Performance
    Mattson, Peter
    Tang, Hanlin
    Wei, Gu-Yeon
    Wu, Carole-Jean
    Reddi, Vijay Janapa
    Cheng, Christine
    Coleman, Cody
    Diamos, Greg
    Kanter, David
    Micikevicius, Paulius
    Patterson, David
    Schmuelling, Guenther
    [J]. IEEE MICRO, 2020, 40 (02) : 8 - 16
  • [3] A benchmark dataset for machine learning in ecotoxicology
    Schuer, Christoph
    Gasser, Lilian
    Perez-Cruz, Fernando
    Schirmer, Kristin
    Baity-Jesi, Marco
    [J]. SCIENTIFIC DATA, 2023, 10 (01)
  • [4] MoleculeNet: a benchmark for molecular machine learning
    Wu, Zhenqin
    Ramsundar, Bharath
    Feinberg, Evan N.
    Gomes, Joseph
    Geniesse, Caleb
    Pappu, Aneesh S.
    Leswing, Karl
    Pande, Vijay
    [J]. CHEMICAL SCIENCE, 2018, 9 (02) : 513 - 530
  • [5] A benchmark dataset for machine learning in ecotoxicology
    Christoph Schür
    Lilian Gasser
    Fernando Perez-Cruz
    Kristin Schirmer
    Marco Baity-Jesi
    [J]. Scientific Data, 10
  • [6] Comparing the performance of machine learning algorithms using estimated accuracy
    Gupta S.
    Saluja K.
    Goyal A.
    Vajpayee A.
    Tiwari V.
    [J]. Measurement: Sensors, 2022, 24
  • [7] Benchmark datasets and real-time autoimmune disease dataset analysis using machine learning algorithms with implementation, analysis and results
    Ramasamy, Uma
    Santhoshkumar, Sundar
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (02) : 2449 - 2463
  • [8] Analysis of precision and accuracy in a simple model of machine learning
    Julian Lee
    [J]. Journal of the Korean Physical Society, 2017, 71 : 866 - 870
  • [9] Analysis of Precision and Accuracy in a Simple Model of Machine Learning
    Lee, Julian
    [J]. JOURNAL OF THE KOREAN PHYSICAL SOCIETY, 2017, 71 (12) : 866 - 870
  • [10] Performance Benchmark of Machine Learning-Based Methodology for Swahili News Article Categorization
    Little, Shaun Anthony
    Roy, Kaushik
    Al Hamoud, Ahmed
    [J]. 2022 21ST IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, ICMLA, 2022, : 1517 - 1521