SHADHO: Massively Scalable Hardware-Aware Distributed Hyperparameter Optimization

被引:5
|
作者
Kinnison, Jeffery [1 ]
Kremer-Herman, Nathaniel [1 ]
Thain, Douglas [1 ]
Scheirer, Walter [1 ]
机构
[1] Univ Notre Dame, Notre Dame, IN 46556 USA
来源
2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018) | 2018年
关键词
CHALLENGES;
D O I
10.1109/WACV.2018.00086
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Computer vision is experiencing an AI renaissance, in which machine learning models are expediting important breakthroughs in academic research and commercial applications. Effectively training these models, however, is not trivial due in part to hyperparameters: user-configured values that control a model's ability to learn from data. Existing hyperparameter optimization methods are highly parallel but make no effort to balance the search across heterogeneous hardware or to prioritize searching high-impact spaces. In this paper, we introduce a framework for massively Scalable Hardware-Aware Distributed Hyperparameter Optimization (SHADHO). Our framework calculates the relative complexity of each search space and monitors performance on the learning task over all trials. These metrics are then used as heuristics to assign hyperparameters to distributed workers based on their hardware. We first demonstrate that our framework achieves double the throughput of a standard distributed hyperparameter optimization framework by optimizing SVM for MNIST using 150 distributed workers. We then conduct model search with SHADHO over the course of one week using 74 GPUs across two compute clusters to optimize U-Net for a cell segmentation task, discovering 515 models that achieve a lower validation loss than standard U-Net.
引用
收藏
页码:738 / 747
页数:10
相关论文
共 50 条
  • [31] A survey on hardware-aware and heterogeneous computing on multicore processors and accelerators
    Buchty, Rainer
    Heuveline, Vincent
    Karl, Wolfgang
    Weiss, Jan-Philipp
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2012, 24 (07): : 663 - 675
  • [32] Hardware-aware Model Architecture for Ternary Spiking Neural Networks
    Wu, Nai-Chun
    Chen, Tsu-Hsiang
    Huang, Chih-Tsun
    2023 INTERNATIONAL VLSI SYMPOSIUM ON TECHNOLOGY, SYSTEMS AND APPLICATIONS, VLSI-TSA/VLSI-DAT, 2023,
  • [33] AI Models for Edge Computing: Hardware-aware Optimizations for Efficiency
    Li, Hai ''Helen''
    2024 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, DATE, 2024,
  • [34] Hardware-Aware Bayesian Neural Architecture Search of Quantized CNNs
    Perrin, Mathieu
    Guicquero, William
    Paille, Bruno
    Sicard, Gilles
    IEEE EMBEDDED SYSTEMS LETTERS, 2025, 17 (01) : 42 - 45
  • [35] On Hardware-Aware Probabilistic Frameworks for Resource Constrained Embedded Applications
    Olascoaga, Laura I. Galindez
    Meert, Wannes
    Shah, Nimish
    Van den Broeck, Guy
    Verhelst, Marian
    FIFTH WORKSHOP ON ENERGY EFFICIENT MACHINE LEARNING AND COGNITIVE COMPUTING - NEURIPS EDITION (EMC2-NIPS 2019), 2019, : 66 - 70
  • [36] Performance evaluation and design of hardware-aware PDE solvers:: An introduction
    Hülsemann, F
    Kowarschik, M
    APPLIED PARALLEL COMPUTING: STATE OF THE ART IN SCIENTIFIC COMPUTING, 2006, 3732 : 872 - 873
  • [37] Hardware-aware AutoML for Exploration of Custom FPGA Accelerators for RadioML
    Jentzsch, Felix
    2023 33RD INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS, FPL, 2023, : 359 - 360
  • [38] HAW: Hardware-Aware Point Selection for Efficient Winograd Convolution
    Li, Chaoran
    Jiang, Penglong
    Zhou, Hui
    Wang, Xiaofeng
    Zhao, Xiongbo
    IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 269 - 273
  • [39] HASS: Hardware-Aware Sparsity Search for Dataflow DNN Accelerator
    Yu, Zhewen
    Sreeram, Sudarshan
    Agrawal, Krish
    Wu, Junyi
    Montgomerie-Corcoran, Alexander
    Zhang, Cheng
    Cheng, Jianyi
    Bouganis, Christos-Savvas
    Zhao, Yiren
    2024 34TH INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS, FPL 2024, 2024, : 257 - 263
  • [40] An hardware-aware image polarity detector enhanced with visual attention
    Ragusa, Edoardo
    Apicella, Tommaso
    Gianoglio, Christian
    Zunino, Rodolfo
    Gastaldo, Paolo
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,