SHADHO: Massively Scalable Hardware-Aware Distributed Hyperparameter Optimization

被引：5

作者：

Kinnison, Jeffery ^{[1
]}

Kremer-Herman, Nathaniel ^{[1
]}

Thain, Douglas ^{[1
]}

Scheirer, Walter ^{[1
]}

机构：

[1] Univ Notre Dame, Notre Dame, IN 46556 USA

来源：

2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018) | 2018年

关键词：

CHALLENGES;

D O I：

10.1109/WACV.2018.00086

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Computer vision is experiencing an AI renaissance, in which machine learning models are expediting important breakthroughs in academic research and commercial applications. Effectively training these models, however, is not trivial due in part to hyperparameters: user-configured values that control a model's ability to learn from data. Existing hyperparameter optimization methods are highly parallel but make no effort to balance the search across heterogeneous hardware or to prioritize searching high-impact spaces. In this paper, we introduce a framework for massively Scalable Hardware-Aware Distributed Hyperparameter Optimization (SHADHO). Our framework calculates the relative complexity of each search space and monitors performance on the learning task over all trials. These metrics are then used as heuristics to assign hyperparameters to distributed workers based on their hardware. We first demonstrate that our framework achieves double the throughput of a standard distributed hyperparameter optimization framework by optimizing SVM for MNIST using 150 distributed workers. We then conduct model search with SHADHO over the course of one week using 74 GPUs across two compute clusters to optimize U-Net for a cell segmentation task, discovering 515 models that achieve a lower validation loss than standard U-Net.

引用

页码：738 / 747

页数：10

共 50 条

[31] A survey on hardware-aware and heterogeneous computing on multicore processors and accelerators
Buchty, Rainer
Heuveline, Vincent
Karl, Wolfgang
Weiss, Jan-Philipp
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2012, 24 (07): : 663 - 675
[32] Hardware-aware Model Architecture for Ternary Spiking Neural Networks
Wu, Nai-Chun
Chen, Tsu-Hsiang
Huang, Chih-Tsun
2023 INTERNATIONAL VLSI SYMPOSIUM ON TECHNOLOGY, SYSTEMS AND APPLICATIONS, VLSI-TSA/VLSI-DAT, 2023,
[33] AI Models for Edge Computing: Hardware-aware Optimizations for Efficiency
Li, Hai ''Helen''
2024 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, DATE, 2024,
[34] Hardware-Aware Bayesian Neural Architecture Search of Quantized CNNs
Perrin, Mathieu
Guicquero, William
Paille, Bruno
Sicard, Gilles
IEEE EMBEDDED SYSTEMS LETTERS, 2025, 17 (01) : 42 - 45
[35] On Hardware-Aware Probabilistic Frameworks for Resource Constrained Embedded Applications
Olascoaga, Laura I. Galindez
Meert, Wannes
Shah, Nimish
Van den Broeck, Guy
Verhelst, Marian
FIFTH WORKSHOP ON ENERGY EFFICIENT MACHINE LEARNING AND COGNITIVE COMPUTING - NEURIPS EDITION (EMC2-NIPS 2019), 2019, : 66 - 70
[36] Performance evaluation and design of hardware-aware PDE solvers:: An introduction
Hülsemann, F
Kowarschik, M
APPLIED PARALLEL COMPUTING: STATE OF THE ART IN SCIENTIFIC COMPUTING, 2006, 3732 : 872 - 873
[37] Hardware-aware AutoML for Exploration of Custom FPGA Accelerators for RadioML
Jentzsch, Felix
2023 33RD INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS, FPL, 2023, : 359 - 360
[38] HAW: Hardware-Aware Point Selection for Efficient Winograd Convolution
Li, Chaoran
Jiang, Penglong
Zhou, Hui
Wang, Xiaofeng
Zhao, Xiongbo
IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 269 - 273
[39] HASS: Hardware-Aware Sparsity Search for Dataflow DNN Accelerator
Yu, Zhewen
Sreeram, Sudarshan
Agrawal, Krish
Wu, Junyi
Montgomerie-Corcoran, Alexander
Zhang, Cheng
Cheng, Jianyi
Bouganis, Christos-Savvas
Zhao, Yiren
2024 34TH INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS, FPL 2024, 2024, : 257 - 263
[40] An hardware-aware image polarity detector enhanced with visual attention
Ragusa, Edoardo
Apicella, Tommaso
Gianoglio, Christian
Zunino, Rodolfo
Gastaldo, Paolo
2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,

← 1 2 3 4 5 →