SHADHO: Massively Scalable Hardware-Aware Distributed Hyperparameter Optimization

被引:5
|
作者
Kinnison, Jeffery [1 ]
Kremer-Herman, Nathaniel [1 ]
Thain, Douglas [1 ]
Scheirer, Walter [1 ]
机构
[1] Univ Notre Dame, Notre Dame, IN 46556 USA
来源
2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018) | 2018年
关键词
CHALLENGES;
D O I
10.1109/WACV.2018.00086
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Computer vision is experiencing an AI renaissance, in which machine learning models are expediting important breakthroughs in academic research and commercial applications. Effectively training these models, however, is not trivial due in part to hyperparameters: user-configured values that control a model's ability to learn from data. Existing hyperparameter optimization methods are highly parallel but make no effort to balance the search across heterogeneous hardware or to prioritize searching high-impact spaces. In this paper, we introduce a framework for massively Scalable Hardware-Aware Distributed Hyperparameter Optimization (SHADHO). Our framework calculates the relative complexity of each search space and monitors performance on the learning task over all trials. These metrics are then used as heuristics to assign hyperparameters to distributed workers based on their hardware. We first demonstrate that our framework achieves double the throughput of a standard distributed hyperparameter optimization framework by optimizing SVM for MNIST using 150 distributed workers. We then conduct model search with SHADHO over the course of one week using 74 GPUs across two compute clusters to optimize U-Net for a cell segmentation task, discovering 515 models that achieve a lower validation loss than standard U-Net.
引用
收藏
页码:738 / 747
页数:10
相关论文
共 50 条
  • [21] Towards Hardware-Aware Tractable Learning of Probabilistic Models
    Olascoaga, Laura I. Galindez
    Meert, Wannes
    Shah, Nimish
    Verhelst, Marian
    Van den Broeck, Guy
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [22] Hardware-Aware Neural Architecture Search: Survey and Taxonomy
    Benmeziane, Hadjer
    El Maghraoui, Kaoutar
    Ouarnoughi, Hamza
    Niar, Smail
    Wistuba, Martin
    Wang, Naigang
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 4322 - 4329
  • [23] MOSS-DB: A Hardware-Aware OLAP Database
    Zhang, Yansong
    Hu, Wei
    Wang, Shan
    WEB-AGE INFORMATION MANAGEMENT, PROCEEDINGS, 2010, 6184 : 582 - 594
  • [24] Hardware-aware Moving Objects Detection in Satellite Image
    Lee, Pei-Jun
    Chiu, Zheng-Kai
    Liu, Kuang-Zhe
    Lin, Albert
    Chen, Chia-Ray
    2018 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS-TAIWAN (ICCE-TW), 2018,
  • [25] Hardware-Aware Softmax Approximation for Deep Neural Networks
    Geng, Xue
    Lin, Jie
    Zhao, Bin
    Kong, Anmin
    Aly, Mohamed M. Sabry
    Chandrasekhar, Vijay
    COMPUTER VISION - ACCV 2018, PT IV, 2019, 11364 : 107 - 122
  • [26] Evolution of Hardware-Aware Neural Architecture Search on the Edge
    Richey, Blake
    Clay, Mitchell
    Grecos, Christos
    Shirvaikar, Mukul
    REAL-TIME IMAGE PROCESSING AND DEEP LEARNING 2023, 2023, 12528
  • [27] Hardware-Aware Quantization for Multiplierless Neural Network Controllers
    Habermann, Tobias
    Kuehle, Jonas
    Kumm, Martin
    Volkova, Anastasia
    2022 IEEE ASIA PACIFIC CONFERENCE ON CIRCUITS AND SYSTEMS, APCCAS, 2022, : 541 - 545
  • [28] Hardware-Aware and Efficient Feature Fusion Network Search
    Guo J.-M.
    Zhang R.
    Zhi T.
    He D.-Y.
    Huang D.
    Chang M.
    Zhang X.-S.
    Guo Q.
    Jisuanji Xuebao/Chinese Journal of Computers, 2022, 45 (11): : 2420 - 2432
  • [29] A Hardware-Aware Heuristic for the Qubit Mapping Problem in the NISQ Era
    Niu S.
    Suau A.
    Staffelbach G.
    Todri-Sanial A.
    IEEE Transactions on Quantum Engineering, 2020, 1
  • [30] Hardware-Aware Sum-Product Decoding in the Decision Domain
    Yamada, Mizuki
    Takeuchi, Keigo
    Koike, Kiyoyuki
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2019, E102A (12) : 1980 - 1987