Cerebro: A Data System for Optimized Deep Learning Model Selection

被引:35
|
作者
Nakandala, Supun [1 ]
Zhang, Yuhao [1 ]
Kumar, Arun [1 ]
机构
[1] Univ Calif San Diego, San Diego, CA 92103 USA
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2020年 / 13卷 / 11期
基金
美国国家科学基金会;
关键词
INFERENCE;
D O I
10.14778/3407790.3407816
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep neural networks (deep nets) are revolutionizing many machine learning (ML) applications. But there is a major bottleneck to wider adoption: the pain and resource intensiveness of model selection. This empirical process involves exploring deep net architectures and hyper-parameters, often requiring hundreds of trials. Alas, most ML systems focus on training one model at a time, reducing throughput and raising overall resource costs; some also sacrifice reproducibility. We present Cerebro, a new data system to raise deep net model selection throughput at scale without raising resource costs and without sacrificing reproducibility or accuracy. Cerebro uses a new parallel SGD execution strategy we call model hopper parallelism that hybridizes task- and data-parallelism to mitigate the cons of these prior paradigms and offer the best of both worlds. Experiments on large ML benchmark datasets show that Cerebro offers 3x to 10x runtime savings relative to data-parallel systems like Horovod and Parameter Server and up to 8x memory/storage savings or up to 100x network savings relative to task-parallel systems. Cerebro also supports heterogeneous resources and fault tolerance.
引用
收藏
页码:2159 / 2173
页数:15
相关论文
共 50 条
  • [31] A hybrid system with optimized decomposition on random deep learning model for crude oil futures forecasting
    Wang, Jie
    Zhang, Ying
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 272
  • [32] SMART PARKING SYSTEM: OPTIMIZED ENSEMBLE DEEP LEARNING MODEL WITH INTERNET OF THINGS FOR SMART CITIES
    Jakkaladiki, Sudha Prathyusha
    Poulova, Petra
    Prazak, Pavel
    Tesarova, Barbora
    SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2023, 24 (04): : 1191 - 1201
  • [33] Network intrusion detection based on deep learning model optimized with rule-based hybrid feature selection
    Ayo, Femi Emmanuel
    Folorunso, Sakinat Oluwabukonla
    Abayomi-Alli, Adebayo A.
    Adekunle, Adebola Olayinka
    Awotunde, Joseph Bamidele
    INFORMATION SECURITY JOURNAL, 2020, 29 (06): : 267 - 283
  • [34] Deep Learning for Proteomics Data for Feature Selection and Classification
    Iravani, Sahar
    Conrad, Tim O. F.
    MACHINE LEARNING AND KNOWLEDGE EXTRACTION, CD-MAKE 2019, 2019, 11713 : 301 - 316
  • [35] TRAINING SAMPLE SELECTION FOR DEEP LEARNING OF DISTRIBUTED DATA
    Jiang, Zheng
    Zhu, Xiaoqing
    Tan, Wai-tian
    Liston, Rob
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 2189 - 2193
  • [36] Optimized Deep Learning Classification Model for Intelligent Edge devices
    Naveen, Soumyalatha
    Kounte, Manjunath R
    Journal of Engineering Science and Technology Review, 2024, 17 (03) : 88 - 94
  • [37] Optimized Deep Learning Model for Disease Prediction in Potato Leaves
    Shrivastava V.K.
    Shelke C.J.
    Shrivastava A.
    Mohanty S.N.
    Sharma N.
    EAI Endorsed Transactions on Pervasive Health and Technology, 2023, 9 (01)
  • [38] Model-Parallel Model Selection for Deep Learning Systems
    Nagrecha, Kabir
    SIGMOD '21: PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2021, : 2929 - 2931
  • [39] CLASSIFICATION OF SENTIMENT USING OPTIMIZED HYBRID DEEP LEARNING MODEL
    Touate, Chaima Ahle
    EL Ayachi, Rachid
    Biniz, Mohamed
    COMPUTING AND INFORMATICS, 2023, 42 (03) : 651 - 666
  • [40] Environmental microorganism classification using optimized deep learning model
    Liang, Chih-Ming
    Lai, Chun-Chi
    Wang, Szu-Hong
    Lin, Yu-Hao
    ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH, 2021, 28 (24) : 31920 - 31932