A lightweight performance proxy for deep-learning model training on Amazon SageMaker

Authors
Tesser, Rafael Keller [1, 2, 3]
Marques, Alvaro [2]
Borin, Edson [2]
Affiliations
[1] Univ Campinas Unicamp, Ctr Comp Engn & Sci, Sao Paulo, Brazil
[2] Univ Campinas Unicamp, Inst Comp, Sao Paulo, Brazil
[3] Fed Univ Technol Parana UTFPR, Bachelors Course Comp Sci, Santa Helena, PR, Brazil
Keywords
cloud computing; cost prediction; deep learning; machine learning; performance prediction
DOI
10.1002/cpe.8104
Chinese Library Classification number
TP31 [Computer Software]
Discipline classification codes
081202; 0835
Abstract
Cloud computing has become popular for training deep-learning (DL) models because it avoids the costs of acquiring and maintaining on-premise systems. SageMaker is a cloud service that automates the execution of DL workloads. Its features include automatic hyperparameter optimization and the use of spot instances. Nonetheless, it does not assist in selecting the right instance type for a workload. In public clouds, the rental price depends on the configuration of the chosen instance type. More advanced, faster instances are typically more expensive, but they are not always the best choice. To select the optimal instance type, users must compare the workload's relative performance (and hence cost) on several candidates. Building on the execution profiles of multiple DL applications, we model the performance and cost of training DL applications on SageMaker and propose a lightweight technique to estimate both at low temporal and monetary cost. This method is a performance proxy that can replace more expensive performance measurement procedures, so it could speed up any technique that relies on such measurements. We show how it can help cloud customers seeking suitable instance types to train DL models, and that it can accurately predict the performance of different instance types when training these models on SageMaker.
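The trade-off described in the abstract, where a faster and more expensive instance is not necessarily the cheapest way to finish a training job, can be illustrated with a minimal sketch. This is not the authors' proxy method; it only shows the cost comparison that any time estimate would feed into. The instance names, hourly prices, and estimated training times below are hypothetical placeholders, not real SageMaker instance types or prices.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    """One candidate instance type (all values are hypothetical)."""
    name: str
    price_per_hour: float  # assumed rental price, in USD per hour
    est_hours: float       # assumed estimated training time, in hours

    @property
    def cost(self) -> float:
        # Total training cost is rental price multiplied by training time.
        return self.price_per_hour * self.est_hours


def cheapest(candidates):
    """Return the candidate with the lowest estimated total cost."""
    return min(candidates, key=lambda c: c.cost)


# Hypothetical candidates: the fastest one is not the cheapest overall.
candidates = [
    Candidate("small-gpu", price_per_hour=1.0, est_hours=10.0),
    Candidate("large-gpu", price_per_hour=4.0, est_hours=2.0),
    Candidate("multi-gpu", price_per_hour=12.0, est_hours=1.0),
]

best = cheapest(candidates)
print(f"{best.name}: estimated cost {best.cost:.2f} USD")
```

With these placeholder numbers, the mid-range option has the lowest total cost even though it is neither the cheapest per hour nor the fastest, which is why per-instance performance estimates are needed before choosing.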
Pages: 22