A lightweight performance proxy for deep-learning model training on Amazon SageMaker

被引：0

作者：

Tesser, Rafael Keller ^{[1
,2
,3
]}

Marques, Alvaro ^{[2
]}

Borin, Edson ^{[2
]}

机构：

[1] Univ Campinas Unicamp, Ctr Comp Engn & Sci, Sao Paulo, Brazil

[2] Univ Campinas Unicamp, Inst Comp, Sao Paulo, Brazil

[3] Fed Univ Technol Parana UTFPR, Bachelors Course Comp Sci, Santa Helena, PR, Brazil

来源：

CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE | 2024年 / 36卷 / 14期

关键词：

cloud computing; cost prediction; deep learning; machine learning; performance prediction;

D O I：

10.1002/cpe.8104

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Cloud computing has become popular for training deep-learning (DL) models, avoiding the costs of acquiring and maintaining on-premise systems. SageMaker is a cloud service that automates the execution of DL workloads. Its features include automatic hyperparameter optimization and use of spot instances. Nonetheless, it does not assist in selecting the right instance type for a workload. In public clouds, rent price depends on the configuration of the chosen instance type. Advanced and faster instances are typically more expensive, but not always the best choice. To select the optimal instance type, users must compare the workload's relative performance (and hence cost) on several candidates. Building on the execution profiles of multiple DL applications, we model the performance and cost of training DL applications on SageMaker and propose a lightweight technique to estimate these at low temporal and monetary cost. This method is a performance proxy that can be used to replace more expensive performance measurement procedures. So, it could speed up any technique that relies on such measurements. We show how it can help cloud customers seeking suitable instance types to train DL models, and that it can accurately predict the performance of different instance types when training these models on SageMaker.

引用

页数：22

共 50 条

[21] Exploring Symmetry in Digital Image Forensics Using a Lightweight Deep-Learning Hybrid Model for Multiple Smoothing Operators
Agarwal, Saurabh
Jung, Ki-Hyun
SYMMETRY-BASEL, 2023, 15 (12):
[22] Deep-learning model for screening sepsis using electrocardiography
Kwon, Joon-myoung
Lee, Ye Rang
Jung, Min-Seung
Lee, Yoon-Ji
Jo, Yong-Yeon
Kang, Da-Young
Lee, Soo Youn
Cho, Yong-Hyeon
Shin, Jae-Hyun
Ban, Jang-Hyeon
Kim, Kyung-Hee
SCANDINAVIAN JOURNAL OF TRAUMA RESUSCITATION & EMERGENCY MEDICINE, 2021, 29 (01):
[23] Automated deep-learning model optimization framework for microcontrollers
Hong, Seungtae
Park, Gunju
Kim, Jeong-Si
ETRI JOURNAL, 2024,
[24] An Explainable Deep-learning Model of Proton Auroras on Mars
Dhuri, Dattaraj B.
Atri, Dimitra
Alhantoobi, Ahmed
PLANETARY SCIENCE JOURNAL, 2024, 5 (06):
[25] A deep-learning model for the density profiles of subhaloes in IllustrisTNG
Lucie-Smith, Luisa
Despali, Giulia
Springel, Volker
MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2024, 532 (01) : 164 - 176
[26] Enhanced Deep-Learning Model for Carbon Footprints of Chemicals
Zhang, Dachuan
Wang, Zhanyun
Oberschelp, Christopher
Bradford, Eric
Hellweg, Stefanie
ACS SUSTAINABLE CHEMISTRY & ENGINEERING, 2024, 12 (07) : 2700 - 2708
[27] An integrated deep-learning model for smart waste classification
Shivendu Mishra
Ritika Yaduvanshi
Prince Rajpoot
Sharad Verma
Amit Kumar Pandey
Digvijay Pandey
Environmental Monitoring and Assessment, 2024, 196
[28] An integrated deep-learning model for smart waste classification
Mishra, Shivendu
Yaduvanshi, Ritika
Rajpoot, Prince
Verma, Sharad
Pandey, Amit Kumar
Pandey, Digvijay
ENVIRONMENTAL MONITORING AND ASSESSMENT, 2024, 196 (03)
[29] Predicting progression to AD using a deep-learning model
Kelsey R.
Nature Reviews Neurology, 2019, 15 (9) : 492 - 492
[30] DDoSNet: A Deep-Learning Model for Detecting Network Attacks
Elsayed, Mahmoud Said
Nhien-An Le-Khac
Dev, Soumyabrata
Jurcut, Anca Delia
2020 21ST IEEE INTERNATIONAL SYMPOSIUM ON A WORLD OF WIRELESS, MOBILE AND MULTIMEDIA NETWORKS (IEEE WOWMOM 2020), 2020, : 391 - 396

← 1 2 3 4 5 →