Deployment of Containerized Deep Learning Applications in the Cloud

被引:0
|
作者
Doukha, Rim [1 ,2 ]
Mahmoudi, Sidi Ahmed [1 ]
Zbakh, Mostapha [2 ]
Manneback, Pierre [1 ]
机构
[1] Univ Mons, Fac Engn, Comp Sci & Artificial Intelligence Dept, Mons, Belgium
[2] Mohamed V Univ, Natl Sch Comp Sci & Syst Anal, Rabat, Morocco
关键词
Cloud Computing; Application Deployment; Application Migration; Kubernetes; Docker; Ansible; Slurm; Deep Learning;
D O I
10.1109/CloudTech49835.2020.9365868
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
During the last years, the use of Cloud computing environment has increased as a result of the various services offered by Cloud providers (Amazon Web Services, Google Cloud, Microsoft Azure, etc.). Many companies are moving their data and applications to the Cloud in order to tackle the complex configuration effort, for having more flexibility, maintenance, and resource availability. However, it is important to mention the challenges that developers may face when using a Cloud solution such as the variation of applications requirements (in terms of computation, memory and energy consumption) over time, which makes the deployment and migration a hard process. In fact, the deployment will not depend only on the application, but it will also rely on the related services and hardware for the proper functioning of the application. In this paper, we propose a Cloud infrastructure for automatic deployment of applications using the services of Kubernetes, Docker, Ansible and Slurm. Our architecture includes a script to deploy the application depending of its requirement needs. Experiments are conducted with the analysis and the deployment of Deep Learning (DL) applications and more particularly images classification and object localization.
引用
收藏
页码:151 / 156
页数:6
相关论文
共 50 条
  • [31] Rapid Deployment for Machine Learning in the Educational Cloud
    Takabe, Yuichiro
    Uehara, Minoru
    2013 16TH INTERNATIONAL CONFERENCE ON NETWORK-BASED INFORMATION SYSTEMS (NBIS 2013), 2013, : 372 - 376
  • [32] GuardGrid: a high-availability cloud platform for deep learning applications
    Yifan Sui
    Meng Cai
    Jianxun Li
    Cluster Computing, 2025, 28 (5)
  • [33] Analysis of Cybersecurity Threats in Cloud Applications Using Deep Learning Techniques
    Sokolov, S. A.
    Iliev, T. B.
    Stoyanov, I. S.
    2019 42ND INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2019, : 441 - 446
  • [34] Auto-scaling containerized cloud applications: A workload-driven approach
    Chouliaras, Spyridon
    Sotiriadis, Stelios
    SIMULATION MODELLING PRACTICE AND THEORY, 2022, 121
  • [35] Middleware-Oriented Deployment Automation for Cloud Applications
    Wettinger, Johannes
    Andrikopoulos, Vasilios
    Leymann, Frank
    Strauch, Steve
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2018, 6 (04) : 1054 - 1066
  • [36] Performance-Oriented Deployment of Streaming Applications on Cloud
    Liu, Xunyun
    Buyya, Rajkumar
    IEEE TRANSACTIONS ON BIG DATA, 2019, 5 (01) : 46 - 59
  • [37] Predicting cloud performance for HPC applications before deployment
    Mariani, Giovanni
    Anghel, Andreea
    Jongerius, Rik
    Dittmann, Gero
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2018, 87 : 618 - 628
  • [38] Deployment Strategies for Distributed Applications on Cloud Computing Infrastructures
    van der Veen, Jan Sipke
    Lazovik, Elena
    Makkes, Marc X.
    Meijer, Robert J.
    2013 IEEE FIFTH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING TECHNOLOGY AND SCIENCE (CLOUDCOM), VOL 2, 2013, : 228 - 233
  • [39] DYNTAIL - Dynamically Tailored Deployment Engines for Cloud Applications
    Wettinger, Johannes
    Breitenbuecher, Uwe
    Leymann, Frank
    2015 IEEE 8TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, 2015, : 421 - 428
  • [40] Characterizing and Evaluating Different Deployment Approaches for Cloud Applications
    Wettinger, Johannes
    Andrikopoulos, Vasilios
    Strauch, Steve
    Leymann, Frank
    2014 IEEE INTERNATIONAL CONFERENCE ON CLOUD ENGINEERING (IC2E), 2014, : 205 - 214