Building and Evaluation of Cloud Storage and Datasets Services on AI and HPC Converged Infrastructure

被引:4
|
作者
Tanimura, Yusuke [1 ]
Takizawa, Shinichiro [1 ]
Ogawa, Hirotaka [1 ]
Hamanishi, Takahiro [1 ]
机构
[1] Natl Inst Adv Ind Sci & Technol, Koto Ku, Aomi 2-4-7, Tokyo, Japan
关键词
cloud storage; S3; AI & HPC converged system; dataset sharing; object storage; SCIENCE;
D O I
10.1109/BigData50022.2020.9377729
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
AI Bridging Cloud Infrastructure (ABCI) is a world-leading open AI computing infrastructure, for accelerating R&D activities of artificial intelligence. In order to share and reuse AI software assets with ease, ABCI supports container-based application deployment and fine-grained resource allocation on top of the conventional HPC architecture, and provides tens of peta-bytes of high performance storage. One of the on-going major challenges in ABCI, however, is to more efficiently and flexibly exchange and share machine learning models and data related to AI, with other services deployed outside of ABCI in the real world. Our new services called as ABCI Cloud Storage and ABCI Public Datasets are designed for tackling the challenge and taking a role of "Data Harbor" of ABCI. The services allow users to store input and output data of jobs to be run on the ABCI compute nodes, and to share them with not only ABCI users but also non-ABCI users. This paper presents our design and integration of the services to conventional HPC architecture, as a case of ABCI, and reports performance evaluation of them. Based on our attempt and experience, the paper finally summarizes discussion about future direction of the S3 based front data/storage service of the AI and HPC converged system.
引用
收藏
页码:1992 / 2001
页数:10
相关论文
共 17 条
  • [1] FPGA acceleration in EVOLVE's Converged Cloud-HPC Infrastructure
    Koliogeorgi, Konstantina
    Keddous, Fekhr Eddine
    Masouros, Dimosthenis
    Chazapis, Antony
    Aubrun, Michelle
    Xydis, Sotirios
    Bilas, Angelos
    Hugues, Romain
    Acquaviva, Jean-Thomas
    Nguyen, Huy Nam
    Soudris, Dimitrios
    [J]. 2021 31ST INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS (FPL 2021), 2021, : 376 - 377
  • [2] Evaluation of the impact the hyper-converged infrastructure storage subsystem synchronization on the overall performance
    Shvidkiy, Artem A.
    Spirkina, Anastasia, V
    Savelieva, Anastasiia A.
    Tarlykov, Aleksey, V
    [J]. 2020 12TH INTERNATIONAL CONGRESS ON ULTRA MODERN TELECOMMUNICATIONS AND CONTROL SYSTEMS AND WORKSHOPS (ICUMT 2020), 2020, : 248 - 252
  • [3] Archival Data Repository Services to Enable HPC and Cloud Workflows in a Federated Research e-Infrastructure
    Alam, Sadaf R.
    Bartolome, Javier
    Bassini, Sanzio
    Carpene, Michele
    Cestari, Mirko
    Combeau, Frederic
    Girona, Sergi
    Gorini, Stefano
    Fiameni, Giuseppe
    Hagemeier, Bjoern
    Hater, Thorsten
    Herten, Andreas
    Kiapidou, Nikoleta
    Klijn, Wouter
    Krause, Dorian
    Lafoucriere, Jacques-Charles
    Leong, Cerlane
    Leibovici, Thomas
    Lippert, Thomas
    McMurtrie, Colin J.
    Mezentsev, Pavel
    Nahm, Anne
    Orth, Boris
    Pleiter, Dirk
    Schulthess, Thomas C.
    von St Vieth, Benedikt
    Testi, Debora
    Wiber, Gilles
    [J]. PROCEEDINGS OF 2020 3RD IEEE/ACM INTERNATIONAL WORKSHOP ON INTEROPERABILITY OF SUPERCOMPUTING AND CLOUD TECHNOLOGIES (SUPERCOMPCLOUD 2020), 2020, : 39 - 44
  • [4] Building an Expert System for Evaluation of Commercial Cloud Services
    Li, Zheng
    O'Brien, Liam
    Cai, Rainbow
    Zhang, He
    [J]. 2012 INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND SERVICE COMPUTING (CSC), 2012, : 168 - 175
  • [5] Federated and secure cloud services for building medical image classifiers on an intercontinental infrastructure
    Blanquer, Ignacio
    Brasileiro, Francisco
    Brito, Andrey
    Calatrava, Amanda
    Carvalho, Andre
    Fetzer, Christof
    Figueiredo, Flavio
    Guimaraes, Ronny Petterson
    Marinho, Leandro
    Meira, Wagner, Jr.
    Silva, Altigran
    Alberich-Bayarri, Angel
    Camacho-Ramos, Eduardo
    Jimenez-Pastor, Ana
    Ribeiro, Antonio Luiz L.
    Nascimento, Bruno Ramos
    Silva, Fabio
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2020, 110 : 119 - 134
  • [6] Performance and Availability Evaluation of Storage Services in Private Cloud
    Torres, Elton Bezerra
    Callou, Gustavo
    Alves, Gabriel
    Accioly, Jose
    Gustavo, Hallyson
    [J]. 2016 11TH IBERIAN CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES (CISTI), 2016,
  • [7] Workload models and performance evaluation of cloud storage services
    Goncalves, Glauber D.
    Drago, Idilio
    Vieira, Alex B.
    Couto da Silva, Ana Paula
    Almeida, Jussara M.
    Mellia, Marco
    [J]. COMPUTER NETWORKS, 2016, 109 : 183 - 199
  • [8] Performance Evaluation of a Private Cloud Storage Infrastructure Service for Document Preservation
    Ferreira, Antonio M. A.
    Drummond, Andre C.
    de Araujo, Aleteia Patricia F.
    [J]. 2017 12TH IBERIAN CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES (CISTI), 2017,
  • [9] Personal Cloud Storage Services Evaluation Model Based on User Experience
    Xie, Yingying
    Cheng, Yan
    Yao, Yue
    [J]. PROCEEDINGS OF 2019 IEEE 8TH JOINT INTERNATIONAL INFORMATION TECHNOLOGY AND ARTIFICIAL INTELLIGENCE CONFERENCE (ITAIC 2019), 2019, : 736 - 740
  • [10] Building a Reputation Attack Detector for Effective Trust Evaluation in a Cloud Services Environment
    Alshammari, Salah T.
    Alsubhi, Khalid
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (18):