The globus compute dataset: An open function-as-a-service dataset from the edge to the cloud

被引:7
|
作者
Bauer, Andre [1 ,2 ]
Pan, Haochen [1 ]
Chard, Ryan [2 ]
Babuji, Yadu [1 ]
Bryan, Josh [1 ]
Tiwari, Devesh [3 ]
Foster, Ian [1 ,2 ]
Chard, Kyle [1 ,2 ]
机构
[1] Univ Chicago, Chicago, IL 60637 USA
[2] Argonne Natl Lab, Argonne, IL USA
[3] Northeastern Univ, Boston, MA 02138 USA
基金
美国国家科学基金会;
关键词
Serverless computing; Globus compute; FAIR dataset; Computing continuum;
D O I
10.1016/j.future.2023.12.007
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We present a unique function -as -a -service (FaaS) dataset capturing the use of the Globus Compute (previously funcX) platform. Globus Compute implements a federated model via which users may deploy endpoints on arbitrary remote computers, from the edge to high performance computing (HPC) cluster, and they may then invoke Python functions on those endpoints via a reliable cloud -hosted service. The dataset covers 31 weeks and includes 2121472 task submissions from 252 users executed on 580 remote computing endpoints. It includes 277386 registered functions. We describe the dataset and various observations, some that are similar to other FaaS datasets, for example, that 74% of tasks run for less than 1 s, and some that are unique to Globus Compute, for example, that endpoints are used in different ways and that the majority of functions are related to scientific computing and machine learning. To the best of our knowledge, this dataset represents the first federated FaaS dataset that includes user workloads, distributed computing endpoints, and analysis of registered function bodies. We expect the dataset to be useful for researching FaaS architectures, workload modeling, container warming, and other distributed computing architectures.
引用
收藏
页码:558 / 574
页数:17
相关论文
共 50 条
  • [1] Serverledge: Decentralized Function-as-a-Service for the Edge-Cloud Continuum
    Russo, Gabriele Russo
    Mannucci, Tiziana
    Cardellini, Valeria
    Lo Presti, Francesco
    2023 IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATIONS, PERCOM, 2023, : 131 - 140
  • [2] DFaaS: Decentralized Function-as-a-Service for Federated Edge Computing
    Ciavotta, Michele
    Motterlini, Davide
    Savi, Marco
    Tundo, Alessandro
    2021 IEEE 10TH INTERNATIONAL CONFERENCE ON CLOUD NETWORKING (IEEE CLOUDNET), 2021, : 1 - 4
  • [3] Dataset Anonyization on Cloud: Open Problems and Perspectives
    Cristani, Matteo
    Tomazzoli, Claudio
    CURRENT TRENDS IN WEB ENGINEERING, ICWE 2019 INTERNATIONAL WORKSHOPS, 2020, 11609 : 74 - 85
  • [4] Simulators for system dataset generation in the Edge -to -Cloud Continuum
    Ali, Nawaz
    Aloi, Gianluca
    Pace, Pasquale
    Gianfelice, Michele
    Pupo, Francesco
    Gravina, Raffaele
    Fortino, Giancarlo
    2024 20TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING IN SMART SYSTEMS AND THE INTERNET OF THINGS, DCOSS-IOT 2024, 2024, : 583 - 588
  • [5] A Large Dataset Enhanced Watermarking Service for Cloud Environments
    Zawawi, Nour
    Hamdy, Mohamed
    El-Gohary, Rania
    Tolba, Mohamed Fahmy
    ADVANCED MACHINE LEARNING TECHNOLOGIES AND APPLICATIONS, AMLTA 2014, 2014, 488 : 87 - 96
  • [6] Resource Provisioning and Allocation in Function-as-a-Service Edge-Clouds
    Ascigil, Onur
    Tasiopoulos, Argyrios G.
    Truong Khoa Phan
    Sourlas, Vasilis
    Psaras, Ioannis
    Pavlou, George
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2022, 15 (04) : 2410 - 2424
  • [7] A New Dataset and Benchmark for Cloud Computing Service Composition
    Jula, Amin
    Nilsaz, Hamid
    Sundararajan, Elankovan
    Othman, Zalinda
    PROCEEDINGS FIFTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS, MODELLING AND SIMULATION, 2014, : 83 - 86
  • [8] PrEstoCloud: A Novel Framework for Data-Intensive Multi-Cloud, Fog, and Edge Function-as-a-Service Applications
    Verginadis, Yiannis
    Apostolou, Dimitris
    Taherizadeh, Salman
    Ledakis, Ioannis
    Mentzas, Gregoris
    Tsagkaropoulos, Andreas
    Papageorgiou, Nikos
    Paraskevopoulos, Fotis
    INFORMATION RESOURCES MANAGEMENT JOURNAL, 2021, 34 (01) : 66 - 85
  • [9] Function-as-a-Service for the Cloud-to-Thing Continuum: A Systematic Mapping Study
    Da Silva Oliveira, Barbara
    Ferry, Nicolas
    Song, Hui
    Dautov, Rustem
    Barisic, Ankica
    da Rocha, Atslands Rego
    PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON INTERNET OF THINGS, BIG DATA AND SECURITY, IOTBDS 2023, 2023, : 82 - 93
  • [10] A Preliminary Review of Enterprise Serverless Cloud Computing (Function-as-a-Service) Platforms
    Lynn, Theo
    Rosati, Pierangelo
    Lejeune, Arnaud
    Emeakaroha, Vincent
    2017 9TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING TECHNOLOGY AND SCIENCE (CLOUDCOM), 2017, : 162 - 169