GOLGI: Performance-Aware, Resource-Efficient Function Scheduling for Serverless Computing

被引:4
|
作者
Li, Suyi [1 ]
Wang, Wei [1 ]
Yang, Jun [2 ]
Chen, Guangzhen [2 ]
Lu, Daohe [2 ]
机构
[1] HKUST, Hong Kong, Peoples R China
[2] WeBank, Shenzhen, Peoples R China
关键词
Serverless Computing; Resource Management; Scheduling;
D O I
10.1145/3620678.3624645
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper introduces GOLGI, a novel scheduling system designed for serverless functions, with the goal of minimizing resource provisioning costs while meeting the function latency requirements. To achieve this, GOLGI judiciously over-commits functions based on their past resource usage. To ensure overcommitment does not cause significant performance degradation, GOLGI identifies nine low-level metrics to capture the runtime performance of functions, encompassing factors like request load, resource allocation, and contention on shared resources. These metrics enable accurate prediction of function performance using the Mondrian Forest, a classification model that is continuously updated in real-time for optimal accuracy without extensive offline training. GOLGI employs a conservative exploration-exploitation strategy for request routing. By default, it routes requests to non-overcommitted instances to ensure satisfactory performance. However, it actively explores opportunities for using more resource-efficient overcommitted instances, while maintaining the specified latency SLOs. Golgi also performs vertical scaling to dynamically adjust the concurrency of overcommitted instances, maximizing request throughput and enhancing system robustness to prediction errors. We have prototyped GOLGI and evaluated it in both EC2 cluster and a small production cluster. The results show that GOLGI can meet the SLOs while reducing the resource provisioning cost by 42% (30%) in EC2 cluster (our production cluster).
引用
收藏
页码:32 / 47
页数:16
相关论文
共 50 条
  • [1] Owl: Performance-Aware Scheduling for Resource-Efficient Function-as-a-Service Cloud
    Tian, Huangshi
    Li, Suyi
    Wang, Ao
    Wang, Wei
    Wu, Tianlong
    Yang, Haoran
    [J]. PROCEEDINGS OF THE 13TH SYMPOSIUM ON CLOUD COMPUTING, SOCC 2022, 2022, : 78 - 93
  • [2] Energy-Aware Resource Scheduling for Serverless Edge Computing
    Aslanpour, Mohammad Sadegh
    Toosi, Adel N.
    Cheema, Muhammad Aamir
    Gaire, Raj
    [J]. 2022 22ND IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING (CCGRID 2022), 2022, : 190 - 199
  • [3] A Secure, Fast, and Resource-Efficient Serverless Platform with Function REWIND
    Song, Jaehyun
    Kim, Bumsuk
    Kwak, Minwoo
    Lee, Byoungyoung
    Seo, Euiseong
    Jeong, Jinkyu
    [J]. PROCEEDINGS OF THE 2024 USENIX ANNUAL TECHNICAL CONFERENCE, ATC 2024, 2024, : 597 - 613
  • [4] Maxwell's Demon in Tail-tolerant, Resource-efficient Serverless Computing
    Zhang, Huanyu
    Huang, Wenhao
    Zhao, Laiping
    Li, Keqiu
    [J]. 2022 IEEE 28TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS, ICPADS, 2022, : 762 - 769
  • [5] Towards an Energy Efficient Computing With Coordinated Performance-Aware Scheduling in Large Scale Data Clusters
    Hamandawana, Prince
    Mativenga, Ronnie
    Kwon, Se Jin
    Chung, Tae-Sun
    [J]. IEEE ACCESS, 2019, 7 : 140261 - 140277
  • [6] FCloudless: A Performance-Aware Collaborative Mechanism for JointCloud Serverless
    Liu, Jianfei
    Wang, Huaimin
    Shi, Peichang
    Li, Yaojie
    Ma, Penghui
    Yi, Guodong
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON JOINT CLOUD COMPUTING, JCC, 2023, : 93 - 94
  • [7] A Package-Aware Approach for Function Scheduling in Serverless Computing Environments
    Chetabi, Faeze Azimi
    Ashtiani, Mehrdad
    Saeedizade, Ehsan
    [J]. JOURNAL OF GRID COMPUTING, 2023, 21 (02)
  • [8] A Package-Aware Approach for Function Scheduling in Serverless Computing Environments
    Faeze Azimi Chetabi
    Mehrdad Ashtiani
    Ehsan Saeedizade
    [J]. Journal of Grid Computing, 2023, 21
  • [9] Energy and Performance-Aware Task Scheduling in a Mobile Cloud Computing Environment
    Lin, Xue
    Wang, Yanzhi
    Xie, Qing
    Pedram, Massoud
    [J]. 2014 IEEE 7TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD), 2014, : 192 - 199
  • [10] Function-Aware Resource Management Framework for Serverless Edge Computing
    Ko, Haneul
    Pack, Sangheon
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (02) : 1310 - 1319