Eigen: End-to-end Resource Optimization for Large-Scale Databases on the Cloud

被引:4
|
作者
Li, Ji You [1 ]
Zhang, Jiachi [1 ]
Zhou, Wenchao [1 ]
Liu, Yuhang [1 ]
Zhang, Shuai [1 ]
Xue, Zhuoming [1 ]
Xu, Ding [1 ]
Fan, Hua [1 ]
Zhou, Fangyuan [1 ]
Li, Feifei [1 ]
机构
[1] Alibaba Grp, Hangzhou, Peoples R China
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2023年 / 16卷 / 12期
关键词
D O I
10.14778/3611540.3611565
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Increasingly, cloud database vendors host large-scale geographically distributed clusters to provide cloud database services. When managing the clusters, we observe that it is challenging to simultaneously maximizing the resource allocation ratio and resource availability. This problem becomes more severe in modern cloud database clusters, where resource allocations occur more frequently and on a greater scale. To improve the resource allocation ratio without hurting resource availability, we introduce Eigen, a large-scale cloud-native cluster management system for large-scale databases on the cloud. Based on a resource flow model, we propose a hierarchical resource management system and three resource optimization algorithms that enable end-to-end resource optimization. Furthermore, we demonstrate the system optimization that promotes user experience by reducing scheduling latencies and improving scheduling throughput. Eigen has been launched in a large-scale public-cloud production environment for 30+ months and served more than 30+ regions (100+ available zones) globally. Based on the evaluation of real-world clusters and simulated experiments, Eigen can improve the allocation ratio by over 27% (from 60% to 87.0%) on average, while the ratio of delayed resource provisions is under 0.1%.
引用
收藏
页码:3795 / 3807
页数:13
相关论文
共 50 条
  • [21] Data aggregation with end-to-end confidentiality and integrity for large-scale wireless sensor networks
    Cui, Jie
    Shao, Lili
    Zhong, Hong
    Xu, Yan
    Liu, Lu
    PEER-TO-PEER NETWORKING AND APPLICATIONS, 2018, 11 (05) : 1022 - 1037
  • [22] Data aggregation with end-to-end confidentiality and integrity for large-scale wireless sensor networks
    Jie Cui
    Lili Shao
    Hong Zhong
    Yan Xu
    Lu Liu
    Peer-to-Peer Networking and Applications, 2018, 11 : 1022 - 1037
  • [23] Walle: An End-to-End, General-Purpose, and Large-Scale Production System for Device-Cloud Collaborative Machine Learning
    Lv, Chengfei
    Niu, Chaoyue
    Gu, Renjie
    Jiang, Xiaotang
    Wang, Zhaode
    Liu, Bin
    Wu, Ziqi
    Yao, Qiulin
    Huang, Congyu
    Huang, Panos
    Huang, Tao
    Shu, Hui
    Song, Jinde
    Zou, Bin
    Lan, Peng
    Xu, Guohuan
    Wu, Fei
    Tang, Shaojie
    Wu, Fan
    Chen, Guihai
    PROCEEDINGS OF THE 16TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, OSDI 2022, 2022, : 249 - 265
  • [24] KNOWLEDGE TRANSFER FROM LARGE-SCALE PRETRAINED LANGUAGE MODELS TO END-TO-END SPEECH RECOGNIZERS
    Kubo, Yotaro
    Karita, Shigeki
    Bacchiani, Michiel
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 8512 - 8516
  • [25] Accurate End-to-End Delay Bound Analysis for Large-Scale Network Via Experimental Comparison
    Hong, Xiao
    Gao, Yuehong
    Yang, Hongwen
    IEICE TRANSACTIONS ON COMMUNICATIONS, 2022, E105B (04) : 472 - 484
  • [26] An End-to-End Localizer for Long-Term Topological Localization in Large-Scale Changing Environments
    Cao, Fengkui
    Wu, Hao
    Wu, Chengdong
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2023, 70 (05) : 5140 - 5149
  • [27] Affordable End-to-End Solution for Change Detection and Progress Monitoring in Large-Scale Construction Sites
    Będkowski, Janusz
    SSRN, 1600,
  • [28] TextOCR: Towards large-scale end-to-end reasoning for arbitrary-shaped scene text
    Singh, Amanpreet
    Peng, Guan
    Toh, Mandy
    Huang, Jing
    Galuba, Wojciech
    Hassner, Tal
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 8798 - 8808
  • [29] From Pixels to Buildings: End-to-end Probabilistic Deep Networks for Large-scale Semantic Mapping
    Zheng, Kaiyu
    Pronobis, Andrzej
    2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 3511 - 3518
  • [30] Resilient End-to-End Message Protection for Large-scale Cyber-Physical System Communications
    Kim, Young-Jin
    Kolesnikov, Vladimir
    Thottan, Marina
    2012 IEEE THIRD INTERNATIONAL CONFERENCE ON SMART GRID COMMUNICATIONS (SMARTGRIDCOMM), 2012, : 193 - 198