Shard Manager: A Generic Shard Management Framework for Geo-distributed Applications

被引:10
|
作者
Lee, Sangmin [1 ]
Guo, Zhenhua [1 ]
Sunercan, Omer [1 ]
Ying, Jun [1 ]
Kooburat, Thawan [1 ]
Biswal, Suryadeep [1 ]
Chen, Jun [1 ]
Huang, Kun [1 ]
Cheung, Yatpang [1 ]
Zhou, Yiding [1 ]
Veeraraghavan, Kaushik [1 ]
Damani, Biren [1 ]
Ruiz, Pol Mauri [1 ]
Mehta, Vikas [1 ]
Tang, Chunqiang [1 ]
机构
[1] Facebook Inc, Menlo Pk, CA 94025 USA
关键词
shard management; sharding; availability;
D O I
10.1145/3477132.3483546
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Sharding is widely used to scale an application. Despite a decade of effort to build generic sharding frameworks that can be reused across different applications, the extent of their success remains unclear. We attempt to answer a fundamental question: what barriers prevent a sharding framework from getting adopted by the majority of sharded applications? We analyze hundreds of sharded applications at Facebook and identify two major barriers: 1) lack of support for geo-distributed applications, which account for most of Facebook's applications, and 2) inability to maintain application availability during planned events such as software upgrades, which happen similar to 1000 times more frequently than unplanned failures. A sharding framework that does not help applications to address these fundamental challenges is not sufficiently attractive for most applications to adopt it. Other adoption barriers include the burden of supporting many complex applications in a one-size-fit-all sharding framework and the difficulty in supporting sophisticated shard-placement requirements. Theoretically, a constraint solver can handle complex placement requirements, but in practice it is not scalable enough to perform near-realtime shard placement at a global scale. We have overcome these adoption barriers in Facebook's sharding framework called Shard Manager. Currently, Shard Manager is used by hundreds of applications running on over one million machines, which account for about 54% of all sharded applications at Facebook.
引用
收藏
页码:553 / 569
页数:17
相关论文
共 50 条
  • [41] SpeCH: A scalable framework for data placement of data-intensive services in geo-distributed clouds
    Atrey, Ankita
    Van Seghbroeck, Gregory
    Mora, Higinio
    De Turck, Filip
    Volckaert, Bruno
    [J]. JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2019, 142 : 1 - 14
  • [42] Blockchain-based decentralized workload and energy management of geo-distributed data centers
    Sajid, Sara
    Jawad, Muhammad
    Hamid, Kanza
    Khan, Muhammad U. S.
    Ali, Sahibzada M.
    Abbas, Assad
    Khan, Samee U.
    [J]. SUSTAINABLE COMPUTING-INFORMATICS & SYSTEMS, 2021, 29
  • [43] Location-aware Associated Data Placement for Geo-distributed Data-intensive Applications
    Yu, Boyang
    Pan, Jianping
    [J]. 2015 IEEE CONFERENCE ON COMPUTER COMMUNICATIONS (INFOCOM), 2015,
  • [44] An Intelligent Resource Reservation for Crowdsourced Live Video Streaming Applications in Geo-Distributed Cloud Environment
    Baccour, Emna
    Haouari, Fatima
    Erbad, Aiman
    Mohamed, Amr
    Bilal, Kashif
    Guizani, Mohsen
    Hamdi, Mounir
    [J]. IEEE SYSTEMS JOURNAL, 2022, 16 (01): : 240 - 251
  • [45] Temporal Task Scheduling for Delay-constrained Applications in Geo-Distributed Cloud Data Centers
    Bi, Jing
    Yuan, Haitao
    Zhang, Jia
    Zhou, MengChu
    [J]. PROCEEDINGS 2018 IEEE 11TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD), 2018, : 138 - 145
  • [46] Genetic Based Data Placement for Geo-Distributed Data-Intensive Applications in Cloud Computing
    Fan, Weifeng
    Peng, Jun
    Zhang, Xiaoyong
    Huang, Zhiwu
    [J]. ADVANCES IN SERVICES COMPUTING, 2016, 10065 : 253 - 265
  • [47] A generic deployment framework for grid computing and distributed applications
    Flissi, Areski
    Merle, Philippe
    [J]. ON THE MOVE TO MEANINGFUL INTERNET SYSTEMS 2006: COOPIS, DOA, GADA, AND ODBASE PT 2, PROCEEDINGS, 2006, 4276 : 1402 - 1411
  • [48] A General Communication Cost Optimization Framework for Big Data Stream Processing in Geo-Distributed Data Centers
    Gu, Lin
    Zeng, Deze
    Guo, Song
    Xiang, Yong
    Hu, Jiankun
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2016, 65 (01) : 19 - 29
  • [49] H2F: a Hierarchical Hadoop Framework for big data processing in geo-distributed environments
    Cavallo, Marco
    Di Modica, Giuseppe
    Polito, Carmelo
    Tomarchio, Orazio
    [J]. 2016 3RD IEEE/ACM INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING, APPLICATIONS AND TECHNOLOGIES (BDCAT), 2016, : 27 - 35
  • [50] Management of geo-distributed intelligence: Deep Insight as a Service (DINSaaS) on Forged Cloud Platforms (FCP)
    Kuru, Kaya
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2021, 149 : 103 - 118