AUTOPLACER: Scalable Self-Tuning Data Placement in Distributed Key-Value Stores

被引:27
|
作者
Paiva, Joao [1 ]
Ruivo, Pedro [2 ]
Romano, Paolo [1 ]
Rodrigues, Luis [1 ]
机构
[1] Univ Lisbon, Inst Super Tecn, INESC ID, P-1000029 Lisbon, Portugal
[2] Red Hat Inc, London, England
关键词
Performance; Distributed data management; data placement; probabilistic algorithms; machine learning;
D O I
10.1145/2641573
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article addresses the problem of self-tuning the data placement in replicated key-value stores. The goal is to automatically optimize replica placement in a way that leverages locality patterns in data accesses, such that internode communication is minimized. To do this efficiently is extremely challenging, as one needs not only to find lightweight and scalable ways to identify the right assignment of data replicas to nodes but also to preserve fast data lookup. The article introduces new techniques that address these challenges. The first challenge is addressed by optimizing, in a decentralized way, the placement of the objects generating the largest number of remote operations for each node. The second challenge is addressed by combining the usage of consistent hashing with a novel data structure, which provides efficient probabilistic data placement. These techniques have been integrated in a popular open-source key-value store. The performance results show that the throughput of the optimized system can be six times better than a baseline system employing the widely used static placement based on consistent hashing.
引用
收藏
页数:30
相关论文
共 50 条
  • [21] Self-tuning Eventually-Consistent Data Stores
    Chatterjee, Shankha
    Golab, Wojciech
    STABILIZATION, SAFETY, AND SECURITY OF DISTRIBUTED SYSTEMS, SSS 2017, 2018, 10616 : 78 - 92
  • [22] Secure Multi-Client Data Access with Boolean Queries in Distributed Key-Value Stores
    Yuan, Xu
    Yuan, Xingliang
    Li, Baochun
    Wang, Cong
    2017 IEEE CONFERENCE ON COMMUNICATIONS AND NETWORK SECURITY (CNS), 2017, : 245 - 253
  • [23] Balancing Distributed Key-Value Stores with Efficient In-Network Redirecting
    Shi, Yang
    Fei, Jiawei
    Wen, Mei
    Zhang, Chunyuan
    ELECTRONICS, 2019, 8 (09)
  • [24] A Distributed Quorum System for Ensuring Bounded Staleness of Key-Value Stores
    Horii, Hiroshi
    Enoki, Miki
    Onodera, Tamiya
    WEB-AGE INFORMATION MANAGEMENT, WAIM 2014, 2014, 8485 : 84 - 95
  • [25] Harnessing Correlations in Distributed Erasure-Coded Key-Value Stores
    Ali, Ramy E.
    Cadambe, Viveck R.
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2019, 67 (09) : 5907 - 5920
  • [26] Brief Announcement: Atomic Consistency and Partition Tolerance in Scalable Key-Value Stores
    Arad, Cosmin
    Shafaat, Tallat M.
    Haridi, Self
    DISTRIBUTED COMPUTING, DISC 2012, 2012, 7611 : 445 - 446
  • [27] A novel object placement protocol for minimizing the average response time of get operations in distributed key-value stores
    Makris, Antonios
    Tserpes, Konstantinos
    Anagnostopoulos, Dimosthenis
    2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 3196 - 3205
  • [28] Crowdsourced Data Integrity Verification for Key-Value Stores in the Cloud
    Weintraub, Grisha
    Gudes, Ehud
    2017 17TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2017, : 498 - 503
  • [29] Brief Announcement: Robust Data Sharing with Key-Value Stores
    Basescu, Cristina
    Cachin, Christian
    Eyal, Ittay
    Haas, Robert
    Vukolic, Marko
    PODC 11: PROCEEDINGS OF THE 2011 ACM SYMPOSIUM PRINCIPLES OF DISTRIBUTED COMPUTING, 2011, : 221 - 222
  • [30] AMS: Adaptive Multiget Scheduling Algorithm for Distributed Key-Value Stores
    Jiang, Wanchun
    Qiu, Yujia
    Ji, Fa
    Zhang, Yongjia
    Zhou, Xiangqian
    Wang, Jianxin
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2023, 11 (03) : 2591 - 2602