AUTOPLACER: Scalable Self-Tuning Data Placement in Distributed Key-Value Stores

被引:27
|
作者
Paiva, Joao [1 ]
Ruivo, Pedro [2 ]
Romano, Paolo [1 ]
Rodrigues, Luis [1 ]
机构
[1] Univ Lisbon, Inst Super Tecn, INESC ID, P-1000029 Lisbon, Portugal
[2] Red Hat Inc, London, England
关键词
Performance; Distributed data management; data placement; probabilistic algorithms; machine learning;
D O I
10.1145/2641573
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article addresses the problem of self-tuning the data placement in replicated key-value stores. The goal is to automatically optimize replica placement in a way that leverages locality patterns in data accesses, such that internode communication is minimized. To do this efficiently is extremely challenging, as one needs not only to find lightweight and scalable ways to identify the right assignment of data replicas to nodes but also to preserve fast data lookup. The article introduces new techniques that address these challenges. The first challenge is addressed by optimizing, in a decentralized way, the placement of the objects generating the largest number of remote operations for each node. The second challenge is addressed by combining the usage of consistent hashing with a novel data structure, which provides efficient probabilistic data placement. These techniques have been integrated in a popular open-source key-value store. The performance results show that the throughput of the optimized system can be six times better than a baseline system employing the widely used static placement based on consistent hashing.
引用
收藏
页数:30
相关论文
共 50 条
  • [1] An adaptive replica placement approach for distributed key-value stores
    Costa Filho, Jose S.
    Cavalcante, Denis M.
    Moreira, Leonardo O.
    Machado, Javam C.
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2020, 32 (11):
  • [2] Scalable Versioning for Key-Value Stores
    Haeusler, Martin
    DATA: PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON DATA MANAGEMENT TECHNOLOGIES AND APPLICATIONS, 2016, : 79 - 86
  • [3] A Design for Scalable and Secure Key-value Stores
    Chen, Longbin
    Dai, Wenyun
    Qiu, Meikang
    Jiang, Ning
    2017 IEEE INTERNATIONAL CONFERENCE ON SMART CLOUD (SMARTCLOUD), 2017, : 216 - 221
  • [4] GeoWave: Utilizing Distributed Key-Value Stores for Multidimensional Data
    Whitby, Michael A.
    Fecher, Rich
    Bennight, Chris
    ADVANCES IN SPATIAL AND TEMPORAL DATABASES, SSTD 2017, 2017, 10411 : 105 - 122
  • [5] Scalable Transactions across Heterogeneous NoSQL Key-Value Data Stores
    Dey, Akon
    Fekete, Alan
    Roehm, Uwe
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2013, 6 (12): : 1434 - 1439
  • [6] A Correlation-Aware Data Placement Strategy for Key-Value Stores
    Vilaca, Ricardo
    Oliveira, Rui
    Pereira, Jose
    DISTRIBUTED APPLICATIONS AND INTEROPERABLE SYSTEMS, 2011, 6723 : 214 - 227
  • [7] Parallax: Hybrid Key-Value Placement in LSM-based Key-Value Stores
    Xanthakis, Giorgos
    Saloustros, Giorgos
    Batsaras, Nikos
    Papagiannis, Anastasios
    Bilas, Angelos
    PROCEEDINGS OF THE 2021 ACM SYMPOSIUM ON CLOUD COMPUTING (SOCC '21), 2021, : 305 - 318
  • [8] On the Support of Versioning in Distributed Key-Value Stores
    Felber, Pascal
    Pasin, Marcelo
    Riviere, Etienne
    Schiavoni, Valerio
    Sutra, Pierre
    Coelho, Fabio
    Oliveira, Rui
    Matos, Miguel
    Vilaca, Ricardo
    2014 IEEE 33RD INTERNATIONAL SYMPOSIUM ON RELIABLE DISTRIBUTED SYSTEMS (SRDS), 2014, : 95 - 104
  • [9] Distributed Data Load Balancing for Scalable Key-Value Cache Systems
    Chen, Shanshan
    Zhou, Xudong
    Zhou, Guiping
    Sinnott, Richard O.
    ADVANCED COMPUTER ARCHITECTURE, 2018, 908 : 181 - 194
  • [10] Evaluation of Key-Value Stores for Distributed Locking Purposes
    Grzesik, Piotr
    Mrozek, Dariusz
    BEYOND DATABASES, ARCHITECTURES AND STRUCTURES (BDAS): PAVING THE ROAD TO SMART DATA PROCESSING AND ANALYSIS, 2019, 1018 : 70 - 81