On the Support of Versioning in Distributed Key-Value Stores

被引:9
|
作者
Felber, Pascal [1 ]
Pasin, Marcelo [1 ]
Riviere, Etienne [1 ]
Schiavoni, Valerio [1 ]
Sutra, Pierre [1 ]
Coelho, Fabio [2 ,3 ]
Oliveira, Rui [2 ,3 ]
Matos, Miguel [2 ,3 ]
Vilaca, Ricardo [2 ,3 ]
机构
[1] Univ Neuchatel, CH-2000 Neuchatel, Switzerland
[2] INESC TEC, HASLab, Oporto, Portugal
[3] U Minho, Oporto, Portugal
关键词
versioning; key-value store; listeners; CLOCKS;
D O I
10.1109/SRDS.2014.35
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The ability to access and query data stored in multiple versions is an important asset for many applications, such as Web graph analysis, collaborative editing platforms, data forensics, or correlation mining. The storage and retrieval of versioned data requires a specific API and support from the storage layer. The choice of the data structures used to maintain versioned data has a fundamental impact on the performance of insertions and queries. The appropriate data structure also depends on the nature of the versioned data and the nature of the access patterns. In this paper we study the design and implementation space for providing versioning support on top of a distributed key-value store (KVS). We define an API for versioned data access supporting multiple writers and show that a plain KVS does not offer the necessary synchronization power for implementing this API. We leverage the support for listeners at the KVS level and propose a general construction for implementing arbitrary types of data structures for storing and querying versioned data. We explore the design space of versioned data storage ranging from a flat data structure to a distributed sharded index. The resulting system, ALEPH, is implemented on top of an industrial-grade open-source KVS, Infinispan. Our evaluation, based on real-world Wikipedia access logs, studies the performance of each versioning mechanisms in terms of load balancing, latency and storage overhead in the context of different access scenarios.
引用
收藏
页码:95 / 104
页数:10
相关论文
共 50 条
  • [21] Multi-Client Searchable Encryption over Distributed Key-Value Stores
    Lin, Wanyu
    Yuan, Xu
    Li, Baochun
    Wang, Cong
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON SMART COMPUTING (SMARTCOMP), 2017, : 72 - 78
  • [22] A Proxy-based Query Aggregation Method for Distributed Key-Value Stores
    Kawaname, Daichi
    Kamoshita, Masanari
    Kawashima, Ryota
    Matsuo, Hiroshi
    [J]. 2018 IEEE 6TH INTERNATIONAL CONFERENCE ON FUTURE INTERNET OF THINGS AND CLOUD WORKSHOPS (W-FICLOUD 2018), 2018, : 78 - 83
  • [23] Characterizing and Adapting the Consistency-Latency Tradeoff in Distributed Key-Value Stores
    Rahman, Muntasir Raihan
    Tseng, Lewis
    Nguyen, Son
    Gupta, Indranil
    Vaidya, Nitin
    [J]. ACM TRANSACTIONS ON AUTONOMOUS AND ADAPTIVE SYSTEMS, 2017, 11 (04)
  • [24] A Design for Scalable and Secure Key-value Stores
    Chen, Longbin
    Dai, Wenyun
    Qiu, Meikang
    Jiang, Ning
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON SMART CLOUD (SMARTCLOUD), 2017, : 216 - 221
  • [25] Towards Elastic Key-value Stores on IaaS
    Li, Han
    [J]. 2013 IEEE 29TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDEW), 2013, : 302 - 305
  • [26] Integrity Verification in Cloud Key-Value Stores
    Weintraub, Grisha
    Rise, Leonid
    Kadosh, Alon
    [J]. PROCEEDINGS OF THE 15TH ACM INTERNATIONAL CONFERENCE ON SYSTEMS AND STORAGE, SYSTOR 2022, 2022, : 140 - 140
  • [27] Robust Data Sharing with Key-Value Stores
    Basescu, Cristina
    Cachin, Christian
    Eyal, Ittay
    Haas, Robert
    Sorniotti, Alessandro
    Vukolic, Marko
    Zachevsky, Ido
    [J]. 2012 42ND ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS (DSN), 2012,
  • [28] Privacy-Preserving Similarity Search With Efficient Updates in Distributed Key-Value Stores
    Lin, Wanyu
    Cui, Helei
    Li, Baochun
    Wang, Cong
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2021, 32 (05) : 1072 - 1084
  • [29] A scheduling framework for distributed key-value stores and its application to tail latency minimization
    Ben Mokhtar, Sonia
    Canon, Louis-Claude
    Dugois, Anthony
    Marchal, Loris
    Riviere, Etienne
    [J]. JOURNAL OF SCHEDULING, 2024, 27 (02) : 183 - 202
  • [30] Distributed Time Series Similarity Search Method Based on Key-value Data Stores
    Yu Z.-S.
    Li R.-Y.
    Guo Y.
    Jiang Z.-Y.
    Bao J.
    Zheng Y.
    [J]. Ruan Jian Xue Bao/Journal of Software, 2022, 33 (03): : 950 - 967