Monkey: Optimal Navigable Key-Value Store

被引:144
|
作者
Dayan, Niv [1 ]
Athanassoulis, Manos [1 ]
Idreos, Stratos [1 ]
机构
[1] Harvard Univ, Cambridge, MA 02138 USA
基金
美国国家科学基金会;
关键词
D O I
10.1145/3035918.3064054
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we show that key-value stores backed by an LSM-tree exhibit an intrinsic trade-off between lookup cost, update cost, and main memory footprint, yet all existing designs expose a suboptimal and difficult to tune trade-off among these metrics. We pinpoint the problem to the fact that all modern key-value stores sub-optimally co-tune the merge policy, the buffer size, and the Bloom filters' false positive rates in each level. We present Monkey, an LSM-based key-value store that strikes the optimal balance between the costs of updates and lookups with any given main memory budget. The insight is that worst-case lookup cost is proportional to the sum of the false positive rates of the Bloom filters across all levels of the LSM-tree. Contrary to state-of-the-art key-value stores that assign a fixed number of bits-per-element to all Bloom filters, Monkey allocates memory to filters across different levels so as to minimize this sum. We show analytically that Monkey reduces the asymptotic complexity of the worst-case lookup I/O cost, and we verify empirically using an implementation on top of LevelDB that Monkey reduces lookup latency by an increasing margin as the data volume grows (50% - 80% for the data sizes we experimented with). Furthermore, we map the LSM-tree design space onto a closed-form model that enables co-tuning the merge policy, the buffer size and the filters' false positive rates to trade among lookup cost, update cost and/or main memory, depending on the workload (proportion of lookups and updates), the dataset (number and size of entries), and the underlying hardware (main memory available, disk vs. flash). We show how to use this model to answer what-if design questions about how changes in environmental parameters impact performance and how to adapt the various LSM-tree design elements accordingly.
引用
收藏
页码:79 / 94
页数:16
相关论文
共 50 条
  • [1] Optimal Compression for Encrypted Key-Value Store in Cloud Systems
    Zhang, Chen
    Xie, Qingyuan
    Wang, Mingyue
    Guo, Yu
    Jia, Xiaohua
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2024, 73 (03) : 928 - 941
  • [2] FoundationDB: A Distributed Key-Value Store
    Zhou, Jingyu
    Xu, Meng
    Shraer, Alexander
    Namasivayam, Bala
    Miller, Alex
    Tschannen, Evan
    Atherton, Steve
    Beamon, Andrew J.
    Sears, Rusty
    Leach, John
    Rosenthal, Dave
    Dong, Xin
    Wilson, Will
    Collins, Ben
    Scherer, David
    Grieser, Alec
    Liu, Yang
    Moore, Alvin
    Muppana, Bhaskar
    Su, Xiaoge
    Yadav, Vishesh
    [J]. COMMUNICATIONS OF THE ACM, 2023, 66 (06) : 97 - 105
  • [3] RepKV: A Replicated Key-Value Store to Boost Multiple Indices for Key-Value Separation
    Tang, Chenlei
    Wan, Jiguang
    Tan, Zhihu
    Li, Guokuan
    [J]. 2022 IEEE 40TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD 2022), 2022, : 187 - 194
  • [4] HyperDex: A Distributed, Searchable Key-Value Store
    Escriva, Robert
    Wong, Bernard
    Sirer, Emin Guen
    [J]. ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2012, 42 (04) : 25 - 36
  • [5] Key-Value Store Implementations for Arduino Microcontrollers
    Fazackerley, Scott
    Huang, Eric
    Douglas, Graeme
    Kudlac, Raffi
    Lawrence, Ramon
    [J]. 2015 IEEE 28TH CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2015, : 158 - 164
  • [6] Crashproofing the Original NoSQL Key-Value Store
    Kelly T.
    [J]. Queue, 2021, 19 (04): : 5 - 18
  • [7] CaSSanDra: An SSD Boosted Key-Value Store
    Menon, Prashanth
    Rabl, Tilmann
    Sadoghi, Mohammad
    Jacobsen, Hans-Arno
    [J]. 2014 IEEE 30TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2014, : 1162 - 1167
  • [8] ZDB-High performance key-value store
    Thanh Nguyen Trung
    Minh Nguyen Hieu
    [J]. 2013 THIRD WORLD CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGIES (WICT), 2013, : 311 - 316
  • [9] AnnaBellaDB: Key-Value Store Made Cloud Native
    Szalay, Mark
    Matray, Peter
    Toka, Laszlo
    [J]. 2020 16TH INTERNATIONAL CONFERENCE ON NETWORK AND SERVICE MANAGEMENT (CNSM), 2020,
  • [10] Concerto: A High Concurrency Key-Value Store with Integrity
    Arasu, Arvind
    Eguro, Ken
    Kaushik, Raghav
    Kossmann, Donald
    Meng, Pingfan
    Pandey, Vineet
    Ramamurthy, Ravi
    [J]. SIGMOD'17: PROCEEDINGS OF THE 2017 ACM INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2017, : 251 - 266