LTG-LSM: The Optimal Structure in LSM-tree Combined with Reading Hotness

被引:2
|
作者
Yu, JiaPing [1 ]
Chen, HuaHui [1 ]
Qian, JiangBo [1 ]
Dong, YiHong [1 ]
机构
[1] Ningbo Univ, Dept Informat Sci & Technol, Ningbo, Peoples R China
基金
中国国家自然科学基金;
关键词
LSM-tree; KV Store; Storage Management; Hotness Prediction; Big Data;
D O I
10.1109/ICPADS51040.2020.00011
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
A growing number of KV storage systems have adopted the Log-Structured-Merge-tree (LSM-tree) due to its excellent write performance. However, the high write amplification in the LSM-tree has always been a difficult problem to solve. The reason is that the design of traditional LSM-tree under-utilizes the data distribution of query, and the design space does not take into account the read and write performance concurrently. As a result, we may sacrifice one to improve another performance. When advancing the writing performance of the LSM-tree, we can only conservatively select the design pattern in the design space to reduce the impact on reading throughputs, resulting in limited improvement. Aiming at the shortcomings of existing methods, a new LSM-tree structure (Leveling-Tiering-Grouped-LSM-tree, LTG-LSM) is proposed by us that combined with reading hotness. The LTG-LSM structure maintains hotness prediction models at each level of the LSM-tree. The structure of the newly generated disk components is determined by the predicted hotness. Finally, a specific compaction algorithm is carried out to handle the compaction between the different structural components and processing workflow hotness changes. Experiments show that the scheme proposed by this paper significantly reduces the write amplification (up to about 71%) of the original LSM-tree with almost no sacrificing reading performance and improves the write throughputs (up to about 24%) in workflows with different configurations.
引用
收藏
页码:1 / 8
页数:8
相关论文
共 50 条
  • [1] HotKey-LSM: A Hotness-Aware LSM-Tree for Big Data Storage
    Wang, Yi
    Jin, Peiquan
    Wan, Shouhong
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 5849 - 5851
  • [2] Pipelined Compaction for the LSM-tree
    Zhang, Zigang
    Yue, Yinliang
    He, Bingsheng
    Xiong, Jin
    Chen, Mingyu
    Zhang, Lixin
    Sun, Ninghui
    2014 IEEE 28TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM, 2014,
  • [3] dCompaction: Delayed Compaction for the LSM-Tree
    Pan, Fengfeng
    Yue, Yinliang
    Xiong, Jin
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2017, 45 (06) : 1310 - 1325
  • [4] An LSM-Tree Index for Spatial Data
    He, Junjun
    Chen, Huahui
    ALGORITHMS, 2022, 15 (04)
  • [5] dCompaction: Delayed Compaction for the LSM-Tree
    Fengfeng Pan
    Yinliang Yue
    Jin Xiong
    International Journal of Parallel Programming, 2017, 45 : 1310 - 1325
  • [6] A LLC-Friendly LSM-tree
    Cai, Tao
    Dai, Jianfei
    Niu, Dejiao
    Li, Lei
    Huang, Zeyu
    Ni, Qiangqiang
    2024 IEEE 35TH INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS, ASAP 2024, 2024, : 243 - 244
  • [7] Spooky: Granulating LSM-Tree Compactions Correctly
    Dayan, Niv
    Weiss, Tamar
    Dashevsky, Shmuel
    Pan, Michael
    Bortnikov, Edward
    Twitto, Moshe
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2022, 15 (11): : 3071 - 3084
  • [8] Chucky: A Succinct Cuckoo Filter for LSM-Tree
    Dayan, Niv
    Twitto, Moshe
    SIGMOD '21: PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2021, : 365 - 378
  • [9] BoLT: Barrier-optimized LSM-Tree
    Kim, Dongui
    Park, Chanyeol
    Lee, Sang-Won
    Nam, Beomseok
    PROCEEDINGS OF THE 2020 21ST INTERNATIONAL MIDDLEWARE CONFERENCE (MIDDLEWARE '20), 2020, : 119 - 133
  • [10] An Improved LSM-Tree Index for NoSQL Data
    Dai, Lin
    Fu, Jia
    Feng, Chong
    COMPUTER SCIENCE AND TECHNOLOGY (CST2016), 2017, : 525 - 532