Workload-Aware Cache Management of Bitmap Indices

被引:0
|
作者
Kaeppel, Julia [1 ]
Sawin, Jason [2 ]
Chiu, David [1 ]
机构
[1] Univ Puget Sound, Math & Comp Sci, Tacoma, WA 98416 USA
[2] Univ St Thomas, Comp & Informat Sci, St Paul, MN USA
关键词
D O I
10.1145/3632366.3632386
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Big-data management systems must handle multiple concurrent queries over multi-dimensional data sets. To achieve high throughput, such systems could implement various techniques to avoid redundant computations and data fetches. One such approach is to cache a subset of the query results and reuse these results to (partially) fulfill future query requests. This approach can be quite effective for query-at-a-time processing. However, we suspect that even greater performance is being left on the table if queries are only optimized in isolation, and that higher throughput can be extracted through a systematic examination of the relationships between queries in a given workload. This paper describes a framework that captures inter-query relationships to reveal increased opportunities to exploit caching. We present a heuristic used for scheduling queries and a novel workload-informed cache replacement policy. When these methods are applied in combination, our system is able to extract impressive speedup of the total execution time of batches of queries, using only modest cache sizes. In this paper we show that the proposed replacement algorithm easily outstrips the performance of the classic algorithms FIFO and LRU. Under certain conditions, our system was able to achieve roughly 2 to 4 time speedup over these traditional replacement schemes.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] ChewAnalyzer: Workload-Aware Data Management Across Differentiated Storage Pools
    Ge, Xiongzi
    Xie, Xuchao
    Du, David H. C.
    Ganesan, Pradeep
    Hahn, Dennis
    2018 IEEE 26TH INTERNATIONAL SYMPOSIUM ON MODELING, ANALYSIS, AND SIMULATION OF COMPUTER AND TELECOMMUNICATION SYSTEMS (MASCOTS), 2018, : 94 - 101
  • [22] A Throughput-Oriented NVMe Storage Virtualization With Workload-Aware Management
    Yang, Ming
    Peng, Bo
    Yao, Jianguo
    Guan, Haibing
    IEEE TRANSACTIONS ON COMPUTERS, 2021, 70 (12) : 2112 - 2124
  • [23] Workload-aware Resource Management for Energy Efficient Heterogeneous Docker Containers
    Kang, Dong-Ki
    Choi, Gyu-Beom
    Kim, Seong-Hwan
    Hwang, Il-Sun
    Youn, Chan-Hyun
    PROCEEDINGS OF THE 2016 IEEE REGION 10 CONFERENCE (TENCON), 2016, : 2428 - 2431
  • [24] WARP: Workload-Aware Replication and Partitioning for RDF
    Hose, Katja
    Schenkel, Ralf
    2013 IEEE 29TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDEW), 2013, : 1 - 6
  • [25] FORESEER: Workload-aware Data Storage for MapReduce
    Zou, Jia
    Shi, Juwei
    Liu, Tongping
    Cao, Zhao
    Wang, Chen
    2015 IEEE 35th International Conference on Distributed Computing Systems, 2015, : 746 - 747
  • [26] Workload-aware anomaly detection for Web applications
    Wang, Tao
    Wei, Jun
    Zhang, Wenbo
    Zhong, Hua
    Huang, Tao
    JOURNAL OF SYSTEMS AND SOFTWARE, 2014, 89 : 19 - 32
  • [27] Workload-Aware Performance Tuning for Autonomous DBMSs
    Yan, Zhengtong
    Lu, Jiaheng
    Chainani, Naresh
    Lin, Chunbin
    2021 IEEE 37TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2021), 2021, : 2365 - 2368
  • [28] Workload-Aware Live Storage Migration for Clouds
    Zheng, Jie
    Ng, T. S. Eugene
    Sripanidkulchai, Kunwadee
    ACM SIGPLAN NOTICES, 2011, 46 (07) : 133 - 144
  • [29] Flexible workload-aware clustering of XML documents
    Bordawekar, R
    Shmueli, O
    DATABASE AND XML TECHNOLOGIES, PROCEEDINGS, 2004, 3186 : 204 - 218
  • [30] DROP: A Workload-Aware Optimizer for Dimensionality Reduction
    Suri, Sahaana
    Bailis, Peter
    PROCEEDINGS OF THE 3RD INTERNATIONAL WORKSHOP ON DATA MANAGEMENT FOR END-TO-END MACHINE LEARNING, DEEM 2019, 2019,