Workload-Aware Cache Management of Bitmap Indices

被引：0

作者：

Kaeppel, Julia ^{[1
]}

Sawin, Jason ^{[2
]}

Chiu, David ^{[1
]}

机构：

[1] Univ Puget Sound, Math & Comp Sci, Tacoma, WA 98416 USA

[2] Univ St Thomas, Comp & Informat Sci, St Paul, MN USA

来源：

PROCEEDINGS OF THE IEEE/ACM 10TH INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING, APPLICATIONS AND TECHNOLOGIES, BDCAT 2023 | 2023年

关键词：

D O I：

10.1145/3632366.3632386

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Big-data management systems must handle multiple concurrent queries over multi-dimensional data sets. To achieve high throughput, such systems could implement various techniques to avoid redundant computations and data fetches. One such approach is to cache a subset of the query results and reuse these results to (partially) fulfill future query requests. This approach can be quite effective for query-at-a-time processing. However, we suspect that even greater performance is being left on the table if queries are only optimized in isolation, and that higher throughput can be extracted through a systematic examination of the relationships between queries in a given workload. This paper describes a framework that captures inter-query relationships to reveal increased opportunities to exploit caching. We present a heuristic used for scheduling queries and a novel workload-informed cache replacement policy. When these methods are applied in combination, our system is able to extract impressive speedup of the total execution time of batches of queries, using only modest cache sizes. In this paper we show that the proposed replacement algorithm easily outstrips the performance of the classic algorithms FIFO and LRU. Under certain conditions, our system was able to achieve roughly 2 to 4 time speedup over these traditional replacement schemes.

引用

页数：10

共 50 条

[21] ChewAnalyzer: Workload-Aware Data Management Across Differentiated Storage Pools
Ge, Xiongzi
Xie, Xuchao
Du, David H. C.
Ganesan, Pradeep
Hahn, Dennis
2018 IEEE 26TH INTERNATIONAL SYMPOSIUM ON MODELING, ANALYSIS, AND SIMULATION OF COMPUTER AND TELECOMMUNICATION SYSTEMS (MASCOTS), 2018, : 94 - 101
[22] A Throughput-Oriented NVMe Storage Virtualization With Workload-Aware Management
Yang, Ming
Peng, Bo
Yao, Jianguo
Guan, Haibing
IEEE TRANSACTIONS ON COMPUTERS, 2021, 70 (12) : 2112 - 2124
[23] Workload-aware Resource Management for Energy Efficient Heterogeneous Docker Containers
Kang, Dong-Ki
Choi, Gyu-Beom
Kim, Seong-Hwan
Hwang, Il-Sun
Youn, Chan-Hyun
PROCEEDINGS OF THE 2016 IEEE REGION 10 CONFERENCE (TENCON), 2016, : 2428 - 2431
[24] WARP: Workload-Aware Replication and Partitioning for RDF
Hose, Katja
Schenkel, Ralf
2013 IEEE 29TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDEW), 2013, : 1 - 6
[25] FORESEER: Workload-aware Data Storage for MapReduce
Zou, Jia
Shi, Juwei
Liu, Tongping
Cao, Zhao
Wang, Chen
2015 IEEE 35th International Conference on Distributed Computing Systems, 2015, : 746 - 747
[26] Workload-aware anomaly detection for Web applications
Wang, Tao
Wei, Jun
Zhang, Wenbo
Zhong, Hua
Huang, Tao
JOURNAL OF SYSTEMS AND SOFTWARE, 2014, 89 : 19 - 32
[27] Workload-Aware Performance Tuning for Autonomous DBMSs
Yan, Zhengtong
Lu, Jiaheng
Chainani, Naresh
Lin, Chunbin
2021 IEEE 37TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2021), 2021, : 2365 - 2368
[28] Workload-Aware Live Storage Migration for Clouds
Zheng, Jie
Ng, T. S. Eugene
Sripanidkulchai, Kunwadee
ACM SIGPLAN NOTICES, 2011, 46 (07) : 133 - 144
[29] Flexible workload-aware clustering of XML documents
Bordawekar, R
Shmueli, O
DATABASE AND XML TECHNOLOGIES, PROCEEDINGS, 2004, 3186 : 204 - 218
[30] DROP: A Workload-Aware Optimizer for Dimensionality Reduction
Suri, Sahaana
Bailis, Peter
PROCEEDINGS OF THE 3RD INTERNATIONAL WORKSHOP ON DATA MANAGEMENT FOR END-TO-END MACHINE LEARNING, DEEM 2019, 2019,

← 1 2 3 4 5 →