Cardinality estimation with smoothing autoregressive models

被引:0
|
作者
Lin, Yuming [1 ]
Xu, Zejun [1 ]
Zhang, Yinghao [1 ]
Li, You [1 ]
Zhang, Jingwei [1 ]
机构
[1] Guilin Univ Elect Technol, Guangxi Key Lab Trusted Software, Jinji Rd, Guilin 541004, Guangxi, Peoples R China
基金
中国国家自然科学基金;
关键词
AI4DB; Cardinality estimation; Selectivity estimation; Autoregressive; Random smoothing; SELECTIVITY ESTIMATION; QUERY;
D O I
10.1007/s11280-023-01195-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cardinality estimation, which aims at accurately estimating the result size of queries, is a fundamental task in database query processing and optimization. One of the most recent and effective solutions to this problem is the use of deep autoregressive models to obtain joint probability distributions through unsupervised learning. However, due to the data sparsity, it is difficult for the estimator to accurately capture the actual distribution, which affects the accuracy of the cardinality estimation. In addition, autoregressive estimators' progressive sampling characteristics are prone to error propagation, which is more evident in high-dimensional data. To reduce the autoregressive cardinality estimation error and to obtain a better trade-off between estimate accuracy and latency, we propose a random smoothing autoregressive cardinality estimation model (SAM-CE), which uses a random smoothing technique combined with a deep autoregressive model to simplify the learning of joint probability distributions. A smooth progressive sampling method that is suitable for range queries is designed to improve the estimator accuracy by improving the sample quality. We conduct extensive experiments to demonstrate the effectiveness and performance of the proposed SAM-CE. The results show that SAM-CE achieves the state of the art effectiveness of cardinality estimation.
引用
收藏
页码:3441 / 3461
页数:21
相关论文
共 50 条