Cardinality Estimation of Approximate Substring Queries using Deep Learning

被引:5
|
作者
Kwon, Suyong [1 ]
Jung, Woohwan [2 ]
Shim, Kyuseok [1 ]
机构
[1] Seoul Natl Univ, Elect & Comp Engn, Seoul, South Korea
[2] Hanyang Univ, Comp Sci & Engn, Seoul, South Korea
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2022年 / 15卷 / 11期
基金
新加坡国家研究基金会;
关键词
SELECTIVITY ESTIMATION;
D O I
10.14778/3551793.3551859
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cardinality estimation of an approximate substring query is an important problem in database systems. Traditional approaches build a summary from the text data and estimate the cardinality using the summary with some statistical assumptions. Since deep learning models can learn underlying complex data patterns effectively, they have been successfully applied and shown to outperform traditional methods for cardinality estimations of queries in database systems. However, since they are not yet applied to approximate substring queries, we investigate a deep learning approach for cardinality estimation of such queries. Although the accuracy of deep learning models tends to improve as the train data size increases, producing a large train data is computationally expensive for cardinality estimation of approximate substring queries. Thus, we develop efficient train data generation algorithms by avoiding unnecessary computations and sharing common computations. We also propose a deep learning model as well as a novel learning method to quickly obtain an accurate deep learning-based estimator. Extensive experiments confirm the superiority of our data generation algorithms and deep learning model with the novel learning method.
引用
下载
收藏
页码:3145 / 3157
页数:13
相关论文
共 50 条
  • [21] Cardinality Estimation in a Virtualized Network Device Using Online Machine Learning
    Cohen, Reuven
    Nezri, Yuval
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2019, 27 (05) : 2098 - 2110
  • [22] Approximate Rewriting of Queries Using Views
    Afrati, Foto
    Chandrachud, Manik
    Chirkova, Rada
    Mitra, Prasenjit
    ADVANCES IN DATABASES AND INFORMATION SYSTEMS, PROCEEDINGS, 2009, 5739 : 164 - +
  • [23] Learning exact enumeration and approximate estimation in deep neural network models
    Creatore, Celestino
    Sabathiel, Silvester
    Solstad, Trygve
    COGNITION, 2021, 215
  • [24] Age estimation using deep learning
    Zaghbani, Soumaya
    Boujneh, Noureddine
    Bouhlel, Med Salim
    COMPUTERS & ELECTRICAL ENGINEERING, 2018, 68 : 337 - 347
  • [25] Unsupervised Phase Retrieval Using Deep Approximate MMSE Estimation
    Chen, Mingqin
    Lin, Peikang
    Quan, Yuhui
    Pang, Tongyao
    Ji, Hui
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2022, 70 : 2239 - 2252
  • [26] On Efficient Approximate Queries over Machine Learning Models
    Ding, Dujian
    Amer-Yahia, Sihem
    Lakshmanan, Laks
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2022, 16 (04): : 918 - 931
  • [27] Cardinality estimation using normalizing flow
    Wang, Jiayi
    Chai, Chengliang
    Liu, Jiabin
    Li, Guoliang
    VLDB JOURNAL, 2024, 33 (02): : 323 - 348
  • [28] Cardinality estimation using normalizing flow
    Jiayi Wang
    Chengliang Chai
    Jiabin Liu
    Guoliang Li
    The VLDB Journal, 2024, 33 (2) : 323 - 348
  • [29] LAF: A Local Depth Autoregressive Framework for Cardinality Estimation of Multi-attribute Queries
    Cheng, Qianwen
    Li, Hao
    Wang, Dawei
    Zhang, Yue
    Peng, Zhaohui
    WEB AND BIG DATA, PT III, APWEB-WAIM 2023, 2024, 14333 : 296 - 311
  • [30] Approximate Cardinality Estimation (ACE) in large-scale Internet of Things deployments
    Cao, Qing
    Feng, Yunhe
    Lu, Zheng
    Qi, Hairong
    Tolbert, Leon M.
    Wan, Lipeng
    Wang, Zhibo
    Zhou, Wenjun
    AD HOC NETWORKS, 2017, 66 : 52 - 63