Cardinality Estimation of Approximate Substring Queries using Deep Learning

被引:5
|
作者
Kwon, Suyong [1 ]
Jung, Woohwan [2 ]
Shim, Kyuseok [1 ]
机构
[1] Seoul Natl Univ, Elect & Comp Engn, Seoul, South Korea
[2] Hanyang Univ, Comp Sci & Engn, Seoul, South Korea
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2022年 / 15卷 / 11期
基金
新加坡国家研究基金会;
关键词
SELECTIVITY ESTIMATION;
D O I
10.14778/3551793.3551859
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cardinality estimation of an approximate substring query is an important problem in database systems. Traditional approaches build a summary from the text data and estimate the cardinality using the summary with some statistical assumptions. Since deep learning models can learn underlying complex data patterns effectively, they have been successfully applied and shown to outperform traditional methods for cardinality estimations of queries in database systems. However, since they are not yet applied to approximate substring queries, we investigate a deep learning approach for cardinality estimation of such queries. Although the accuracy of deep learning models tends to improve as the train data size increases, producing a large train data is computationally expensive for cardinality estimation of approximate substring queries. Thus, we develop efficient train data generation algorithms by avoiding unnecessary computations and sharing common computations. We also propose a deep learning model as well as a novel learning method to quickly obtain an accurate deep learning-based estimator. Extensive experiments confirm the superiority of our data generation algorithms and deep learning model with the novel learning method.
引用
下载
收藏
页码:3145 / 3157
页数:13
相关论文
共 50 条
  • [31] PostCENN: PostgreSQL with Machine Learning Models for Cardinality Estimation
    Woltmann, Lucas
    Olwig, Dominik
    Hartmann, Claudio
    Habich, Dirk
    Lehner, Wolfgang
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2021, 14 (12): : 2715 - 2718
  • [32] Survey of cardinality estimation techniques based on machine learning
    Yue W.
    Qu W.
    Lin K.
    Wang X.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2024, 61 (02): : 413 - 427
  • [33] Automating localized learning for cardinality estimation based on XGBoost
    Feng, Jieming
    Li, Zhanhuai
    Chen, Qun
    Liu, Hailong
    KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (07) : 3825 - 3854
  • [34] Fauce: Fast and Accurate Deep Ensembles with Uncertainty for Cardinality Estimation
    Liu, Jie
    Dong, Wenqian
    Zhou, Qingqing
    Li, Dong
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2021, 14 (11): : 1950 - 1963
  • [35] Multimodal Vigilance Estimation Using Deep Learning
    Wu, Wei
    Sun, Wei
    Wu, Q. M. Jonathan
    Yang, Yimin
    Zhang, Hui
    Zheng, Wei-Long
    Lu, Bao-Liang
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (05) : 3097 - 3110
  • [36] Deep Learning for Age Estimation Using EfficientNet
    Aruleba, Idowu
    Viriri, Serestina
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, IWANN 2021, PT I, 2021, 12861 : 407 - 419
  • [37] Formant Estimation and Tracking using Deep Learning
    Dissen, Yehoshua
    Keshet, Joseph
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 958 - 962
  • [38] Chicken Weight Estimation Using Deep Learning
    Sutapun, Boonsong
    Sampanporn, Lawan
    APPLICATIONS OF MACHINE LEARNING 2023, 2023, 12675
  • [39] Networks cardinality estimation using order statistics
    Lucchese, Riccardo
    Varagnolo, Damiano
    2015 AMERICAN CONTROL CONFERENCE (ACC), 2015, : 3810 - 3817
  • [40] Understanding Cardinality Estimation Using Entropy Maximization
    Re, Christopher
    Suciu, Dan
    ACM TRANSACTIONS ON DATABASE SYSTEMS, 2012, 37 (01):