Cardinality Estimation of Approximate Substring Queries using Deep Learning

被引:5
|
作者
Kwon, Suyong [1 ]
Jung, Woohwan [2 ]
Shim, Kyuseok [1 ]
机构
[1] Seoul Natl Univ, Elect & Comp Engn, Seoul, South Korea
[2] Hanyang Univ, Comp Sci & Engn, Seoul, South Korea
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2022年 / 15卷 / 11期
基金
新加坡国家研究基金会;
关键词
SELECTIVITY ESTIMATION;
D O I
10.14778/3551793.3551859
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cardinality estimation of an approximate substring query is an important problem in database systems. Traditional approaches build a summary from the text data and estimate the cardinality using the summary with some statistical assumptions. Since deep learning models can learn underlying complex data patterns effectively, they have been successfully applied and shown to outperform traditional methods for cardinality estimations of queries in database systems. However, since they are not yet applied to approximate substring queries, we investigate a deep learning approach for cardinality estimation of such queries. Although the accuracy of deep learning models tends to improve as the train data size increases, producing a large train data is computationally expensive for cardinality estimation of approximate substring queries. Thus, we develop efficient train data generation algorithms by avoiding unnecessary computations and sharing common computations. We also propose a deep learning model as well as a novel learning method to quickly obtain an accurate deep learning-based estimator. Extensive experiments confirm the superiority of our data generation algorithms and deep learning model with the novel learning method.
引用
下载
收藏
页码:3145 / 3157
页数:13
相关论文
共 50 条
  • [41] A Tool for Internet-Scale Cardinality Estimation of XPath Queries over Distributed Semistructured Data
    Slavov, Vasil
    Katib, Anas
    Rao, Praveen
    2014 IEEE 30TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2014, : 1270 - 1273
  • [42] Understanding Cardinality Estimation using Entropy Maximization
    Re, Christopher
    Suciu, Dan
    PODS 2010: PROCEEDINGS OF THE TWENTY-NINTH ACM SIGMOD-SIGACT-SIGART SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS, 2010, : 53 - 64
  • [43] Estimating the Cardinality of Conjunctive Queries over RDF Data Using Graph Summarisation
    Stefanoni, Giorgio
    Motik, Boris
    Kostylev, Egor V.
    WEB CONFERENCE 2018: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW2018), 2018, : 1043 - 1052
  • [44] Approximate Shortest Path Queries Using Voronoi Duals
    Honiden, Shinichi
    Houle, Michael E.
    Sommer, Christian
    Wolff, Martin
    TRANSACTIONS ON COMPUTATIONAL SCIENCE IX, 2010, 6290 : 28 - 53
  • [45] Approximate and Situated Causality in Deep Learning
    Vallverdu, Jordi
    PHILOSOPHIES, 2020, 5 (01)
  • [46] Monocular Depth Estimation Using Deep Learning: A Review
    Masoumian, Armin
    Rashwan, Hatem A.
    Cristiano, Julian
    Asif, M. Salman
    Puig, Domenec
    SENSORS, 2022, 22 (14)
  • [47] Customer Gaze Estimation in Retail Using Deep Learning
    Senarath, Shashimal
    Pathirana, Primesh
    Meedeniya, Dulani
    Jayarathna, Sampath
    IEEE Access, 2022, 10 : 64904 - 64919
  • [48] Speed Estimation Using Deep Learning with Optical Flow
    Mukai, Nobuhiko
    Nishimura, Naoki
    Chang, Youngha
    INTERNATIONAL WORKSHOP ON ADVANCED IMAGING TECHNOLOGY, IWAIT 2024, 2024, 13164
  • [49] Emotion Estimation Using EEG with Deep learning Networks
    Vynatheya, Marrapu
    Subha, D. P.
    2022 IEEE 19TH INDIA COUNCIL INTERNATIONAL CONFERENCE, INDICON, 2022,
  • [50] Crop production estimation using deep learning technique
    Marndi, Ashapurna
    Ramesh, K., V
    Patra, G. K.
    CURRENT SCIENCE, 2021, 121 (08): : 1073 - 1079