Approximate Query Processing Based on Approximate Materialized View

被引:0
|
作者
Wu, Yuhan [1 ]
Guo, Haifeng [1 ]
Yang, Donghua [1 ]
Li, Mengmeng [1 ]
Zheng, Bo [2 ]
Wang, Hongzhi [1 ]
机构
[1] Harbin Inst Technol, Harbin, Peoples R China
[2] ConDB, Beijing, Peoples R China
关键词
Approximate materialized view; Materialized views reuse; AQP plus plus optimization; Approximate query processing; GROUP-BY;
D O I
10.1007/978-981-97-0801-7_10
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the context of big data, the interactive analysis database system needs to answer aggregate queries within a reasonable response time. The proposed AQP++ framework can integrate data preprocessing and AQP. It connects existing AQP engine with data preprocessing method to complete the connection between them in the process of interaction analysis. After the research on the application of materialized views in AQP++ framework, it is found that the materialized views used in the two parts of the framework both come from the accurate results of precomputation, so there's still a time bottleneck under large scale data. Based on such limitations, we proposed to use approximate materialized views for subsequent results reuse. We take the method of identifying approximate interval as an example, compared the improvement of AQP++ by using approximate materialized view, and trying different sampling methods to find better time and accurate performance results. By constructed larger samples, we compared the differences of time, space and accuracy between approximate and general materialized views in AQP++, and analyzed the reasons for the poor performance in some cases of our methods. Based on the experimental results, it proved that the use of approximate materialized view can improve the AQP++ framework, it effectively save time and storage space in the preprocessing stage, and obtain the accuracy similar to or better than the general AQP results as well.
引用
收藏
页码:168 / 185
页数:18
相关论文
共 50 条
  • [31] Database Native Approximate Query Processing Based on Machine-Learning
    Duan, Yang
    Zhang, Yong
    Wu, Jiacheng
    WEB INFORMATION SYSTEMS AND APPLICATIONS (WISA 2021), 2021, 12999 : 74 - 86
  • [32] Distributed and Parallel Processing of Location based spatial query with Approximate Transformation
    Priya, M.
    Kalpana, R.
    2017 NINTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING (ICOAC), 2017, : 334 - 338
  • [33] Approximate query processing for a content-based image retrieval method
    Kwan, PWH
    Toraichi, K
    Kitagawa, H
    Kameyama, K
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2003, 2736 : 517 - 526
  • [34] An Efficient Materialized View Selection Approach for Query Processing in Database Management
    Kardel, P. P.
    Thakare, V. M.
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2010, 10 (09): : 26 - 33
  • [35] DeepSPACE: Approximate Geospatial Query Processing with Deep Learning
    Vorona, Dimitri
    Kipf, Andreas
    Neumann, Thomas
    Kemper, Alfons
    27TH ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS (ACM SIGSPATIAL GIS 2019), 2019, : 500 - 503
  • [36] Exploiting Embedded Synopsis for Exact and Approximate Query Processing
    Yuasa, Hiroki
    Goda, Kazuo
    Kitsuregawa, Masaru
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2022, PT II, 2022, 13427 : 235 - 240
  • [37] SUM-optimal histograms for approximate query processing
    Meifan Zhang
    Hongzhi Wang
    Jianzhong Li
    Hong Gao
    Knowledge and Information Systems, 2020, 62 : 3155 - 3180
  • [38] Approximate range query processing in spatial network databases
    Haidar AL-Khalidi
    Zainab Abbas
    Maytham Safar
    Multimedia Systems, 2013, 19 : 151 - 161
  • [39] Compressed data cube for approximate OLAP query processing
    Yu Feng
    Shan Wang
    Journal of Computer Science and Technology, 2002, 17 : 625 - 635
  • [40] SUM-optimal histograms for approximate query processing
    Zhang, Meifan
    Wang, Hongzhi
    Li, Jianzhong
    Gao, Hong
    KNOWLEDGE AND INFORMATION SYSTEMS, 2020, 62 (08) : 3155 - 3180