Multi-dimensional histograms with tight bounds for the error

Cited by: 0
Authors
Baltrunas, Linas
Mazeika, Arturas
Bohlen, Michael
Source
10TH INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM, PROCEEDINGS | 2006
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Histograms are being used as non-parametric selectivity estimators for one-dimensional data. For high-dimensional data it is common to either compute one-dimensional histograms for each attribute or to compute a multi-dimensional equi-width histogram for a set of attributes. This either yields small low-quality or large high-quality histograms. In this paper we introduce HIRED (HIgh-dimensional histograms with dimensionality REDuction): small high-quality histograms for multi-dimensional data. HIRED histograms are adaptive, and they are based on the shape error and directional splits. The shape error permits a precise control of the estimation error of the histogram and, together with directional splits, yields a memory complexity that does not depend on the number of uniform attributes in the dataset. We provide extensive experimental results with synthetic and real-world datasets. The experiments confirm that our method is as precise as state-of-the-art techniques and uses orders of magnitude less memory.
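The trade-off between the two baseline approaches named in the abstract can be illustrated with a small sketch. This is not the HIRED algorithm, whose shape error and directional splits are not specified in this record; it only contrasts per-attribute one-dimensional equi-width histograms (combined under an attribute-independence assumption) with a single multi-dimensional equi-width histogram. All function names, the synthetic correlated dataset, the bucket count, and the range query are assumptions chosen for illustration.

```python
import random

def bucket_index(x, lo, hi, bins):
    """Map x in [lo, hi] to an equi-width bucket index in [0, bins - 1]."""
    i = int((x - lo) / (hi - lo) * bins)
    return min(max(i, 0), bins - 1)

def build_1d(points, dim, bins, lo=0.0, hi=1.0):
    """One-dimensional equi-width histogram over attribute `dim`."""
    counts = [0] * bins
    for p in points:
        counts[bucket_index(p[dim], lo, hi, bins)] += 1
    return counts

def build_md(points, bins, lo=0.0, hi=1.0):
    """Multi-dimensional equi-width histogram, stored sparsely as a dict."""
    counts = {}
    for p in points:
        key = tuple(bucket_index(x, lo, hi, bins) for x in p)
        counts[key] = counts.get(key, 0) + 1
    return counts

def overlap(i, bins, qlo, qhi, lo=0.0, hi=1.0):
    """Fraction of bucket i that the query interval [qlo, qhi] covers."""
    width = (hi - lo) / bins
    b_lo = lo + i * width
    b_hi = b_lo + width
    return max(0.0, min(b_hi, qhi) - max(b_lo, qlo)) / width

def estimate_1d(hists, n, query, bins):
    """Selectivity as the product of per-attribute estimates (independence)."""
    sel = 1.0
    for counts, (qlo, qhi) in zip(hists, query):
        sel *= sum(c * overlap(i, bins, qlo, qhi) for i, c in enumerate(counts)) / n
    return sel

def estimate_md(hist, n, query, bins):
    """Selectivity from joint buckets, assuming uniformity inside each bucket."""
    total = 0.0
    for key, c in hist.items():
        frac = 1.0
        for i, (qlo, qhi) in zip(key, query):
            frac *= overlap(i, bins, qlo, qhi)
        total += c * frac
    return total / n

# Hypothetical dataset: two strongly correlated attributes, y close to x.
random.seed(0)
points = []
for _ in range(10000):
    x = random.random()
    y = min(1.0, max(0.0, x + random.uniform(-0.05, 0.05)))
    points.append((x, y))

bins = 8
n = len(points)
query = [(0.0, 0.25), (0.75, 1.0)]  # small x together with large y

hists_1d = [build_1d(points, d, bins) for d in range(2)]
hist_md = build_md(points, bins)

true_sel = sum(
    1 for p in points
    if all(qlo <= v <= qhi for v, (qlo, qhi) in zip(p, query))
) / n
est_1d = estimate_1d(hists_1d, n, query, bins)
est_md = estimate_md(hist_md, n, query, bins)

# Bucket budget: d * bins for the 1-D histograms, bins ** d for a dense grid.
buckets_1d = 2 * bins
buckets_md_dense = bins ** 2

print(true_sel, est_1d, est_md, buckets_1d, buckets_md_dense)
```

On this correlated data the query region is empty, yet the per-attribute histograms multiply two non-zero marginal fractions and overestimate the selectivity, while the joint histogram does not. The price is a bucket count that grows as `bins ** d` for a dense grid instead of `d * bins`, which is the small low-quality versus large high-quality trade-off the abstract describes.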
Pages: 105-112
Number of pages: 8