Multi-dimensional histograms with tight bounds for the error

被引:0
|
作者
Baltrunas, Linas
Mazeika, Arturas
Bohlen, Michael
机构
来源
10TH INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM, PROCEEDINGS | 2006年
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Histograms are being used as non-parametric selectivity estimators for one-dimensional data. For highdimensional data it is common to either compute one-dimensional histograms for each attribute or to compute a multi-dimensional equi-width histogram for a set of attributes. This either yields small low-quality or large high-quality histograms. In this paper we introduce HIRED (HIgh-dimensional histograms with dimensionality REDuction): small high-quality histograms for multi-dimensional data. H I RED histograms are adaptive, and they are based on the shape error and directional splits. The shape error permits a precise control of the estimation error of the histogram and, together with directional splits, yields a memory complexity that does not depend on the number of uniform attributes in the dataset. We provide extensive experimental results with synthetic and real world datasets. The experiments confirm that our method is as precise as state-of-the-art techniques and uses orders of magnitude less memory.
引用
收藏
页码:105 / 112
页数:8
相关论文
共 50 条
  • [21] New bounds on the capacity of multi-dimensional RLL-constrained systems
    Schwartz, M
    Vardy, A
    APPLIED ALGEBRA, ALGEBRAIC ALGORITHMS AND ERROR-CORRECTING CODES, PROCEEDINGS, 2006, 3857 : 225 - 234
  • [22] BERRY-ESSEEN BOUNDS FOR MULTI-DIMENSIONAL CENTRAL LIMIT THEOREM
    BHATTACHARYA, RN
    BULLETIN OF THE AMERICAN MATHEMATICAL SOCIETY, 1968, 74 (02) : 285 - +
  • [23] Multi-dimensional Rankings, Program Termination, and Complexity Bounds of Flowchart Programs
    Alias, Christophe
    Darte, Alain
    Feautrier, Paul
    Gonnord, Laure
    STATIC ANALYSIS, 2010, 6337 : 117 - +
  • [24] mEEC: A Novel Error Estimation Code with Multi-Dimensional Feature
    Zhang, Zhenghao
    Kumar, Piyush
    IEEE INFOCOM 2017 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS, 2017,
  • [25] ASYMPOTIC ERROR ESTIMATES FOR MULTI-DIMENSIONAL CUBATURES . PRELIMINARY REPORT
    WIXOM, JA
    NOTICES OF THE AMERICAN MATHEMATICAL SOCIETY, 1970, 17 (01): : 245 - &
  • [26] Multi-Dimensional Bounds on Time-of-Arrival Statistics in Random Scattering Channels
    Khoa N. Le
    Wireless Personal Communications, 2011, 57 : 195 - 205
  • [27] Multi-Dimensional Bounds on Time-of-Arrival Statistics in Random Scattering Channels
    Le, Khoa N.
    WIRELESS PERSONAL COMMUNICATIONS, 2011, 57 (02) : 195 - 205
  • [28] Reversible watermarking based on multi-dimensional prediction-error expansion
    Xiang Yu
    Xiang Wang
    Qingqi Pei
    Multimedia Tools and Applications, 2018, 77 : 18085 - 18104
  • [29] Reversible watermarking based on multi-dimensional prediction-error expansion
    Yu, Xiang
    Wang, Xiang
    Pei, Qingqi
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (14) : 18085 - 18104
  • [30] Interpolation between multi-dimensional histograms using a new non-linear moment morphing method
    Baak, M.
    Gadatsch, S.
    Harrington, R.
    Verkerke, W.
    NUCLEAR INSTRUMENTS & METHODS IN PHYSICS RESEARCH SECTION A-ACCELERATORS SPECTROMETERS DETECTORS AND ASSOCIATED EQUIPMENT, 2015, 771 : 39 - 48