Self-spatial join selectivity estimation using fractal concepts

被引:22
|
作者
Belussi, A
Faloutsos, C
机构
[1] Politecn Milan, Dipartimento Elettr & Informaz, I-20133 Milan, Italy
[2] Univ Maryland, Dept Comp Sci, College Pk, MD 20742 USA
[3] Univ Maryland, Syst Res Inst, College Pk, MD 20742 USA
关键词
algorithms; theory; fractal dimension; range query; selectivity estimation; spatial join;
D O I
10.1145/279339.279342
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The problem of selectivity estimation for queries of nontraditional databases is still an open issue. In this article, we examine the problem of selectivity estimation for some types of spatial queries in databases containing real data. We have shown earlier [Faloutsos and Kamel 1994] that real point sets typically have a nonuniform distribution, violating consistently the uniformity and independence assumptions. Moreover, we demonstrated that the theory of fractals can help to describe real point sets. In this article we show how the concept of fractal dimension, i.e., (noninteger) dimension, can lead to the solution for the selectivity estimation problem in spatial databases. Among the infinite family of fractal dimensions, we consider here the Hausdorff fractal dimension D(0) and the "Correlation" fractal dimension D(2). Specifically, we show that (a) the average number of neighbors for a given point set follows a power law, with Da as exponent, and (b) the average number of nonempty range queries follows a power law with E - D(0) as exponent (E is the dimension of the embedding space). We present the formulas to estimate the selectivity for "biased" range queries, for self-spatial joins, and for the average number of nonempty range queries. The result of some experiments on real and synthetic point sets are shown. Our formulas achieve very low relative errors, typically about 10%, versus 40%-100% of the formulas that are based on the uniformity and independence assumptions.
引用
收藏
页码:161 / 201
页数:41
相关论文
共 50 条
  • [41] Similarity Join Size Estimation using Locality Sensitive Hashing
    Lee, Hongrae
    Ng, Raymond T.
    Shim, Kyuseok
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2011, 4 (06): : 338 - 349
  • [42] Describing some properties of adenylat kinase using fractal concepts
    Isvoran, A
    CHAOS SOLITONS & FRACTALS, 2004, 19 (01) : 141 - 145
  • [43] APPLICATION OF FRACTAL DIMENSION THEORY FOR THE ESTIMATION OF SPATIAL VARIABILITY OF SOIL PARAMETERS
    Misciorak, Michal
    Janik, Grzegorz
    JOURNAL OF MINING INSTITUTE, 2007, 170 (02): : 107 - 109
  • [44] Spatial Join Optimization among WFSs Based on Recursive Partitioning and Filtering Rate Estimation
    Lan, Guiwen
    Wu, Congcong
    Shi, Guangyi
    Chen, Qi
    Yang, Zhao
    INTERNATIONAL CONFERENCE ON INTELLIGENT EARTH OBSERVING AND APPLICATIONS 2015, 2015, 9808
  • [45] Selectivity estimation for joins using systematic sampling
    Harangsri, B
    Shepherd, J
    Ngu, A
    EIGHTH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 1997, : 384 - 389
  • [46] Euler plus plus : Improved Selectivity Estimation for Rectangular Spatial Records
    Siddique, A. B.
    Eldawy, Ahmed
    Hristidis, Vagelis
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 4129 - 4133
  • [47] An efficient selectivity estimation method for spatial query optimization with topological relationships
    CHUNG Warn ill
    CHOI Jun ho
    BAE Hae young
    重庆邮电学院学报(自然科学版), 2004, (05) : 113 - 120
  • [48] Generic cumulative annular bucket histogram for spatial selectivity estimation of spatial database management system
    Cheng, Changxiu
    Song, Xiaomei
    Zhou, Chenghu
    INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE, 2013, 27 (02) : 339 - 362
  • [49] USING A SPATIAL SYSTEM FOR TEACHING OPERANT CONCEPTS
    KIEWRA, KA
    DUBOIS, NF
    TEACHING OF PSYCHOLOGY, 1992, 19 (01) : 43 - 44
  • [50] Spatial Relations Using High Level Concepts
    Corcoran, Padraig
    Mooney, Peter
    Bertolotto, Michela
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2012, 1 (03): : 333 - 350