Fast similarity join for multi-dimensional data

被引:13
|
作者
Kalashnikov, Dmitri V.
Prabhakar, Sunil
机构
[1] Univ Calif Irvine, Dept Comp Sci, Irvine, CA 92697 USA
[2] Purdue Univ, Dept Comp Sci, W Lafayette, IN 47907 USA
基金
美国国家科学基金会;
关键词
similarity join; grid-based joins;
D O I
10.1016/j.is.2005.07.002
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The efficient processing of multidimensional similarity joins is important for a large class of applications. The dimensionality of the data for these applications ranges from low to high. Most existing methods have focused on the execution of high-dimensional joins over large amounts of disk-based data. The increasing sizes of main memory available on current computers, and the need for efficient processing of spatial joins suggest that spatial joins for a large class of problems can be processed in main memory. In this paper, we develop two new in-memory spatial join algorithms, the Grid-join and EGO*-join, and study their performance. Through evaluation, we explore the domain of applicability of each approach and provide recommendations for the choice of a join algorithm depending upon the dimensionality of the data as well as the expected selectivity of the join. We show that the two new proposed join techniques substantially outperform the state-of-the-art join algorithm, the EGO-join. (C) 2005 Elsevier B.V. All rights reserved.
引用
收藏
页码:160 / 177
页数:18
相关论文
共 50 条
  • [31] A data forest: Multi-dimensional visualization
    Jamieson, Ronan
    Alexandrov, Vassil
    11TH INTERNATIONAL CONFERENCE INFORMATION VISUALIZATION, 2007, : 293 - +
  • [32] Multi-dimensional aggregation for temporal data
    Bohen, Michael
    Gamper, Johann
    Jensen, Christian S.
    ADVANCES IN DATABASE TECHNOLOGY - EDBT 2006, 2006, 3896 : 257 - 275
  • [33] MULTI-DIMENSIONAL INVERSION OF SEISMIC DATA
    FOSTER, DJ
    MOSHER, CC
    INVERSE PROBLEMS, 1988, 4 (01) : 71 - 85
  • [34] Multi-Dimensional Dynamic Time Warping for Image Texture Similarity
    de Mello, Rodrigo Fernandes
    Gondra, Iker
    ADVANCES IN ARTIFICIAL INTELLIGENCE - SBIA 2008, PROCEEDINGS, 2008, 5249 : 23 - +
  • [35] Multi-Dimensional Scaling of Sparse Block Diagonal Similarity Matrix
    Imaizumi, Tadashi
    DATA SCIENCE: INNOVATIVE DEVELOPMENTS IN DATA ANALYSIS AND CLUSTERING, 2017, : 259 - 272
  • [36] Indexing expensive functions for efficient multi-dimensional similarity search
    Chen, Hanxiong
    Liu, Jianquan
    Furuse, Kazutaka
    Yu, Jeffrey Xu
    Ohbo, Nobuo
    KNOWLEDGE AND INFORMATION SYSTEMS, 2011, 27 (02) : 165 - 192
  • [37] Content Aware Music Analysis with Multi-Dimensional Similarity Measure
    Wohlfahrt-Laymann, Jan
    Heimburger, Anneli
    INFORMATION MODELLING AND KNOWLEDGE BASES XXVIII, 2017, 292 : 303 - 313
  • [38] A unified similarity coefficient for navigating through multi-dimensional information
    Tudhope, D
    Taylor, C
    ASIS '96 - PROCEEDINGS OF THE 59TH ASIS ANNUAL MEETING, VOL 33, 1996: GLOBAL COMPLEXITY: INFORMATION, CHAOS AND CONTROL, 1996, 33 : 67 - 70
  • [39] Indexing expensive functions for efficient multi-dimensional similarity search
    Hanxiong Chen
    Jianquan Liu
    Kazutaka Furuse
    Jeffrey Xu Yu
    Nobuo Ohbo
    Knowledge and Information Systems, 2011, 27 : 165 - 192
  • [40] A unified similarity coefficient for navigating through multi-dimensional information
    Tudhope, D
    Taylor, C
    PROCEEDINGS OF THE ASIS ANNUAL MEETING, 1996, 33 : 67 - 70