Efficient parallel processing of range queries through replicated declustering

被引:17
|
作者
Ferhatosmanoglu, Hakan
Tosun, Ali Saman
Canahuate, Guadalupe [1 ]
Ramachandran, Aravind
机构
[1] Ohio State Univ, Dept Comp Sci & Engn, Columbus, OH 43210 USA
[2] Univ Texas, Dept Comp Sci, San Antonio, TX 78249 USA
[3] Microsoft Corp, Redmond, WA 98052 USA
基金
美国国家科学基金会;
关键词
declustering; replication; parallel access; range queries; periodic allocation; optimal parallel processing; replicated declustering;
D O I
10.1007/s10619-006-9362-5
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A common technique used to minimize I/O in data intensive applications is data declustering over parallel servers. This technique involves distributing data among several disks so as to parallelize query retrieval and thus, improve performance. We focus on optimizing access to large spatial data, and the most common type of queries on such data, i.e., range queries. An optimal declustering scheme is one in which the processing for all range queries is balanced uniformly among the available disks. It has been shown that single copy based declustering schemes are non-optimal for range queries. In this paper, we integrate replication in conjunction with parallel disk declustering for efficient processing of range queries. We note that replication is largely used in database applications for several purposes like load balancing, fault tolerance and availability of data. We propose theoretical foundations for replicated declustering and propose a class of replicated declustering schemes, periodic allocations, which are shown to be strictly optimal for a number of disks. We propose a framework for replicated declustering, using a limited amount of replication and provide extensions to apply it on real data, which include arbitrary grids and a large number of disks. Our framework also provides an effective indexing scheme that enables fast identification of data of interest in parallel servers. In addition to optimal processing of single queries, we show that this framework is effective for parallel processing of multiple queries. We present experimental results comparing the proposed replication scheme to other techniques for both single queries and multiple queries, on synthetic and real data sets.
引用
收藏
页码:117 / 147
页数:31
相关论文
共 50 条
  • [21] Efficient parallel processing of high-dimensional spatial kNN queries
    Tao Jiang
    Bin Zhang
    Dan Lin
    Yunjun Gao
    Qing Li
    [J]. Soft Computing, 2022, 26 : 12291 - 12316
  • [22] Efficient parallel processing of high-dimensional spatial kNN queries
    Jiang, Tao
    Zhang, Bin
    Lin, Dan
    Gao, Yunjun
    Li, Qing
    [J]. SOFT COMPUTING, 2022, 26 (22) : 12291 - 12316
  • [23] cGridex: Efficient processing of continuous range queries over moving objects
    Wang, XY
    Zhang, Q
    Sun, WW
    Wang, W
    Shi, BL
    [J]. ADVANCES IN WEB-AGE INFORMATION MANAGEMENT, PROCEEDINGS, 2005, 3739 : 345 - 356
  • [24] An Efficient Processing of Range Spatial Keyword Queries over Moving Objects
    Oh, Sujin
    Jung, HaRim
    Kim, Ung-Mo
    [J]. 2018 32ND INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING (ICOIN), 2018, : 525 - 530
  • [25] Efficient processing of narrow range queries in multi-dimensional data structures
    Kratky, Michal
    Snasel, Vaclav
    Pokorny, Jaroslav
    Zezula, Pavel
    [J]. 10TH INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM, PROCEEDINGS, 2006, : 69 - 79
  • [26] An efficient processing of range-MIN/MAX queries over data cube
    Kim, DW
    Lee, EJ
    Kim, MH
    Lee, YJ
    [J]. INFORMATION SCIENCES, 1998, 112 (1-4) : 223 - 237
  • [27] Efficient processing of continual range queries for location-aware mobile services
    Wu, KL
    Chen, SK
    Yu, PS
    [J]. INFORMATION SYSTEMS FRONTIERS, 2005, 7 (4-5) : 435 - 448
  • [28] Efficient Processing of Continual Range Queries for Location-Aware Mobile Services
    Kun-Lung Wu
    Shyh-Kwei Chen
    Philip S. Yu
    [J]. Information Systems Frontiers, 2005, 7 : 435 - 448
  • [29] Optimal parallel block access for range queries
    Chedid, FB
    [J]. TENTH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS, PROCEEDINGS, 2004, : 119 - 126
  • [30] Efficient Bundled Spatial Range Queries
    Zacharatou, Eleni Tzirita
    Sidlauskas, Darius
    Tauheed, Farhan
    Heinis, Thomas
    Ailamaki, Anastasia
    [J]. 27TH ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS (ACM SIGSPATIAL GIS 2019), 2019, : 139 - 148