Optimal Data-Space Partitioning of Spatial Data for Parallel I/O

Authors
Hakan Ferhatosmanoğlu
Divyakant Agrawal
Ömer Eğecioğlu
Amr El Abbadi
Affiliations
[1] Ohio State University, Computer Science and Engineering
[2] University of California Santa Barbara, Computer Science
Keywords
data-space partitioning; two-dimensional data; parallel I/O; disk and page allocation; range query
DOI
10.1023/B:JMSE.0000043449.85576.da
Abstract
It is desirable to design partitioning methods that minimize the I/O time incurred during query execution in spatial databases. This paper explores optimal partitioning for two-dimensional data for a class of queries and develops multi-disk allocation techniques that maximize the degree of I/O parallelism obtained in each case. We show that hexagonal partitioning has optimal I/O performance for circular queries among all partitioning methods that use convex non-overlapping regions. An analysis and extension of this result to all possible partitioning techniques is also given. For rectangular queries, we show that hexagonal partitioning has overall better I/O performance for a general class of range queries, except for rectilinear queries, in which case rectangular grid partitioning is superior. By using current algorithms for rectangular grid partitioning, parallel storage and retrieval algorithms for hexagonal partitioning can be constructed. Some of these results carry over to circular partitioning of the data—which is an example of a non-convex region.
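To make the idea of hexagonal partitioning with multi-disk allocation concrete, the following is a minimal sketch and not the allocation scheme developed in the paper. It assumes a pointy-top hexagonal grid addressed by axial coordinates (q, r), and a hypothetical modular rule, (q + 3r) mod 7, that places a cell and its six neighbors on seven different disks. The function names, the cell size, and the choice of seven disks are all illustrative assumptions.

```python
import math

# Axial offsets of the six neighbors of a hexagonal cell.
NEIGHBORS = [(1, 0), (-1, 0), (0, 1), (0, -1), (1, -1), (-1, 1)]


def point_to_hex(x, y, size=1.0):
    """Return axial (q, r) of the pointy-top hexagonal cell containing (x, y).

    `size` is the distance from a cell's center to one of its corners.
    """
    # Fractional axial coordinates of the point.
    qf = (math.sqrt(3) / 3 * x - y / 3) / size
    rf = (2 * y / 3) / size
    sf = -qf - rf  # third cube coordinate, so that q + r + s == 0

    # Cube rounding: round all three, then repair the coordinate with the
    # largest rounding error so the constraint q + r + s == 0 holds again.
    q, r, s = round(qf), round(rf), round(sf)
    dq, dr, ds = abs(q - qf), abs(r - rf), abs(s - sf)
    if dq > dr and dq > ds:
        q = -r - s
    elif dr > ds:
        r = -q - s
    return q, r


def hex_center(q, r, size=1.0):
    """Return the (x, y) center of the cell with axial coordinates (q, r)."""
    return (size * math.sqrt(3) * (q + r / 2), size * 1.5 * r)


def disk_for_cell(q, r, num_disks=7):
    """Hypothetical allocation rule: map a cell to a disk number.

    With num_disks == 7, a cell and its six neighbors land on distinct disks.
    """
    return (q + 3 * r) % num_disks


def disks_touched(q, r, num_disks=7):
    """Disks holding a cell and its six neighbors (a small 'circular' query)."""
    cells = [(q, r)] + [(q + dq, r + dr) for dq, dr in NEIGHBORS]
    return {disk_for_cell(cq, cr, num_disks) for cq, cr in cells}
```

Under these assumptions, a small query that touches one cell and its immediate ring reads each of its seven pages from a different disk, which is the kind of I/O parallelism the abstract refers to.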
Pages: 75-101
Page count: 26