Spatial Join Query Processing in Cloud: Analyzing Design Choices and Performance Comparisons

被引:14
|
作者
You, Simin [1 ]
Zhang, Jianting [2 ]
Gruenwald, Le [3 ]
机构
[1] CUNY Grad Ctr, Dept Comp Sci, New York, NY 10016 USA
[2] CUNY City Coll, Dept Comp Sci, New York, NY 10031 USA
[3] Univ Oklahoma, Dept Comp Sci, Norman, OK 73019 USA
关键词
Spatial Join; Query Processing; Cloud Computing; Design; Performance;
D O I
10.1109/ICPPW.2015.41
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Data volumes of GPS recorded locations and many other types of geospatial data are fast increasing. Processing large-scale spatial joins in Cloud for performance and scalability is becoming increasingly popular. In this study, we compare three leading Cloud-based spatial data management systems, namely HadoopGIS, SpatialHadoop and SpatialSpark, both conceptually through analysis of design choices and empirically through experiments using real world datasets. Using both a workstation serving as a single-node cluster and up to 10 nodes Amazon EC2 clusters, the results show that the combined factors, including Cloud platforms, data access models and the underlying geometry libraries, have significant impacts in their realized performance. While SpatialHadoop generally wins on robustness, SpatialSpark is the clear winner of efficiency due to in-memory processing.
引用
收藏
页码:90 / 97
页数:8
相关论文
共 39 条
  • [1] Large-Scale Spatial Join Query Processing in Cloud
    You, Simin
    Zhang, Jianting
    Gruenwald, Le
    2015 13TH IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDEW), 2015, : 34 - 41
  • [2] Parallel spatial join query processing
    Liu, Yu
    Sun, Li
    Tian, Yong-Qing
    Shanghai Jiaotong Daxue Xuebao/Journal of Shanghai Jiaotong University, 2002, 36 (04): : 512 - 515
  • [3] Surrounding Join Query Processing in Spatial Databases
    Li, Lingxiao
    Taniar, David
    Indrawan-Santiago, Maria
    Shao, Zhou
    DATABASES THEORY AND APPLICATIONS, ADC 2017, 2017, 10538 : 17 - 28
  • [4] An efficient progressive spatial Join query processing algorithm
    Tang, Gui-Fen
    Yang, Wei-Feng
    Huang, Shuang-Lin
    Li, Wei
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2009, 37 (02): : 318 - 324
  • [5] A Boundary Filtering Based Spatial Join Query Processing Optimization Algorithm
    Qiao, Baiyou
    Zhu, Junhai
    Shen, Muchuan
    Chen, Yang
    2015 12TH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (FSKD), 2015, : 1764 - 1769
  • [6] Dynamic spatial index for efficient query processing on the cloud
    Ibrahim Kamel
    Ayesha M. Talha
    Zaher Al Aghbari
    Journal of Cloud Computing, 6
  • [7] Facilitating Secure and Efficient Spatial Query Processing on the Cloud
    Talha, Ayesha
    Kamel, Ibrahim
    Al Aghbari, Zaher
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2019, 7 (04) : 988 - 1001
  • [8] Dynamic spatial index for efficient query processing on the cloud
    Kamel, Ibrahim
    Talha, Ayesha M.
    Al Aghbari, Zaher
    JOURNAL OF CLOUD COMPUTING-ADVANCES SYSTEMS AND APPLICATIONS, 2017, 6
  • [9] Optimizing Communication for Multi-Join Query Processing in Cloud Data Warehouses
    Kurunji, Swathi
    Ge, Tingjian
    Fu, Xinwen
    Liu, Benyuan
    Chen, Cindy X.
    INTERNATIONAL JOURNAL OF GRID AND HIGH PERFORMANCE COMPUTING, 2013, 5 (04) : 113 - 130
  • [10] A performance evaluation of spatial join processing strategies
    Papadopoulos, A
    Rigaux, P
    Scholl, M
    ADVANCES IN SPATIAL DATABASES, 1999, 1651 : 286 - 307