Spatial Join Query Processing in Cloud: Analyzing Design Choices and Performance Comparisons

被引:14
|
作者
You, Simin [1 ]
Zhang, Jianting [2 ]
Gruenwald, Le [3 ]
机构
[1] CUNY Grad Ctr, Dept Comp Sci, New York, NY 10016 USA
[2] CUNY City Coll, Dept Comp Sci, New York, NY 10031 USA
[3] Univ Oklahoma, Dept Comp Sci, Norman, OK 73019 USA
来源
2015 44TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING WORKSHOPS | 2015年
关键词
Spatial Join; Query Processing; Cloud Computing; Design; Performance;
D O I
10.1109/ICPPW.2015.41
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Data volumes of GPS recorded locations and many other types of geospatial data are fast increasing. Processing large-scale spatial joins in Cloud for performance and scalability is becoming increasingly popular. In this study, we compare three leading Cloud-based spatial data management systems, namely HadoopGIS, SpatialHadoop and SpatialSpark, both conceptually through analysis of design choices and empirically through experiments using real world datasets. Using both a workstation serving as a single-node cluster and up to 10 nodes Amazon EC2 clusters, the results show that the combined factors, including Cloud platforms, data access models and the underlying geometry libraries, have significant impacts in their realized performance. While SpatialHadoop generally wins on robustness, SpatialSpark is the clear winner of efficiency due to in-memory processing.
引用
收藏
页码:90 / 97
页数:8
相关论文
共 39 条
  • [31] Improving the performance of location based spatial textual query processing using distributed strip index
    M. Priya
    R. Kalpana
    Spatial Information Research, 2019, 27 : 565 - 571
  • [32] Improving the performance of location based spatial textual query processing using distributed strip index
    Priya, M.
    Kalpana, R.
    SPATIAL INFORMATION RESEARCH, 2019, 27 (05) : 565 - 571
  • [33] High-Performance Spatial Query Processing on Big Taxi Trip Data using GPGPUs
    Zhang, Jianting
    You, Simin
    Gruenwald, Le
    2014 IEEE INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS), 2014, : 72 - 79
  • [34] Design and Implementation of Spatial Operators and Energy-Efficient Query Processing Strategy in Wireless Sensor Network Database System
    Lim, Chong Sok
    Lee, Jeong-Hoon
    Park, Minjee
    Hyun, Soon J.
    INTERNATIONAL JOURNAL OF DISTRIBUTED SENSOR NETWORKS, 2015,
  • [35] Preliminary Performance Testing of Geo-spatial Image Parallel Processing in the Mobile Cloud Computing Service
    Kang, Sanggoo
    Lee, Kiwon
    Kim, Yongseung
    KOREAN JOURNAL OF REMOTE SENSING, 2012, 28 (04) : 467 - 475
  • [36] A New Design of High-Performance Large-Scale GIS Computing at a Finer Spatial Granularity: A Case Study of Spatial Join with Spark for Sustainability
    Zhang, Feng
    Zhou, Jingwei
    Liu, Renyi
    Du, Zhenhong
    Ye, Xinyue
    SUSTAINABILITY, 2016, 8 (09):
  • [37] A Performance Evaluation of a Geo-Spatial Image Processing Service Based on Open Source PaaS Cloud Computing Using Cloud Foundry on OpenStack
    Lee, Kiwon
    Kim, Kwangseob
    REMOTE SENSING, 2018, 10 (08):
  • [38] Design and Implementation of Geo-spatial Image Processing System Using OGC WPS 2.0 and Web Framework on Openstack Cloud
    Yoon, Gooseon
    Kim, Wangseob
    Lee, Kiwon
    2016 4RTH INTERNATIONAL WORKSHOP ON EARTH OBSERVATION AND REMOTE SENSING APPLICATIONS (EORSA), 2016,
  • [39] Towards Sustainable High-Performance Transaction Processing in Cloud-based DBMS: Design considerations and optimization for transaction processing performance in service-oriented DBMS organization
    Sul W.
    Yeom H.Y.
    Jung H.
    Cluster Computing, 2019, 22 (01) : 135 - 145