Performance Evaluation of Spatial Data Management Systems Using GeoSpark

被引:3
|
作者
Shin, Hansub [1 ]
Lee, Kisung [2 ]
Kwon, Hyuk-Yoon [1 ]
机构
[1] Seoul Natl Univ Sci & Technol, Dept Ind & Syst Engn, Seoul, South Korea
[2] Louisiana State Univ, Div Comp Sci & Engn, Baton Rouge, LA 70803 USA
基金
新加坡国家研究基金会;
关键词
Large-scale spatial data; GeoSpark; Performance evaluation; Distributed environments; BIG DATA;
D O I
10.1109/BigComp48618.2020.00-75
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper, we evaluate the performance of spatial data management systems in distributed computing environments. Given that GeoSpark outperforms other spatial systems in many scenarios as reported in several studies, we choose spatial data management systems using GeoSpark for this evaluation. Even though GeoSpark supports various storage engines as its underlying data store, the effects of the storage engines for spatial data processing have not been well studied. To address this limitation, we evaluate the performance of GeoSpark using two underlying data stores: 1) HDFS and 2) MongoDB. We first design and build distributed experimental environments based on Amazon EC2 and EMR using up to 10 nodes. Through the extensive experiments on three synthetic and real data sets, we show that the overall performance of both HDFS- and MongoDB-based GeoSpark improves as we increase the number of nodes. We also show that HDFS-based GeoSpark generally outperforms MongoDB-based GeoSpark, especially for large-scale data sets. In addition, we demonstrate that the proper use of caching on HDFS-based GeoSpark can improve the overall query processing performance by up to three orders of magnitude.
引用
收藏
页码:197 / 200
页数:4
相关论文
共 50 条
  • [11] A STUDY OF SPATIAL DATA MANAGEMENT AND ANALYSIS SYSTEMS
    CHRISTOPHER, C
    GALLE, R
    AUTO CARTO 9 : NINTH INTERNATIONAL SYMPOSIUM ON COMPUTER-ASSISTED CARTOGRAPHY, 1989, : 648 - 654
  • [12] A Performance Study of Big Spatial Data Systems
    Alam, Md Mahbub
    Ray, Suprio
    Bhavsar, Virendra C.
    BIGSPATIAL 2018: PROCEEDINGS OF THE 7TH ACM SIGSPATIAL INTERNATIONAL WORKSHOP ON ANALYTICS FOR BIG GEOSPATIAL DATA (BIGSPATIAL-2018), 2018, : 1 - 9
  • [13] ON EVALUATION OF PERFORMANCE MANAGEMENT SYSTEMS MATURITY
    Isaev, Dmitry
    BIZNES INFORMATIKA-BUSINESS INFORMATICS, 2014, 27 (01): : 42 - 51
  • [14] Understanding management data systems for enterprise performance management
    Bose, Ranjit
    INDUSTRIAL MANAGEMENT & DATA SYSTEMS, 2006, 106 (1-2) : 43 - 59
  • [15] Evaluation of Data Management Systems for Geospatial Big Data
    Amirian, Pouria
    Basiri, Anahid
    Winstanley, Adam
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2014, PT V, 2014, 8583 : 678 - +
  • [16] Considering spatial effects in the evaluation of joint environmental and cost performance of municipal waste management systems
    Sarraf, Alessandro
    Mazzocchitti, Marialisa
    Nissi, Eugenia
    Quaglione, Davide
    ECOLOGICAL INDICATORS, 2019, 106
  • [17] DATABASE MANAGEMENT SYSTEMS FOR SPATIAL DATA STRUCTURES - AN OVERVIEW
    Imbroane, A. L. M.
    Bucur, L.
    GEOGRAPHIA TECHNICA, 2007, 2 (01): : 32 - 42
  • [18] Spatial data management systems: Mapping semantic distance
    Cribbin, T
    Westerman, SJ
    HUMAN-COMPUTER INTERACTION - INTERACT '99, 1999, : 171 - 178
  • [19] A performance evaluation framework for association mining in spatial data
    Wang, Qiang
    Megalooikonomou, Vasileios
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2010, 35 (03) : 465 - 494
  • [20] A performance evaluation framework for association mining in spatial data
    Qiang Wang
    Vasileios Megalooikonomou
    Journal of Intelligent Information Systems, 2010, 35 : 465 - 494