GISQF: An Efficient Spatial Query Processing System

被引:17
|
作者
Al-Naami, Khaled Mohammed [1 ]
Seker, Sadi [2 ]
Khan, Latifur [1 ]
机构
[1] Univ Texas Dallas, Dept Comp Sci, Dallas, TX 75083 USA
[2] Istanbul Medeniyet Univ, Dept Business, Istanbul, Turkey
基金
美国国家科学基金会;
关键词
MAPREDUCE;
D O I
10.1109/CLOUD.2014.96
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Collecting observations from all international news coverage and using TABARI software to code events, the Global Database of Event, Language, and Tone (GDELT) is the only global political georeferenced event dataset with 250+ million observations covering all countries in the world from January 1, 1979 to the present with daily updates. The purpose of this widely used dataset is to help understand and uncover spatial, temporal and perceptual trends and behaviors of the social and international system. To query such big geospatial data, traditional RDBMS can no longer be used and the need for parallel distributed solutions has become a necessity. MapReduce paradigm has proved to be a scalable platform to process and analyze Big Data in the cloud. Hadoop as an implementation of MapReduce is an open source application that has been widely used and accepted in academia and industry. However, when dealing with Spatial Data, Hadoop is not equipped well and falls short as it doesnt perform efficiently in terms of running time. SpatialHadoop is an extension of Hadoop with the support of spatial data. In this paper, we present Geographic Information System Querying Framework (GISQF) to process Massive Spatial Data. This framework has been built on top of the open source SpatialHadoop system which exploits two-layer spatial indexing techniques to speed up query processing. We show how this solution outperforms Hadoop query processing by orders of magnitude when applying queries on GDELT dataset with a size of 60 GB. We show the results for three types of queries, Longitude Latitude Point queries, Circle-Area queries, and Aggregation queries.
引用
收藏
页码:681 / 688
页数:8
相关论文
共 50 条
  • [31] Exploiting location-aware social networks for efficient spatial query processing
    Liang Tang
    Haiquan Chen
    Wei-Shinn Ku
    Min-Te Sun
    GeoInformatica, 2017, 21 : 33 - 55
  • [32] Using pre-aggregation for efficient spatial query processing in sensor environments
    Park, SY
    Bae, HY
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING IDEAL 2005, PROCEEDINGS, 2005, 3578 : 25 - 31
  • [33] Efficient Continuously Moving Top-K Spatial Keyword Query Processing
    Wu, Dingming
    Yiu, Man Lung
    Jensen, Christian S.
    Cong, Gao
    IEEE 27TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2011), 2011, : 541 - 552
  • [34] S-GRID: A versatile approach to efficient query processing in spatial networks
    Huang, Xuegang
    Jensen, Christian S.
    Lu, Hua
    Saltenis, Simonas
    ADVANCES IN SPATIAL AND TEMPORAL DATABASES, PROCEEDINGS, 2007, 4605 : 93 - +
  • [35] Energy - Efficient and Fault Tolerant Spatial Query Processing in Wireless Sensor Networks
    Pushpam, Martina Maria P.
    Enigo, Felix V. S.
    2014 INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION CONTROL AND COMPUTING TECHNOLOGIES (ICACCCT), 2014, : 790 - 794
  • [36] Exploiting location-aware social networks for efficient spatial query processing
    Tang, Liang
    Chen, Haiquan
    Ku, Wei-Shinn
    Sun, Min-Te
    GEOINFORMATICA, 2017, 21 (01) : 33 - 55
  • [37] Efficient distance join query processing in distributed spatial data management systems
    Garcia-Garcia, Francisco
    Corral, Antonio
    Iribarne, Luis
    Vassilakopoulos, Michael
    Manolopoulos, Yannis
    INFORMATION SCIENCES, 2020, 512 : 985 - 1008
  • [38] Parallel spatial join query processing
    Liu, Yu
    Sun, Li
    Tian, Yong-Qing
    Shanghai Jiaotong Daxue Xuebao/Journal of Shanghai Jiaotong University, 2002, 36 (04): : 512 - 515
  • [39] Spatial query processing for high resolutions
    Kriegel, HP
    Pfeifle, M
    Pötke, M
    Seidl, T
    EIGHTH INTERNATIONAL CONFERENCE ON DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PROCEEDINGS, 2003, : 17 - 26
  • [40] Spatial query processing on distributed databases
    1600, Springer Science and Business Media Deutschland GmbH (20):